mirror of
https://github.com/zvx-echo6/recon.git
synced 2026-05-20 06:34:40 +02:00
feat(wiki-index): add wave 2 pipeline for wikidata-only places
Processes places with wikidata but no wikipedia tag: - Batch resolve Q-IDs via Wikidata API (50/request) - Validate resolved titles against local ZIM - Generate summaries with Gemini API (3-4 sentences) - Circuit breaker: 50 consecutive 429s triggers 5min pause - Revalidate any remaining unvalidated entries Filters for US+CA places, skips existing wave 1 entries. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
parent
b250d0c257
commit
6be1e4cfa6
1 changed files with 1128 additions and 0 deletions
1128
scripts/wiki_index_wave2.py
Executable file
1128
scripts/wiki_index_wave2.py
Executable file
File diff suppressed because it is too large
Load diff
Loading…
Add table
Add a link
Reference in a new issue