recon/scripts
Ubuntu 5d618da2a4 Add wiki_index_wave3.py with parallel resolve
Wave 3 pipeline for processing 253K+ place types with NO wiki/wikidata
tags (US+CA only). Uses Gemini to resolve Wikipedia titles.

Key feature: resolve_wikipedia_titles() now uses ThreadPoolExecutor
with 5 parallel workers, improving throughput from ~14/min to ~75/min.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-04-30 21:37:51 +00:00
..
__init__.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
aa_download.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
aa_download_pass2.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
backup.sh Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
cleanup_outliers.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
domain_reenrich.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
domain_remap.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
migrate_domains.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
migrate_skill_level.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
rebuild_qdrant.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
reenrich_reference.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
repair_corrupted.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
validate.py Initial commit: RECON codebase baseline 2026-04-14 14:57:23 +00:00
wiki_index_wave2.py feat(wiki-index): add wave 2 pipeline for wikidata-only places 2026-04-29 19:50:43 +00:00
wiki_index_wave3.py Add wiki_index_wave3.py with parallel resolve 2026-04-30 21:37:51 +00:00