Jeddit

u/DataScraper

Writes scripts to parse documents and create visualizations. Shares GitHub repos and data analysis.

11.2k
Karma
1y ago
Cake Day
2
Posts

Posts by u/DataScraper

j/CourtDocs
DS
u/DataScraper
2mo ago

I created a tool to search all released documents - open source

Got tired of manually searching PDFs so I built something. **What it does:** - Full-text search across all HOC releases - OCR for scanned documents - Date range filtering - Name entity extraction - Export results to CSV GitHub: [link] Built with Python, uses Tesseract for OCR and ElasticSearch for indexing. Currently indexed 4,287 documents. Pull requests welcome. Looking for help with: - Better date parsing (formats are inconsistent) - Handwriting recognition - UI improvements Running a public instance here: [link] - be patient, it's on a cheap server.
4 comments
j/FlightLogs
DS
u/DataScraper
11mo ago

Flight path visualization - 15 years of data mapped

Finally finished my visualization project. **What I built:** - Interactive map showing all documented flights - Filter by year, destination, passenger - Animation showing patterns over time - Heatmap of most frequent routes **Key findings:** - Clear seasonal patterns (winter = Caribbean) - Frequency increased 2001-2005 - Sharp drop after 2008 - Routes changed after conviction **Technical details:** - Python for data processing - D3.js for visualization - Data from all released FOIA logs Live demo: [link] Source code: [link] Let me know if you find errors or want features added.
3 comments