The Pandora Papers’ 11.9 million records arrived from 14 different offshore services firms in a jumble of files and formats – even ink-on-paper – presenting a massive data-management challenge.
Machine learning can help journalists identify the files most likely to contain stories, for example by picking out information-rich tax returns from a massive trove of documents.
A new partnership between journalists and machine learning scientists aims to enhance the investigative reporting process. Here’s what we’ve learned so far.
Using algorithms and careful analysis, ICIJ screened millions of records and found cases where patients had died, but the incident hadn’t been reported as a death.