Help us improve Datashare, our document analysis tool

Datashare has been helping reporters, citizen sleuths and data nerds analyze documents for more than a year. We’d like your help to make it even better.

Datashare has been helping journalists, citizen sleuths and general data nerds (like us!) analyze documents for more than a year now.

Thanks to feedback received via our public Github, emails and during conferences, we’ve made a number of improvements to the software over the past 12 months.

While Datashare is still in beta, our team now needs a more detailed analysis of how users are interacting with the current version, and what they think of it.

If you’re a Datashare user, we’d love you to fill this survey. It shouldn’t take more than 15 minutes. We’d be delighted to read your opinion and continue making Datashare even more user friendly:

powered by Typeform

If you don’t already have the latest version, you can (and should!) download it for free – or you can browse the online demo version.

In the latest version, you will find the following features:

  • A lighter version of Datashare – The default version is now easier to install and it can be used on more machines.
  • Document indexing – You can automatically analyze your documents in various formats (PDFs, images, emails, spreadsheets, presentations, etc.) and search all of them in Datashare.
  • Named entity finding – Datashare can automatically find names of people, organizations, locations and email addresses in your documents.
  • Search with operators and Regex – Datashare helps you make precise searches thanks to operators such as AND, OR, NOT, as well as allowing for queries that use wildcards, fuzziness, proximity searches, boosting operators, and regular expressions. You can explore your documents in tables, lists or in a grid view.
  • Filter documents – The list of filters now includes stars, tags, recommended by, file types, creation dates, languages, people, organizations, locations, paths, extraction level, indexing dates.
  • Batch search of list of queries in documents – Instead of searching for a list of queries one by one, just upload a list of search terms (e.g. a list of local elected officials or known subsidiaries of a company) and get the results in a tabular format.
  • Star, tag and recommend – To make it easier to navigate and organize documents you’re interested in, you can star and tag them. The new ‘Recommend’ button helps you work in collaboration with partners (if you installed Datashare on a server), by allowing users to flag documents of interest for each other.
  • Keyboard shortcuts – You can save time and use a list of keyboard shortcuts.
  • Insights – You can count and visualize the number of documents by creation date.
  • Collaborative version on a server – Datashare can be used by an organization or a group of people who want to work together on the same documents. We have documented the server mode here.
  • Four languages – Datashare is now available in English, Spanish, French and Japanese. You’re always welcome to help us internationalize Datashare by contributing to translations through Crowdin.

Follow the latest news with the hashtag #ICIJDatashare on Twitter.

Isabel dos Santos and Sindika Dokolo

Isabel dos Santos ordered to return to Angola $500 million in shares ‘tainted by illegality’

Aug 02, 2021
FinCEN Files

Lessons from award-winning FinCEN Files and Luanda Leaks investigations

Jul 23, 2021
European Parliament and EU flag

EU to propose watchdog to tackle anti-money laundering failures exposed by FinCEN Files

Jul 16, 2021
Protesters in London outside the Chinese Embassy

As global pressure over human rights abuses in Xinjiang picks up, China remains defiant 

Jul 15, 2021

On the decline since Panama Papers, Malta punished for dirty money reputation

Jul 08, 2021
Isabel dos Santos and Sindika Dokolo

Dutch court sides with report calling dos Santos-linked energy deal an ‘act of corruption’

Jun 28, 2021
ICIJ is dedicated to ensuring all reports we publish are accurate. If you believe you have found an inaccuracy let us know.