DocumentCloud is an index of primary source documents and a tool for annotating, organizing and publishing them on the web.

Documents are contributed by journalists, researchers and archivists. If your organization does document-driven investigations, we’d love to have you join us. Using the DocumentCloud workspace, you can upload documents, share them with your team, and conduct structured searches and analyses based on extracted entities — the people, places, and organizations mentioned in the text. As a contributor, you can download a lightweight document viewer to embed documents on your web site.

Take a look at how news organizations are beginning to use DocumentCloud to complement their reporting or review our help pages to get a sense of how DocumentCloud works.

At the moment, we're in the middle of our initial beta release. If you're a news organization or nonprofit that would like to join, please get in touch.

As we develop DocumentCloud, we're packaging up the components that support it, and releasing them as open-source projects. Our releases so far include the majority of our document processing code.

Latest Updates