Document type · reports

Reports Word documents

41,543 documents across 67 languages and 9 topics. See /classification for what this label covers.

41,543documents
67languages
9topics

Documents classified as long-form reports presenting findings or analysis. Examples include annual reports, research papers, case studies, white papers, assessments, and evaluations.

Useful for: long-document summarization, citation extraction, financial-statement parsing, tabular-data benchmarks, retrieval-augmented Q&A over reports.

TopicCount
Government 12,466
Healthcare 8,962
Education 6,792
Environment 4,576
Finance 3,260
Technology 1,943
General 1,460
Legal / Judicial 1,108
Nonprofit 976
LangCountShare
en 18,061 46.1%
zh 5,024 12.8%
cs 3,386 8.6%
ru 3,258 8.3%
es 2,125 5.4%
fr 1,038 2.6%
+ 61 more

Share is computed against the top 20 languages for this type (39,172 docs), matching what the API returns. A handful of documents fall outside the top 20 or have no detected language.

ID Filename Topic Lang Conf
f3b45293bf4f ser1.2016.docx Government ru 0.98
31dbb23d9bc9 січень -квітень - автозбереження (1).docx Government uk 0.98
51a8a5ca05ee ser 2015.docx Government ru 0.98
5520bbf1d604 2016_Annual_Building_Report.docx Government en 0.98
4495bef63769 Довідка_СЕР_січень_жовтень.docx Government uk 0.98

ID column shows the first 12 characters of the SHA-256 content hash; the full hash is the stable reference. Real public-web filenames vary widely: descriptive, numeric, or URL-fragment shaped.

# All reports documents
curl "https://api.docxcorp.us/manifest?type=reports" -o reports-manifest.txt

# High-confidence English subset
curl "https://api.docxcorp.us/manifest?type=reports&lang=en&min_confidence=0.8"

See /download for full access patterns.

All typesAll topics/classification