Tooling benchmarks

A compact readout of OSINT-oriented tasks. Browser scores require completing a search workflow and returning target evidence; output volume is not treated as quality.

3Categories
11Cases
50Evidence probes

Category readout

Coverage by task category.

Category Cases Coverage Evidence
Browser Automation450%partial
PDF Extraction / OCR398%complete
Scraping453%partial

Browser Automation

Open the task brief to inspect sources, search flows, and expected evidence.

Task brief4 investigative cases

Companies House registry search to company filing history

Load the UK Companies House register, type OPENAI into the search form, open OPENAI UK LTD, and navigate to its filing history so a reporter can inspect downloadable filings.

URL
https://find-and-update.company-information.service.gov.uk/
Search
OPENAI
must contain: OPENAI UK LTDmust contain: 14367667must contain: Filing historymust contain: View PDFmust contain: Confirmation statement

OpenSanctions name screening search

Load OpenSanctions, search for a named person, and return the sanctions/PEP evidence for the matching entity.

URL
https://www.opensanctions.org/search/
Search
Roman Abramovich
must contain: Roman Abramovichmust contain: Personmust contain: PEPmust contain: Sanctionedmust contain: Russia

OpenStreetMap place search for a municipality

Load OpenStreetMap, search for Zug Switzerland, and return the Nominatim result context and relation links so a reporter can verify the place and map target.

URL
https://www.openstreetmap.org/
Search
Zug Switzerland
must contain: Search Resultsmust contain: Nominatimmust contain: Zug, Schweiz/Suisse/Svizzera/Svizramust contain: relation/1686447must contain: relation/1683151

Wikidata entity search for structured company identity

Search Wikidata for OpenAI, open the matching entity record, and return structured identity evidence useful for company/person disambiguation.

URL
https://www.wikidata.org/wiki/Special:Search
Search
OpenAI
must contain: OpenAImust contain: Q21708200must contain: American artificial intelligence research organizationmust contain: openai.commust contain: instance of
dev-browser
100%
Playwright script
100%
browser-use terminal
0%
browser-harness
0%
ToolEvidenceCoverageSpeedSpendNote
dev-browsercomplete100%9,578 msn/aexecuted
Playwright scriptcomplete100%6,591 msn/aexecuted
browser-use terminalmissed0%4,042 msn/ano target evidence returned
browser-harnessmissed0%30 msn/aCDP keepalive ping timeout

PDF Extraction / OCR

Open the task brief to inspect sources, search flows, and expected evidence.

Task brief3 investigative cases

GIJN Citizen Investigations guide with OSINT topic coverage

Parse a known investigative PDF from local cache, with a source URL available for remote parsers.

URL
https://gijn.org/wp-content/uploads/2019/09/Citizens_Investigation_Guide.pdf
Page range
0-5
must contain: Citizen Investigationsmust contain: Bellingcatmust contain: Finding out who owns corporationsmust contain: Digging up property recordsmin chars: 50000
LlamaParse
100%
Poppler pdftotext baseline
100%
Docling
100%
Fireparse / Firecrawl document parse
100%
Extend Parse 2.0
100%
Surya OCR
83%
LangExtract over extracted text
83%
ToolEvidenceCoverageSpeedSpendNote
LlamaParsecomplete100%130,751 msn/aexecuted
Poppler pdftotext baselinecomplete100%125 msn/aexecuted
Doclingcomplete100%27,450 msn/aexecuted
Fireparse / Firecrawl document parsecomplete100%18,407 msn/aexecuted
Extend Parse 2.0complete100%31,995 ms426 creditsexecuted
Surya OCRcomplete83%194,577 msn/aexecuted
LangExtract over extracted textcomplete83%68,114 msn/aexecuted

Scraping

Open the task brief to inspect sources, search flows, and expected evidence.

Task brief4 investigative cases

Basel-Stadt council protocol listing

Retrieve a civic-protocol listing and preserve links or labels that would let a reporter reach meeting records.

URL
https://grosserrat.bs.ch/ratsbetrieb/ratsprotokolle?all=1
must contain: Grosser Ratmust contain: Ratsprotokollemust contain: PDFmin chars: 1000

Zurich Gemeinderat protocol listing

Retrieve a multilingual civic-protocol page and preserve meeting/protocol evidence.

URL
https://www.gemeinderat-zuerich.ch/protokolle
must contain: Gemeinderatmust contain: Protokollmust contain: Sitzungmin chars: 1000
Firecrawl scrape
75%
Obscura headless browser
46%
Exa contents API
38%
ToolEvidenceCoverageSpeedSpendNote
Firecrawl scrapecomplete75%1,242 msn/aexecuted
Obscura headless browserpartial46%1,032 msn/ano target evidence returned
Exa contents APIpartial38%4,022 ms$0.0020no Exa contents results returned