American Stories
A Billion Scale Dataset of Structured Texts and Layouts from U.S. Public Domain Newspapers
A Billion Scale Dataset of Structured Texts and Layouts from U.S. Public Domain Newspapers
A Massive Scale Semantic Similarity Dataset of Historical Newspaper Headlines
A Python Package for Deep Learning-Assisted String Matching
A Python Library for Document Image Analysis
A Unified Python Package for Record Linkage with Transformer Models