Debanjan Chaudhuri
How we processed Wikidata dumps over a Weekend using only shell scripting
Processing huge data can be daunting, especially if the data cannot be fit into memory. Many recent frameworks rely on the MapReduce…
4 min read · Apr 24, 2021

Debanjan Chaudhuri
Unsupervised Key-Value Extraction from Invoices and Contracts using Positional Knowledge
Extracting important information from differently formatted contracts and documents can be challenging. Here we will look into…
4 min read · Feb 21, 2021