LLM-powered extraction kept silently corrupting my database. Here's what I built to fix it. tags: node, llm, opensource, api (opens in new tab)
I've been building an extraction API for the past month. The use case is specific — reading informal WhatsApp orders in mixed Hindi/English/Malayalam and turning them into structured records for Indian distributors. Something like: "bhai 50 bags opc 53 cement calicut tuesday urgent" needs to become: { "product": "OPC Grade 53 Cement", "quantity": 50, "unit": "bags", "location": "Calicut", "delivery_date": "Tuesday", "urgency": "urgent" } Regex dies on the first message. Template matching dies...
Read the original article