JapanAI AutomationManufacturing

Inada Manufacturing — Quote turnaround: 6 hours to 48 seconds

Inada Manufacturing produces precision massage chairs for global markets. Each custom configuration quote required a 6-hour manual process. Our founding engineer built QuoteForge — an AI pipeline that returns a complete quote in 48 seconds.

Gemini 2.5 FlashClaude 3.5 SonnetNode.jsTypeScriptMongoDBRedisRailway

48s

Quote turnaround (was 6 hours)

450×

Speed improvement

~99%

Accuracy rate on structured quotes

The problem

Inada Manufacturing produces high-precision massage chairs sold into hotel, medical, and retail markets globally. Each product line supports hundreds of configuration variables — motor specifications, upholstery options, localization requirements, logistics zones, and custom branding.

A quote for a custom configuration required a sales engineer to manually cross-reference three internal pricing spreadsheets, a logistics rate table, a custom options matrix, and exchange rate feeds. The process took six hours per inquiry. At peak inquiry volume, this created a 3–5 day backlog.

Inada needed a system that could ingest incoming specification documents — often PDFs or structured emails from distributors — and return a complete, accurate quote without human intervention.

The architecture

Inbound ingestion   →  REST API / email webhook
Document parsing    →  Gemini 2.5 Flash (vision + OCR)
Spec extraction     →  Structured JSON (product code, qty, options)
Pricing resolution  →  Claude 3.5 Sonnet (rule application + edge cases)
Quote assembly      →  Node.js + Handlebars template
Output              →  PDF quote + JSON payload → Inada ERP webhook
Queue + retry       →  Redis BullMQ
Audit log           →  MongoDB (every decision traceable)

The pipeline uses Gemini 2.5 Flash for document understanding — its vision capability handles non-standard PDF layouts, hand-annotated spec sheets, and mixed-language inputs. Claude 3.5 Sonnet handles the reasoning layer: applying Inada's pricing rules, resolving ambiguous option combinations, and flagging edge cases for human review rather than guessing.

Every decision in the pipeline is logged to MongoDB with the model's confidence scores and the input state — giving Inada a complete audit trail for every quote, whether or not a human reviewed it.

The outcome

QuoteForge went live in Q3 2024. Quote turnaround dropped from an average of six hours to 48 seconds. Backlog cleared within two weeks of deployment. The sales team reports handling three times the inquiry volume with the same headcount.

Edge cases — configurations the model flags with low confidence — are routed to a human review queue. This accounts for approximately 4% of inquiries and takes 20–30 minutes (a reduction from 6 hours) because the model pre-fills everything it can.

What we'd do differently next time

The initial version used a single model for both extraction and reasoning. Splitting these into two discrete model calls — one fast/cheap for extraction, one capable for reasoning — reduced latency by 30% and cost per quote by 45%.

We'd also build the human review queue before the first deployment rather than retrofitting it. The initial assumption was that confidence scores alone would be sufficient signal — they're not. A structured review interface should be part of v1.

Want this for your business?

If your team is running a manual workflow that happens more than 50 times a month, there's a good chance we can automate the bulk of it.

Book a Discovery Call