Taming AI Rate Limits at RefundIQ
A fintech SaaS platform processing 80,000+ emails per user was hitting silent API failures, unpredictable sync times of 4+ hours, and inflated AI costs. This is how the entire Gemini API usage layer was redesigned — from an unbounded parallel fire-and-forget model to a controlled, resilient, cost-optimized pipeline.
NestJS
Gemini 2.5 Flash
Vertex AI
BullMQ / Redis
Node.js
TypeScript
MongoDB
