Rishabh Kumar

Building AI systems that run in production.

I build AI systems that run in production. Currently building autonomous email VAs processing real dealership workflows with sub-500ms hybrid RAG, real-time voice pipelines with Deepgram/ElevenLabs, and multi-tenant SaaS on AWS/GCP. I ship latency-sensitive AI systems from backend inference to production deployment.

EmailGitHubLinkedInX

Projects

Remalt - AI Workflow Builder200+ users

Visual AI workflow builder. Drag-and-drop 13 node types to compose multi-step AI pipelines without writing code.

Next.js 15SupabaseOpenRouterMastra@xyflow/react
CogniDB - Natural Language to SQL200+ stars

Open source NL-to-SQL library. Natural language database queries across MySQL, PostgreSQL, MongoDB with semantic caching. 40% inference cost reduction.

PythonMySQLPostgreSQLMongoDBNLP
Vibr - Text-to-Speech Platform

Multi-tenant TTS platform. Custom voice libraries across 13 categories, real-time waveform visualization, usage-based billing.

Next.jstRPCChatterbox TTSPostgreSQLCloudflare R2Clerk
ThreatModel-MCP - AI Security Assessment

Fine-tuned BERT on MITRE ATT&CK at 89% accuracy. GNN for attack path prediction. Deployed on AWS with auto-scaling.

PyTorchBERTAWS SageMakerMITRE ATT&CKGNN
OneTicket - Multilingual Event Booking AssistantTop 5 / 800 teams

Multilingual event booking assistant. GPT-4o via LangChain and LangGraph with dynamic DB tools and Twilio SMS. Smart India Hackathon.

GPT-4oLangChainLangGraphTwilio SMSTypeScript
Gmail & Calendar AI Assistant

Multi-agent system for Gmail and Google Calendar. Handles chained commands like 'find a meeting time in this email and schedule it.'

Mastra FrameworkGoogle Workspace APIsOAuthMulti-Agent Systems
FreshOut - Conversational E-Commerce

Context-aware e-commerce with AI command interpretation. Cross-tab state sync via dual-layer persistence.

Next.jsTypeScriptAI IntegrationIndexedDB

Experience

Connectivity CX2025 Dec — Present

AI Engineer, Chennai

Built Milo, a production email VA for UK automotive dealerships. Autonomous email processing with sub-500ms hybrid RAG. Multi-tenant Qdrant knowledge base isolation.

Hotelzify2024 Sep — 2025 Oct

AI Intern, Bengaluru

Multi-agent voice AI system orchestrating 5 specialized agents. Real-time Deepgram STT + ElevenLabs TTS on Plivo/Twilio. Qdrant with 1M+ embeddings at 98% retrieval accuracy.

Marki2024 Jan — 2024 Aug

Full Stack Engineer (ML Systems), Los Angeles, USA

Recommendation engine with transformer embeddings. Real-time inference API serving 1M+ predictions daily at 45ms P50. Apache Airflow ML pipeline on 4 GPUs.

Skills

PyTorchLangChainLangGraphLlamaIndexTransformersJAXvLLMDeepgram STTElevenLabs TTSRAGQdrantPineconeMulti-Agent SystemsLoRA/QLoRAFlash AttentionPythonTypeScriptFastAPINext.jsRedisPostgreSQLMongoDBDockerKubernetesAWSGCP

Education

Vellore Institute of Technology20222026

Bachelor of Technology in Computer Science and Engineering