Data Scientist — Computer Vision & NLP
Blackstraw AI, Chennai
Production-grade Agentic AI, Computer Vision, and Document Intelligence Systems
- LLM & Agentic AI: Architected production LLM and agentic AI systems automating enterprise workflows across document intelligence, vulnerability research, and product pipelines, reducing manual effort by 91%.
- Search-Augmented LLM: Built search-augmented LLM frameworks integrating SERP and external knowledge for structured reasoning and automated attribute extraction.
- Computer Vision Pipelines: Designed scalable CV and embedding-based pipelines for retail classification and similarity search supporting 200K+ product labels.
- Advanced OCR Stack: Developed OCR and document intelligence stack (SegFormer, DeepLabV3+, ViT-STR, super-resolution), reducing manual document review by 85%.
- Document Understanding: Built layout-aware document pipelines improving structured field extraction and reading-order consistency in complex documents.
- PII Detection: Implemented automated PII detection and redaction using NLP and embedding similarity matching across documents and images.
- Multi-Agent Workflows: Designed CrewAI-based automated codebase migration (Oracle->MySQL, Java->Python) with validation stages.
- Client Engagement: Led demos and technical walkthroughs, translated business requirements into AI pipeline designs.