cv
Basics
Name | Rishabh Shah |
rishabh.shah@duke.edu |
Work
- 2024 June - Present
LLM Intern Co-op
Nokia Bell Labs
- Developed an AI Analyst leveraging Llama 3.2 8B for scalable Generative AI applications, enhancing financial document processing by 30%. Integrated multi-agent RAG chatbots with MongoDB for real-time, personalized insights across 10+ million documents, aligning with reliability.
- Optimized fine-tuned LLMs with QLoRA, enabling tailored responses for dynamic financial scenarios and improving accuracy by 25%. Incorporated real-time data scraping and retrieval mechanisms to facilitate precise, query-based insights for investment and charity data, improving decision-making accuracy.
- 2023 October - Present
Graduate Research Assistant - Bergin Group
Duke University
- Analyzed data from remote sensors in Kenya, developed a data pipeline, and created calibration methods to ensure consistent sensor readings in the same environment
- Working towards investigating the impact of pollution on human health using Oura ring data, and analyzing factors such as SpO2, stress levels, and heart rate
- 2023 October - 2024 September
Duke Innovation Co-Lab Student Developer
Duke University
- Leading the development of the Chatbot module for the extension project of Nurse Hackathon
- Working to integrate LLMs like LLama, Falcon, and Mistral into low-latency systems, aiming to enhance their effectiveness and optimize for resource-constrained applications.
- 2020 September - 2023 August
Data Scientist
IBM
- Contributed to platform deployment and reporting for the Global Azure Team, utilizing Machine Learning and Azure Cloud expertise. Specialized in IBM's Content Intelligence platform for document intelligence
- Developed a Table detection Model(Computer Vision) and Similarity Search for Automatic Document Content Validation & processing ~100K Documents leading to 50% cost savings ($200-300 K per year) in the Manual Audit. Used Python, BERT, and Camelot for the same
- Developed a module for document meta-data extraction and hierarchy generation of connecting documents
- Enhanced IBM's Content Intelligence platform by developing document classification and custom key-value pair extraction modules, while optimizing end-to-end document processing
- Deployed end-to-end CI platform for clients on Azure Kubernetes Service using Terraform & Jenkins.
- 2020 June - 2023 July
Data Scientist
Prescience Decision Solutions
- Contributed to the development of a PDF document search module for an intelligent search platform ,leveraging ElasticSearch and applying retrieval techniques such as TF-IDF and BM25
- Used Transformers like BERT, XLnet for passage rankings
- 2020 January - 2023 June
Data Scientist Intern
IBM
- Made rule-based classification model, LSTM model, etc., for classification of different types of clauses
- Proposed and built two clause segregation approaches: Combined Random Forests with concrete features and other one utilized computer vision techniques.
Education
-
2023 August - 2024 December Durham, North Carolina, USA
Masters
Duke University
Artificial Intelligence
- Sourcing Data for Analytics, Modeling Process & Algorithms, Reinforcement Learning, Deep Learning Applications, Generative AI, Operationalizing AI (MLOps)
-
2016 August - 2020 August Rajasthan, India
Bachelors
NIIT University
Computer Science with minors in Artificial Intelligence
- Machine Learning, Deep Learning, Artificial Intelligence, Computer Vision, Natural Language Processing, Operating Systems, Algorithms and Data Structures, C++, Python, Information Retrieval