Research Interests

Machine Learning; Natural Language Processing; AI Alignment; Large Language Models; Human-Computer Interaction (HCI); Adversary Machine Learning; Interpretability

Experience

Research Assistant

Indian School of Business
Sep 2024 - Present

• Developing a custom NLP pipeline with Vision OCR, Text Classification and Token Classification models using Open LLMs.

Research Intern

CISPA Helmholtz Center for Information Security
Jun 2024 — Present

• Topic of Research: "Privacy Preserving Generation with Large Language Models"
• Researching on Private Large Language Models using various Differential Privacy Frameworks.
• Analyzing several pre-training/fine-tuning methods and model behavior on various downstream tasks.

Research Assistant

Indian School of Business
Aug 2023 — Jan 2024

• Developed NLP Classifier Models on a daily basis, achieving high accuracy rates exceeding 95%, demonstrating strong data analysis and modeling skills.
• Expertly finetuned large transformer models, attaining precision and reliability, resulting in improved model performance and efficiency.
• Created Transformer models for Sequence Classification, Sequence Generation, and Token Classification on a daily basis, showcasing consistent innovation and adaptability.
• Specialized in developing customized Named Entity Recognition (NER) solutions, optimizing data extraction and information retrieval processes.
• Proficiently worked with Big Data to analyze patterns and successfully deployed developed models, contributing to data-driven insights and decision-making.

AWS & Machine Learning Intern

F13 Technologies
Mar 2023 — Jun 2023

• Worked with AWS EC2, EBS, and S3 to host and develop websites.
• Successfully migrated non-AWS web applications to AWS infrastructure.
• Developed a sophisticated Content Recommendation system using AWS Personalize.
• Gained hands-on experience with over 50 AWS services during the training period.
• Proficient in utilizing tools like AWS Cost Calculator to optimize cloud resources.

Research Intern

National Institute of Technology Patna (NITP)
Jun 2022 — Aug 2022

• Topic of research: Agile motion of quadrupedal locomotion using Quad-SDK
• Testing Deep Learning Algorithm efficiency for better obstacle detection and increasing accuracy by 25%.
• Using an efficient control scheme to increase in terrain mapping accuracy by 12.7% when using the right number of contours.
• Global Planner’s code was optimized for a movement speed boost of 15%.
• Tools & Languages used: Linux, Git, ROS, Quad-SDK, Python, Catkin

Education

B. Tech — Computer Science & Engineering

D. Y. Patil International University
Oct 2021 — 2025

List of courses: Data Structures, Design & Analysis of Algorithms, Principles of Data Science, Intelligent Systems, Digital Signal Processing, Deep Neural Networks, High Performance Computing & Game Theory

Clubs & Co-Curriculars: TEDx Conferences, Google Development Students Club, Tech Cohorts and Hackathons.

Projects

Otaku Engine

May 2023

Otaku Engine is a content recommendation system for anime enthusiasts (weebs). It leverages machine learning techniques to provide personalized anime recommendations based on user interactions and ratings.

Twitter Tweet Classification Model

May 2023

Tweet Classification using Natural Language Processing. This NLP model classifies tweets as Disaster Tweets (1) or Non-Disaster Tweets (0). Model uses Logistic Regression(tf-idf), Naive Bayes(tf-idf), Logistic Regression to train the model and classify tweets.

iIMS — Intelligent Infrastructure Management System

Industry Project
Jun 2022 — Aug 2022

An AI system aims to solve the obstacles faced by universities/organizations in making optimal use of infrastructure using OpenCV and ML Frameworks.

AI Maze Solver

Feb 2022

AI based Maze Solver uses DFS or BFS Algorithm to encounter and solve maze's and helps you analyze which search problem has less path cost in similar scenarios.

Skills

Programming Languages

Python (w. RegEx), C++, C#, JavaScript, MySQL, MQL & Git

Tools & Technologies

Machine Learning (Transformers, TensorFlow & PyTorch), NLP, Computer Vision, Power BI & MongoDB

Cloud

AWS S3, EC2, Sagemaker, DynamoDB, Lambda and other popular AWS services.

Collaboration & Productivity

Asana, ClickUp, Notion, Jira, Miro & Deepnote

Additional Design Tools 

Adobe XD, Figma, Blender, Cinema 4D & Unreal Engine