Hi, I'm Vidit.

I'm a Data Scientist

I make impactful AI/ML based solutions that take things to the next level. I also create interesting self projects in my spare time and do problem solving on Leetcode. Let's connect!

Vidit Singh Negi | AI Developer

About.

Hey there! I'm Vidit Singh Negi, and I am a Data Scientist currently working at Clarivate. I Studied Computer Science(Btech) from Jaypee Institute of Information Technology, Noida.

I specialize in building AI/ML Solutions and Agentic Workflows, with a focus on development, optimization and Deployement. I'm passionate about creating clever and impactful models that provide great value in the industry.

I code in Python and C++, and have experience with Langchain/Langgraph Ecosystem, Pytorch, Tensorflow, Flask, FastAPI and various data processing libraries such as Panda, Numpy, OpenCV and more. I've also worked with services provided by AWS and Azure focusing on AI developement, like SageMaker, Bedrock, Guardrails, Step Functions, Lambda, SQS, and more. I Also have experience with Databases like Aurora Postgres, Redshift, MongoDB, and vector databases like Qdrant, Chroma and Pinecone.

When I'm not coding, I enjoy spending time making music and playing instruments. I believe that maintaining a healthy work-life balance is crucial for staying productive and motivated.

I'm always looking for new challenges and opportunities to learn and grow as a developer. If you're interested in working together or have an opportunity that might be a good fit for me, please feel free to reach out! 🔗

Used at work

PythonC++PytorchAWSLangchainLanggraphAWS SageMakerAWS BedrockMLflowLightningAccelerateWandbOpenCVTensorflowPandasNumpyScikit-LearnFlaskFastAPIDockerMongoDBHugging FaceStreamlitSelenium

Learning

TypeScriptCUDATritonKubernetes

Projects.

An image of the Flash Attention (Triton) project.

Flash Attention (Triton)

Pytorch - Triton - CUDA - Transformers

Attention Mechanism Acceleration using Triton
Learn more >

An image of the Multimodal Vision-Language (PaliGemma) project.

Multimodal Vision-Language (PaliGemma)

Pytorch - Flask - Vision Transformers - Gemma

A state-of-the-art model for visual-language understanding, transforming image-based question answering with deep contextual intelligence.
Learn more >

An image of the Chest Cancer Detection web-app project.

Chest Cancer Detection web-app

Tensorflow - MLflow - CI/CD - Docker - Flask - AWS

This web-app helps detect tumors that are not easy to locate in the medical industry.
Learn more >

An image of the Custom Article Q&A tool project.

Custom Article Q&A tool

Langchain - FAISS - Pinecone - Streamlit

A specialized tool designed for researchers, enabling them to extract information from online sources and engage in interactive queries for deeper insights.
Learn more >

An image of the Sign Language Detection project.

Sign Language Detection

Tensorflow - MediaPipe - OpenCV

This project was made with a target to classify the hand gestures in a sign language.
Learn more >

An image of the Abandoned-Object-Detection project.

Abandoned-Object-Detection

Tensorflow - Yolo - DeepSort - OpenCV

This project was built to detect all the abandoned or unattended objects (mainly bags) in crowded areas through cctv/surveillance footage.
Learn more >

An image of the Real time DeepFake Detection project.

Real time DeepFake Detection

Django - Tensorflow - OpenCV - LSTM/GRUs

This Project detects Deepfake videos generated by AI.
Learn more >

Experience.

Clarivate
May 2025 - Present
Associate Data Scientist
Noida

Build an end-to-end Text-to-SQL based Agentic Chatbot for Derwent Patent Analytics. Utilized Langchain/Langgraph, MLflow and AWS Services for developement. Used Promptfoo And Ragas for automated security assessment. Made multiple RAG based tools, secured the chatbot by using AWS bedrock Guardrails, integrated buffered streaming for smooth user experience and deployed on ECS Cluster. Worked with AI Classifier, a massive project on AWS, used Step Functions for orchestration, Sagemaker for BERT hierarchical training and Inference, also used Lambda and SQS to make the pipeline.

LangchainLanggraphFastAPIAWS SageMakerAWS BedrockPromptfooRagasMLflowTransformers
Monotype
Aug 2024 - May 2025
AI/ML Trainee
Noida

Owned many projects and collaborated across multiple teams. Researched, built, trained, optimized and deployed GenAI models including Diffusion models, GANs and Transformers, using Pytorch, FastAPI, docker, Wandb, AWS/Azure. Made Bert-Diffusion Model for font generation, reduced designing time by 30% for designers. - Patent. Built custom 3-staged diffusion model for Japanese fonts generating characters with >90% IOU (Improved by 15%).

PytorchFlaskFastAPIOpenCVDockerAccelerateLightningPandasNumpyAWS/Azure/RunpodDiffusionGANsTransformers
Polynomial.AI
June 2024 - July 2024
Python Developer Intern
Remote

Worked with building POS systems. Built REST-APIs with Flask, utilized MongoDB for data storage, did automation using Selenium, and trained ML models for classification. The automation pipeline saved many hours of manual sales report extraction, improving productivity by 25%. Made Flask APIs to handle requests, worked with MongoDB and AWS S3 to store the sales reports.

PythonFlaskMongoDB SeleniumAWS
GreenTech ITS LLP
Arp 2024 - May 2024
Computer Vision Intern
New Delhi

Worked on ANPR system and made Abandoned Object Detection System with object detection, OCR, YOLOv8 and YOLO-NAS, DeepSort, Background Subtraction, Depth Estimation, Implemented Kafka. Optimized Model inference speed using TensorRT and ONNX and made it 4x faster. Built Docker containers to deploy on toll plaza sites. Increased the ANPR accuracy from 85% to 93%.

TensorflowPytorchOCRYOLOTensorRTONNXKafkaDockerDepth EstimationBackground Subtraction

Contact.

Send me an email if you want to connect! You can also find me on Linkedin.

viditsn@gmail.com