yellatp

Pavan Yellathakota

Hey DataGeeks, I'm Pavan 👋
I am a Data Explorer passionate about diving into every field where data is prominent. My journey spans the full spectrum from Market Research & Supply Chain Analytics to designing Databases & ETL Pipelines. I extend this expertise into AI, developing Machine Learning and Deep Learning models with a specific focus on BERT-based Text & Semantic Analysis.

Data Scientist | ML Engineer | Product Analytics

📍 Location: Seattle, WA, USA
📞 Mobile: +1 (929) 278-4589
✉️ Email: pavan.yellathakota.ds@gmail.com
LinkedIn: https://linkedin.com/in/yellatp
GitHub: https://github.com/yellatp



👨‍💻 Professional Summary

Data Scientist with 3+ years of experience developing predictive models and automated data infrastructure. Proven track record in improving search precision, designing quantitative research pipelines, and implementing data-driven solutions for marketing and product growth. Skilled in bridging the gap between data engineering and stakeholder decision-making through statistical validation, A/B testing, and interactive analytics.


🛠️ Technical Skills

| Domain | Stack |
|---|---|
| Languages & Databases | Python, Pandas, NumPy, Scikit-learn, SQL, PySpark, PostgreSQL, R |
| AWS Cloud Data | AWS S3, Athena, Glue, SageMaker, Lambda, Redshift |
| ML Frameworks | XGBoost, Hugging Face, OpenAI, NLTK, spaCy |
| Tools & Visualization | Tableau, QuickSight, Power BI, Excel, Git |


💼 Professional Experience

Alphonso AI, backed by Shipley Center for Innovation | Founding ML Engineer

Potsdam, NY | Jul 2025 – Present

Key Technologies Used
Python, FastAPI, PostgreSQL, Docker, pgvector, HuggingFace, Gemini, Vertex AI

Student Managed Investment Fund, Clarkson University | Graduate Quantitative Researcher

Potsdam, NY | Sep 2024 – Apr 2025

Key Technologies Used
Python, BERT, HuggingFace, Vertex AI, Pandas

HAVK Mladost (Elite Athletics Club) | Graduate Data Science Consultant

Potsdam, NY | Oct 2023 – May 2025

Key Technologies Used
AWS S3, Glue, PySpark, FastAPI, Python

eAppSys Limited | Business Data Analyst

Hyderabad, India | Jul 2022 – Dec 2022

Key Technologies Used
Python, Prophet, SARIMAX, Oracle OCI

Kantar GDC India | Data Analyst

Pune, India | Sep 2021 – May 2022

Key Technologies Used
Python, PySpark, Pandas, SQL


🏗️ Some Notable Projects

| Project | Description | Tech Stack |
|:---:|:---|:---:|
| **[Text-Analysis-using-NLP-LDA](https://github.com/yellatp/Text-Analysis-using-NLP-LDA)** | NLP project focused on topic modeling and text analysis. | NLP, LDA, Python |
| **[Detoxify Telugu](https://github.com/yellatp/detoxify-telugu)** | Toxic comment classification for Telugu language. | NLP, Deep Learning |
| **[Synthetic Data Generator](https://github.com/yellatp/Synthetic-Data-Generator)** | Tool to generate synthetic datasets for testing/training. | Python, Data Gen |
| **[BingeMax Recommendation Engine](https://github.com/yellatp/BingeMax-Personalized-Movie-Recommendation-Engine)** | Personalized movie recommendation system. | ML, Recommender Systems |
| **[Fintech Sales GAP Analysis](https://github.com/yellatp/Fintech-Sales-GAP-Analysis)** | Analyzing sales gaps in fintech products. | Data Analysis, Visualization |
| **[KonnectR Fullstack App](https://github.com/yellatp/KonnectR_flask_fullstack_app)** | Fullstack web application built with Flask. | Flask, Python, Web |
| **[PreOwned Cars Price Prediction](https://github.com/yellatp/PreOwnedCars_Price_Prediction_Model_V1.0)** | ML model to predict prices of used cars. | Regression, Scikit-learn |
| **[Fake News Classifier](https://github.com/yellatp/Fake-News-Classifier)** | Identification of fake news articles using ML. | Classification, NLP |
| **[Content Strategy Netflix](https://github.com/yellatp/Content-Strategy-Analysis-NETFLIX)** | Data-driven strategy analysis for Netflix content. | Data Science, EDA |
| **[Supply Chain Analysis](https://github.com/yellatp/Supply-Chain-Analysis-Python)** | Optimization and analysis of supply chain data. | Python, Logistics |
| **[GenZ Career Preferences](https://github.com/yellatp/GenZ-Career-Preferences-Report)** | Analysis report on GenZ career trends. | Research, Analytics |
| **[Website A/B Testing](https://github.com/yellatp/Website-AB-Testing-Python)** | Statistical analysis of A/B test results. | Statistics, Python |



Last Updated: 2026 by PAVAN YELLATHAKOTA