Shashank Guda

01 — About

Intro

With over 4 years of experience in data analytics and applied machine learning, I specialize in transforming complex data into meaningful insights that inform strategy and drive impact. My work spans various domains, supporting teams in solving business challenges through data.

I hold a Master of Science in Applied Data Science from Syracuse University, where I focused on data-driven problem solving, AI systems, and scalable analytical solutions. My background combines consulting, research, and product oriented work, enabling me to bridge the gap between data science and real world outcomes.

I'm particularly interested in how Gen AI and machine learning can be applied responsibly and effectively across industries from building smarter tools to enabling better decisions. I'm always open to opportunities where data, innovation, and impact intersect.

PythonSQLData EngineeringLLMs / RAGMachine LearningDatabricksSnowflakeAzurePower BIPySparkPrompt Engineering

Education

Institution	Degree	Year	GPA
Syracuse University	M.S. Applied Data Science	May 2025	3.97 / 4.0
SRM University	B.Tech. Electronics & Communication Engineering	May 2021	9.15 / 10

🏆

Graduate Student Excellence Award — Applied Data Science, Syracuse University

Awarded annually to one graduating student for academic excellence and research contributions. Recognized for impactful, innovation-driven work at the intersection of data science and real-world problem solving. Also recipient of a full tuition scholarship.

"Badges are not just symbols — they're proof of dedication to continuous growth."

Certifications & Badges

Ongoing commitment to staying current across tools, analytics, and AI. Each certification represents a new capability acquired.

View All Badges ↗

02 — Experience

Work

EY

From Oct 2025
To Present

Sr. Consultant — Data Engineering

Currently working at Ernst & Young as a Senior Data Engineering Consultant, contributing to enterprise-scale data transformation and analytics initiatives.

Inferenz.ai

From May 2024
To Aug 2024

Jr. AI/ML Engineer (Intern)

Contributed to scalable, production-ready AI applications focused on NLP and generative AI. Optimized conversational systems and integrated Retrieval-Augmented Generation (RAG) with enterprise-grade infrastructure.

AI Chatbot & RAG

Redesigned chatbot architecture using async API calls and parallel processing in Snowflake.
Applied context pruning and advanced prompt design to reduce token consumption.
Deployed a RAG pipeline combining OpenAI with a Snowflake-hosted vector store.

40%Response Time ↓

28%Token Usage ↓

25%Accuracy ↑

✓RAG in Prod

Tredence Inc.

From Jun 2021
To Jun 2023

Analytics Consultant ↑ promoted from Data Analyst

Grew from Data Analyst to Analytics Consultant, partnering with Unilever to deliver data-driven strategies improving market expansion, store performance, and pipeline scalability across 16 markets.

Data Analyst (Jun 2021 – Jan 2023)

Built the foundation for large-scale data pipeline development, dashboarding, and cross-functional stakeholder reporting.
Developed and maintained SQL-based data workflows for downstream analytics teams.

Analytics Consultant (Jan 2023 – Jun 2023)

Forecasted demand and identified optimal locations for 1,200 Unilever stores using geospatial analytics and Power BI — contributing $1M+ in annual revenue gains.
Designed scalable ETL pipelines in Databricks and PySpark, integrating 52+ CSV data sources.
Automated data validation reducing manual checks by 80%; built Power BI dashboards achieving 98% data coverage.
Reduced pipeline latency by 35% using Azure Data Factory and Databricks.

$1M+Revenue Gained

80%Manual QA ↓

35%Latency ↓

15%Profitability ↑

Cognizant

From Jan 2021
To Jun 2021

Data Engineer (Intern)

Contributed to internal tools through relational database management (PostgreSQL, MySQL) and responsive web interface development.

Campus Leadership

iSchool, SU

2023–2025

Recitation Lead — IST 195

Led and mentored 100+ undergraduates in "Information Technologies," delivering weekly sessions and bridging students to faculty.

SU

2023–2025

Board Member — University Conduct Board

Appointed to review student conduct cases and uphold institutional values of fairness, integrity, and accountability.

What Others Say

Shashank consistently went above and beyond as a recitation lead — delivering on time, enhancing the student experience, and contributing meaningfully to the class culture. A natural leader that any team would benefit from.

Jeff Rubin — SVP & Chief Digital Officer, Syracuse University

Shashank is a smart, curious, and hardworking student who consistently goes above and beyond. In our Generative AI class, he led his team in building an impressive chatbot and actively supported his peers — a true reflection of why he earned the Graduate Student Excellence Award.

Jeff Saltz — Professor, School of Information Studies, Syracuse University

Shashank is a brilliant, driven, and highly skilled data science consultant with a rare ability to turn complex ideas into impactful solutions. His work ethic, leadership, and collaborative mindset make him an asset to any team.

Scott Bryan — President & CEO, Macronomics Inc. & Advisor, E78 Partners

Shashank brought deep analytical thinking, technical expertise, and strong leadership to our data science team. He took initiative on high-impact projects, automated complex pipelines, and consistently delivered results under pressure.

Keval R Menon — Senior Manager (Analytics), Tredence Inc.

Shashank has a sharp analytical mind and a knack for solving complex problems. His solutions consistently exceeded expectations, and his collaborative nature made him a valuable asset to the team.

Rahul Kumar — Manager, Tredence Inc.

From technical execution to research passion, Shashank stood out across projects. His performance on the Unilever initiative and award-winning delivery reflect his excellence and commitment to impact.

Archana Mishra — Associate Manager, Tredence Inc.

1 / 6

03 — Academic

Projects

Project 01 — 🏆 Award Winner

iHoop Insights — NBA Injury Prediction

PythonRandom ForestEDASports Analytics

Winning team of the Orange Hoops Data Science Challenge. Predicts injury risk in basketball players analyzing performance and physiological metrics across 2,604 records. Random Forest model achieved AUC 0.90 and recall 0.98 for injured players.

Award Announcement Notebook GitHub Video

Project 02

Anomaly Detection in Metro Train APU

LSTM AutoencoderK-MeansPySparkTime Series

Detecting anomalies in the Auxiliary Power Unit of metro trains using 1.5M rows of sensor data to enable predictive maintenance. Anomalies peaked during 2–5 AM and aligned with recorded failure events.

Notebook GitHub

Project 03

COMPASS — University Recommendation System

OpenAI GPT-4ChromaDBStreamlitRAG

AI-powered university guidance system for international students. Personalized recommendations with interactive chat, application tracking, and resource generation based on field of study, budget, and location preferences.

Live App GitHub Video

Project 04

Tokyo Olympics in Data

Azure Data FactoryDatabricksSynapse AnalyticsPower BI

End-to-end cloud data pipeline analyzing the 2021 Tokyo Olympics dataset across the full Azure data stack — from ingestion to Power BI dashboards revealing athlete demographics and country performance insights.

GitHub Video

Project 05 — 🏆 Award Winner

Sage — First Aid Simplified

LangChainLLMsStreamlitNLP

Health chatbot diagnosing injuries and providing precautions based on user input. Leverages LangChain for natural language processing with real-time conversational health advice. Recognized with the Wolfram Award for innovation in health tech.

GitHub

Project 06

LEAP — Personalized Learning Path Generator

GROQ APIStreamlitPython

Generates personalized learning paths based on educational background, skills, and career goals. AI-driven plans include curated resources and estimated timelines — with downloadable .docx output.

Live App GitHub

Project 07

EqualEyes — Inclusive Image Captioning

ViT-GPT2BLIPCNNVision Transformers

Advances image captioning beyond simple object identification by combining image recognition and language modeling for rich, context-aware descriptions. Designed for accessibility with audio output for visually impaired users.

GitHub

Project 08

Austin Animal Center — Adoption Strategy Analysis

PythonGeospatialEDAData Viz

Comprehensive analysis of animal intakes, outcomes, and stray locations from Austin Animal Center. Identifies urban hotspots and data-driven recommendations for sterilization programs, adoption campaigns, and resource allocation.