Summary
As an MTech Intern, you will work closely with data engineers and domain SMEs on real-world data, pipelines, and AI prototypes, gaining hands-on exposure to data engineering, machine learning, and emerging GenAI use cases.
This internship is designed to provide strong industry grounding, not just academic experimentation.
About the Role
Key Responsibilities
- Assist in data preparation, transformation, and analysis using SQL and Python/PySpark
- Write and optimize SQL queries to extract and analyze data from structured datasets
- Support development of data pipelines and feature engineering workflows
- Assist in building and testing Agentic AI modules under guidance (including GenAI use cases)
- Work with senior team members to prototype Agentic AI or GenAI-driven solutions
- Perform data quality checks and basic validation of datasets
- Document learnings, code, and analysis in a clear and structured manner
- Collaborate with team members following Agile / sprint-based execution
Required Qualifications
- Currently pursuing MTech / MS in:
- Computer Science with Data Science/AI as subjects
- Or a closely related field
Core Technical Skills
- SQL – writing queries, joins, aggregations
- Python – data processing, basic scripting
- Exposure to PySpark or Spark SQL (coursework or projects is sufficient)
- Basic understanding of:
- Data structures
- Databases
- Data processing concepts
- Fundamentals of Machine Learning (regression, classification, basic evaluation metrics)
- Awareness of Large Language Models (LLMs), Generative AI concepts, Agentic AI
- Any academic or personal project involving:
- ML models
- NLP
- Chatbots
- AI prototypes
Hands-on depth is not expected—curiosity and willingness to learn is.
Preferred
- Exposure to cloud platforms (AWS / Azure / GCP) through coursework or labs
- Experience with Git / version control
- Experience working on college projects, hackathons, or research work
- Strong analytical thinking and problem-solving mindset
Soft Skills & Mindset
- Strong learning orientation and curiosity
- Ability to break down problems and ask the right questions
- Clear communication of ideas and findings
- Comfortable working in a team-based environment
What the Intern Will Gain
- Hands-on exposure to real enterprise data and use cases
- Practical experience in SQL, Python, PySpark, and data pipelines
- Introduction to AI/ML and GenAI applications in industry
- Mentorship from experienced Data Scientists and Engineers
Why Novartis: Helping people with disease and their families takes more than innovative science. It takes a community of smart, passionate people like you. Collaborating, supporting and inspiring each other. Combining to achieve breakthroughs that change patients’ lives. Ready to create a brighter future together? https://www.novartis.com/about/strategy/people-and-culture
Benefits and Rewards: Learn about all the ways we’ll help you thrive personally and professionally.
Read our handbook (PDF 30 MB)