Data Engineering Architect

Job Description

2,000,000 patient-years of data and one of the largest and most diverse datasets in the pharma industry, collected over the past two decades, are waiting for you to unlock their true potential! At Novartis, we believe that Machine Learning and Artificial Intelligence, combined with the wealth of our data and internal scientific expertise, can transform the life sciences industry and change the way we develop medicines today.

A key program to make this a reality is data42, which will bring together all relevant data from across research and development at Novartis and prepare it for data science and AI. The program will significantly accelerate the generation of data-driven insights to unlock the next breakthrough in innovative medicines for patients.

As a member of the Data Engineering team within NIBR Informatics, you will be dedicated to data42 and will collaborate with data scientists, informaticians and other engineers to implement world-class data-centric solutions for drug discovery. You will research and leverage new and cutting-edge technologies and adapt to new scientific concepts and problem domains. You will define and advocate for FAIR (Findable, Accessible, Interoperable, Reusable) data practices and educate others on how best to implement them. You will work within a team to develop and apply creative and scalable solutions to complex technical problems, and in doing so you will help us to reimagine medicine.

Your responsibilities include but are not limited to:

•Designing and implementing a federated data processing and analytics platform
•Building advanced distributed analytics workflows with Spark
•Using Scala/Spark to integrate, manage and analyze multiple terabytes of data

As most of these projects push the limits of current hardware and software stacks, you will need a thorough understanding of computer science fundamentals and a desire to encourage innovation.

The Novartis Group of Companies are Equal Opportunity Employers and take pride in maintaining a diverse environment. We do not discriminate in recruitment, hiring, training, promotion or any other employment practices for reasons of race, color, religion, gender, national origin, age, sexual orientation, marital or veteran status, disability, or any other legally protected status.

Minimum requirements

What you’ll bring to the role:

•An advanced degree in Computer Science or similar, or equivalent experience
•Scala and Spark expertise
•Significant experience building large scale software systems and leading teams in the design and implementation of creative solutions to difficult computational problems (with emphasis on performance and near real-time data analytics)
•Excellent interpersonal skills with the ability to communicate effectively in a matrix environment
•Demonstrated technical leadership skills and a thorough understanding of agile software development processes
•Experience with distributed data processing and management systems

Desirable requirements:

•Experience with AWS cloud technologies and stack
•Knowledge of polyglot persistence across NoSQL, RDBMS and NewSQL databases
•Expertise in RDF, formal logic, or other advanced modeling tools
•Familiarity with data mining techniques
•Experience in life sciences is advantageous but not essential

The position will be filled at a level commensurate with experience.

Why consider Novartis?

750 million. That’s how many lives our products touch. And while we’re proud of that fact, in this world of digital and technological transformation, we must ask ourselves this: how can we continue to improve and extend even more people’s lives?

We believe the answers are found when curious, courageous and collaborative people like you are brought together in an inspiring environment. Where you’re given opportunities to explore the power of digital and data. Where you’re empowered to risk failure by taking smart risks, and where you’re surrounded by people who share your determination to tackle the world’s toughest medical challenges.

We are Novartis. Join us and help us reimagine medicine.
Cambridge, MA
Information Technology
Full Time