Job Details

AI Ops Data Scientist

Gaithersburg, Maryland, United States
Company

AstraZeneca is a global, innovation-driven biopharmaceutical business that focuses on the discovery, development and commercialization of prescription medicines for some of the world's most serious diseases. But we're more than one of the world's leading pharmaceutical companies. At AstraZeneca, we're proud to have a unique workplace culture that inspires innovation and collaboration. Here, employees are empowered to express diverse perspectives and are made to feel valued, energized and rewarded for their ideas and creativity.

Role

We are looking for a data scientist to join our new AI Ops platform team in Science IT. The ideal candidate will have industry experience working in a range of different cloud environments where they devised and deployed large-scale production infrastructure and platforms for data science. The position will involve taking these skills and applying them to some of the most exciting data & prediction problems in drug discovery.

The successful candidate will be part of a new, close-knit team of deeply technical experts and together have the chance to create tools that will advance the standard of healthcare, improving the lives of millions of patients across the globe. Our data science environments will support major AI initiatives such as clinical trial data analysis, knowledge graphs, imaging & omics for our therapy areas. You will also have responsibility to help provide the frameworks for data scientists to develop scalable machine learning and predictive models with our growing data science community, in a safe and robust manner.

As a strong software leader and an expert in building complex systems, you will be responsible for inventing how we use technology, machine learning, and data to enable the productivity of AstraZeneca. You will help envision, build, deploy and develop our next generation of data engines and tools at scale. You will be bridging the gap between science and engineering and functioning with deep expertise in both worlds.

Key Accountabilities

- Liaise with R data scientists to understand their challenges and work with them to help productionise models and algorithms.
- Be part of the development roadmap to build and operationalise our data science environment, platforms and tooling.
- Support any external opportunities through close partnership and engagement, such as the Benevolent.AI collaboration.
- Deployment of systems, applications and tooling for data science on cloud environments.
- Understanding of the necessary guardrails required for different use cases and data sensitivities.
- Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU).
- Provide the necessary infrastructure and platform to support the deployment and monitoring of ML solutions in production.
- Optimizing solutions for performance and scalability.
- Liaise with the Data Engineering team to ensure that the platform and the solutions deployed therein benefit from an optimised and scalable data flow between source systems and analytical models.
- Implementing custom machine learning code and developing benchmarking capabilities to monitor drift of any analyses over time.
- Understanding of the latest AI webservices and data science tools, from DataBricks to citizen data science tools like Dataiku, C3.AI and Domino.
- Experience working on regulatory data would be helpful but not essential.
- Liaise with other teams to enhance our technological stack, to enable the adoption of the latest advances in Data Processing and AI.
- Being an active member of the Data Science team, you will benefit from, and contribute to, our expanding bank of Data Science algorithms and work efficiently with our data science infrastructure.
- Testing and assessing the quality of new tools.
- Team recruitment, training provision and coaching.

Candidate Knowledge, Skills and Experience

- BSc in Computer Science or related quantitative field, or MSc/Ph.D degree in Computer Science or related quantitative field.
- More than 2 years of experience and demonstrable deep technical skills in one or more of the following areas: machine learning, recommendation systems, pattern recognition, natural language processing or computer vision.
- Experience managing an enterprise platform and service, handling new customer demand and feature requests.
- Strong software coding skills, with proficiency in Python and Scala preferred.
- Significant experience with AWS cloud environments, working knowledge of Google and Azure platforms. Knowledge of Kubernetes, S3, EC2, Sagemaker, Athena, RDS and Glue is essential. Certification in appropriate areas will be viewed favourably.
- Experience with best practice of data transport and storage within cloud systems.
- Experience building large scale data processing pipelines, e.g. Hadoop/Spark and SQL.
- Experience provisioning computational resources in a variety of environments.
- Experience with containers and microservice architectures, e.g. Kubernetes, Docker and serverless approaches.
- Experience with automation strategies, e.g. CI/CD, gitops.
- Use of Data Science modelling tools, e.g. R, Python, SAS and Data Science notebooks (e.g. Jupyter).
- Creative, collaborative, & product focused.
- Ability to just get things done.

