Job description Posted 28 July 2020

Machine Learning Engineer: Python, Azure, DevOps, Data

We are looking for a self-directed Machine Learning Data Engineer with strong coding skills to join our growing team. You’ll have gone through the full project lifecycle – and understand how design decisions play out in production. You may have worked in traditional data warehousing environments, but you are up to speed with more contemporary data architectures based on open source technologies. Data Engineer will work with our platform/software Engineers and ML Engineers and Data scientists on AI initiatives

Key Responsibilities:

  • Implement data flows to connect operational systems, data for analytics and Machine Learning (ML) systems
  • Document source-to-target mappings
  • Re-engineer manual data flows to enable scaling and repeatable use
  • Support the build of data streaming systems
  • Write ETL scripts and code to make sure the ETL process performs optimally
  • Develop Data pipelines that can be re-used


  • 10+ years of industry experience
  • Experience with varied types of data: tabular, graph, time-series, geospatial, image, etc.
  • Practical knowledge of:
  • Different types of database – relational; document; graph; columnar; key-value.
  • Large scale data processing platforms, typically based on Hadoop / Spark.
  • Business intelligence / analytics products or frameworks
  • Data visualisation frameworks
  • Experience of systems deployment and configuration for cloud platforms, including selection of relevant PaaS / SaaS offerings.
  • Knowledge of data integration technologies and ETL tools e.g. Talent, SAS, Datastage, Databricks
  • Ability to write good quality code in a language like Python,R or Scala, incorporating disciplines such as Test Driven Development and structured version control; familiarity with Python a bonus.
  • Strong analytical and problem-solving skills
  • Self-motivated and ability to be resilient
  • Good communication skills, both verbally and in writing
  • While you may not have an exclusively agile background, you strive to work to these principles.
  • Nice to have
  • Knowledge of distributed computing & information security. E.g Familiar with Security concepts on both an infrastructural and application level (SSL, secrets management, database encryption, etc.)
  • Knowledge of and/or experience with Continuous Integration / Deployment set-ups, and especially with any of the following – Azure DevOps

Role is initially working from Home

Additional information about the process

Role is initially working from Home

Join GSK’s vision to do more, feel better and live longer:

Who will I be working with?