
Data Engineer

Summary

Salary: Confidential
Team: Data Science
Location: Shanghai, China

About us

Our Data Team

As the ‘hub’ team delivering the data, technology and community provision that enables Dyson’s global data and analytics capabilities, the Global Data Function (GDF) has end-to-end responsibility for data, from foundations (DQ, MDM) to management (data platforms, integrations) to value realisation (analytics enablement and delivery).

 

This multi-disciplinary, global team provides round-the-clock development and operations, including product and project management, community enablement, governance, data architecture, data engineering, data science and analytics expertise.

 

Involved in every aspect of Dyson’s global business – from finance to product development, manufacturing to owner experience – data is enjoying record-breaking investment and a mandate for 2021 and beyond to deliver solutions that generate tangible business value.

About the role

As a Data Engineer you will be part of a pioneering team building the analytics platform for the Greater China region, and you will be responsible for developing, industrialising and optimising Dyson's big data platform running on AWS. You will ingest new data sources, write data pipelines as code, and transform, enrich and publish data using the most efficient methods.

Working with data from across Dyson’s global data estate, you will understand the best way to serve data at scale to a global audience of analysts. You will work closely with the data architects, data scientists and data product managers on the team to ensure that we are building integrated, performant solutions.

Ideally you will have a software engineering mindset, be able to leverage CI/CD, and apply critical thinking to the work you undertake. The role would suit candidates looking to move from traditional big data stacks such as Spark and Hadoop to cloud-native technologies (Dataflow, Redshift, Docker/Kubernetes, Pub/Sub, Cloud Functions). Candidates with strong software development skills who wish to make the leap to working with data at scale will also be considered.

Responsibilities include:

  • Designing and building end-to-end data engineering solutions on the AWS platform (Redshift)

  • Being a proactive member of a DevOps/Agile scrum-driven team, always looking for ways to tune and optimise all aspects of work delivered on the platform

  • Aligning work to both core development standards and architectural principles

About you

Essential:

  • Detailed understanding of the AWS data landscape

  • Strong programming skills in languages such as Python/Java/Scala, including building, testing and releasing code into production

  • Strong SQL skills and experience working with relational/columnar databases (e.g. Redshift, SQL Server, Postgres, Oracle, Presto, Hive, BigQuery)

  • Knowledge of data modelling techniques and integration patterns

  • Practical experience writing data analytics pipelines

  • Experience integrating/interfacing with REST APIs / Web Services

  • Experience handling data securely

  • Experience with DevOps software delivery and CI/CD processes

  • A willingness to learn and find solutions to complex problems

  • Familiarity with commercial data sources used in the Greater China Region

     

Desirable:

  • Experience migrating from on-premises data stores to cloud solutions

  • Experience designing and building real-time/near-real-time solutions using streaming technologies (e.g. Dataflow/Apache Beam, Flink, Spark Streaming)

  • Hands-on experience with cloud environments (AWS and GCP preferred)

  • Building APIs and apps using Python/JavaScript or an alternative language

  • Practical experience with traditional big data stacks (e.g. Spark, Flink, HBase, Flume, Impala, Hive)

  • Experience with non-relational database solutions (e.g. Bigtable, MongoDB, DynamoDB, HBase, Elasticsearch)

  • Experience with AWS Data Pipeline, Azure Data Factory or Google Cloud Dataflow

  • Working with containerisation technologies (e.g. Docker, Kubernetes)

  • Experience working with data warehouse solutions, including extracting and processing data using a variety of programming languages, tools and techniques (e.g. AWS Data Pipeline, SSIS, Azure Data Factory, T-SQL, PL/SQL, Talend, Matillion, NiFi)

Benefits


Interview guidance

We are following government guidelines regarding COVID-19. At this time, all interviews will be conducted via video or telephone. We’re taking these precautionary measures to protect both employee and candidate wellbeing. Our Talent Acquisition team will work with you and provide further information as appropriate.