- Data Science and Engineering
- Poland - Krakow Office
We are on an exciting data journey, aiming to transform our business through the Connectivity of our machines, and the MyDyson app.
About the role
The Lead Data Engineer will be responsible for developing, constructing, testing, and maintaining data pipelines, ensuring optimal data delivery architecture through ongoing projects. This role requires a deep understanding of data architecture, data engineering, data analysis, and hands-on experience in orchestrating data flows.
You will play an important part in maintaining and improving our Connected Intelligence Platform, Connected Control Tower, and Data Science toolset to ensure that the answers to crucial questions can either be self-served or provided through modelling and investigation of the data.
Lead and take complete responsibility for data projects from start to finish.
Create and maintain data pipelines to extract, transform, and load data from various sources into a central repository for analysis;
Work closely with data scientists, analysts, and business stakeholders to understand their data needs and implement solutions that enable efficient data analysis.
Ensure data is accurate, complete, consistent, and accessible by performing data quality checks, tracking data sequence, and data security measures;
Leverage cloud computing platforms such as GCP (Google Cloud Platform) to deploy and scale analytics solutions, as well as scalability, security, and cost optimization of data infrastructure;
Optimize the performance of data storage and retrieval systems, data pipelines, and machine learning models to ensure they can handle the volume and complexity of data;
Using programming languages and tools such as Python and SQL, to implement data pipelines, data models, and machine learning models;
Keep up to date with the latest technologies and trends in data engineering, machine learning, and analytics, and continually seek opportunities to improve the organization’s data infrastructure and analytics capabilities.
Build and maintain relationships with external partners, such as Google, to ensure we fully use their toolset and remain at the forefront of data technology.
Proven experience as a Senior Data Engineer, Data & Analytics Engineer, or similar role.
Strong proficiency in at least one major programming language (e.g., Java, Scala, Python) and comfortable working with SQL to implement data pipelines, data models, and machine learning models.
Strong background in at least one of the following: distributed data processing or software engineering of data services.
Experience with database technologies (relational, NoSQL) and dimensional data modeling techniques and best practices.
Experience with big data tools and data pipeline orchestration tools (e.g., Apache Airflow), experience with dbt is a plus.
Experience with version control systems like Git.
Experience with cloud computing platforms like AWS, GCP, or Azure to deploy and scale their solutions.
Experienced with industry-standard visualization and Business Intelligence tools like Tableau, Looker, Power BI, etc.
Familiar with Infrastructure as a Code principles.
Performance related bonus
Pension scheme with competitive employer contributions
Free fruit delivered for office staff, free coffee and tea
Cafeteria Benefit – wellness programme, cinema tickets, Multisport card etc.
Medical: Employee cover + opportunity to buy additional cover for family
Employee Assistance Program for employee and dependents
Dyson is an equal opportunity employer. We know that great minds don’t think alike, and it takes all kinds of minds to make our technology so unique. We welcome applications from all backgrounds and employment decisions are made without regard to race, colour, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other any other dimension of diversity.