Software Engineer Data Quality Framework (PySpark / Databricks)

Heerlen, Limburg · APG · bemiddeld door 2 brokers

Tarief
max. €105 /uur
Looptijd
3 maanden
Locatie
Heerlen, Limburg
Remote
Hybride
Opdrachtgever
APG
ZZP
Toegestaan

Omschrijving

Functieomschrijving: We are looking for a Senior Data Quality Framework Developer to help build and evolve our data quality framework in Databricks using PySpark. You combine strong software engineering with platform thinking and help translate data quality needs into scalable, maintainable solutions. Dit ga je doen: Within our Data Capabilities team at APG Asset Management, you will develop and maintain reusable data quality framework components in Databricks. Your focus is on rule execution, result handling, contract-driven validation and the operational setup needed to run the framework reliably across environments. As a senior developer, you are responsible for, among other things: Design, develop and improve the PySpark-based data quality framework and reusable validation components in Databricks; Build and maintain rule execution, result handling and contract-driven validation capabilities; Own engineering quality through refactoring, testing, CI/CD, packaging and release practices; Support Databricks operations including jobs, alerts, triggers, notifications and multi-environment setup; Work with stakeholders to refine requirements and contribute to documentation, standards and team coaching. Our team is a multi-disciplinary team responsible for the end-to-end delivery of solutions. We work in an agile setting and combine engineering, functional and platform expertise to deliver reliable capabilities that support business processes and data products. We aim for flexible, high-performing teams with strong collaboration and T-shaped skills. Your work location will be Amsterdam or Heerlen. Dit breng je mee: As a Senior Data Quality Framework Developer, you combine strong engineering skills with a quality mindset and platform ownership. You know how to build reusable solutions, work across development and operations, and collaborate effectively with both technical and non-technical stakeholders. Additionally we ask for: At least five years of relevant experience in software engineering, data platform engineering or data engineering; Strong proficiency in Python and PySpark, with a good understanding of the Spark framework and optimization practices; Experience developing and operating Databricks jobs in a production environment; Good understanding of software design principles, including object-oriented programming, modular design, maintainable code structures and package management; Experience with testing, CI/CD, deployment practices and development on a major cloud provider (Azure preferred) in multi-environment setups; Background in Computer Science, Software Engineering, Data Engineering or a related field; Strong communication skills in English and the ability to work with a broad stakeholder group. You work independently, are proactive, and take ownership by picking up tasks on your own. Nice to haves: Experience with data quality checks, data quality frameworks, rule engines or metadata-driven validation patterns; Experience building o

Bemiddeld door 2 brokers

Deze opdracht wordt door meerdere brokers aangeboden. Klik door naar de versie van jouw voorkeur.

Reageer op deze opdracht via APG

Je wordt doorgestuurd naar de website van APG. ZZPdock is geen tussenpartij.

Bekijk meer opdrachten

Vergelijkbare opdrachten