Data Platform Developer
About GHGSat
GHGSat offers greenhouse gas detection, measurement, and monitoring services to industrial and government customers around the world. We use our own satellites and aircraft sensors, combined with third-party data, to help industrial emitters better understand, control, and reduce their emissions. GHGSat’s capability is unique: The company provides high-resolution, local measurements of atmospheric methane and carbon dioxide concentrations from space, enabling us to detect greenhouse gas emitters and visualize and quantify their emissions.
Job Description:
Are you seeking a fast-paced environment where your work can make a meaningful impact? Look no further! As a Data Platform Developer, you will play a pivotal role in designing, building, and maintaining data and machine learning infrastructure, with a strong emphasis on automation, efficiency, and scalability.
In this role, you will work on a small team to design, build, and deploy infrastructure and tooling for internal stakeholders across the organization. The team's primary focus is to help internal stakeholders better leverage data to gain new insights, train machine learning models, and develop new products and tools. This role will focus primarily on providing infrastructure to team members; however, in this fast-paced environment there will be many opportunities to engage in more traditional data and machine learning engineering work.
Key Responsibilities:
- Design and implement secure and scalable platform solutions and data pipelines to support internal data science and engineering efforts.
- Implement tooling and APIs to improve data accessibility and streamline R&D, business intelligence, and product development processes across the organization.
- Work closely with data scientists, analysts, and other engineering teams to understand requirements and ensure the availability and reliability of services.
- Develop common practices and architectures to be shared by multiple teams for MLOps and data pipeline tooling.
- Monitor and manage the health and performance of data infrastructure.
- Maintain comprehensive documentation of architecture and APIs.
- Proactively troubleshoot and optimize data products to enhance performance and integrate best practices throughout the data development lifecycle.
Requirements
Required Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field. An advanced degree or relevant certifications are a plus.
- 5+ years of hands-on experience in a data engineering, software engineering, or DevOps-related role.
- Demonstrated experience in architecting robust and scalable systems for live production environments.
- Strong problem-solving skills with the ability to dive into new problems and domains to troubleshoot and resolve complex issues.
- Proven experience using Kubernetes to host applications and parallelize data processing pipelines.
- Proven expertise in using Terraform to automate the deployment of scalable and reusable infrastructure.
- Proficiency in managing the deployment of applications in a cloud-based environment using CI/CD.
- Proficiency in using Python or similar programming languages to solve problems with data and maintain data infrastructure.
- Experience maintaining production databases (SQL, NoSQL, data lake).
- Excellent communication skills, with the ability to collaborate effectively with cross-functional teams and write clear and detailed documentation.
Nice to have:
- Experience in supporting data ETL workflows and collaborating closely with Analytics Engineers and Data Scientists.
- Experience building robust, production-grade data pipelines. Experience with Airflow and dbt is a plus.
- Experience working with geospatial data (PostGIS, STAC).
- Relevant certifications in cloud platforms (AWS, Azure, GCP) or Terraform. AWS stack is preferred.
- Experience with the deployment, monitoring, and maintenance of production machine learning pipelines (MLOps).
- Experience securing applications in the cloud to meet compliance targets.
Benefits:
- Competitive salary + stock options for all full-time employees
- Full comprehensive benefits
- Statutory leave + paid time off
- Flexible hybrid work environment
GHGSat offers a creative and highly motivating work environment. We offer competitive salaries, health and social benefits including flex-time and continuing development. We are an open and transparent company, and we are committed to preserving a diverse work environment.