Software Engineer, Data Movement and Orchestration

Stripe

Software Engineering
Toronto, ON, Canada
Posted on Dec 13, 2024

Who we are

About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

The Data Transformation and Movement team operates the critical infrastructure that powers near-realtime and batch data processing at Stripe. The team supports a variety of use cases, including Payment, Ledger, ML, Fraud Detection, Product Analytics, Regulatory Reporting, Financial Data Reconciliation, and externally facing products like Radar and Sigma. As an example of the scale, the team’s systems serve hundreds of teams and thousands of workflows, run 100,000+ task executions and O(billion) streaming transformations, and move terabytes of data, processing over 1 GB/second, every day. Our users inside Stripe include other engineering teams, Data Scientists, Sales & Operations, Finance, etc.

This role could be on any one of the following sub-teams:

Data Movement builds and operates a constellation of multi-region, high-scale ingestion systems that move data from all online sources into Iceberg with sub-minute latency. On the cusp of innovation, we're pushing the boundaries of open-source Iceberg and Spark for real-time ingestion.

Data Orchestration builds and operates the time-based and event-based orchestration infrastructure that powers and accelerates batch data pipelines.

Data Transformation builds and operates the transformation abstractions and infrastructure that support frictionless data development across the board, from sub-minute event data to enormous daily partitions - or even for-all-time snapshots.

Our team operates on a wide range of tech stacks including Kafka, Event Bus, Change Data Capture, Flink, Spark, Airflow, Hive MetaStore, Trino, Pinot, SQL, Python, Java, Scala, S3, and Iceberg.

What you’ll do

As a Software Engineer on our team, you will do the following:

  • Design, build, and maintain innovative next-generation or first-generation versions of key Data Platform products, with an emphasis on usability, reliability, security, and efficiency.
  • Design ergonomic APIs and abstractions that build a great customer experience for internal Stripes, which will in turn enhance the experience of millions of Stripe users.
  • Ensure operational excellence and enable a highly available & reliable Data Transformation & Movement platform across streaming and batch workloads.
  • Collaborate nimbly with high-visibility teams and their stakeholders to support their key initiatives - while building a robust platform that benefits all of Stripe in the long term.
  • Plan for the growth of Stripe’s infrastructure by unblocking, supporting, and communicating proactively with internal partners to achieve results.
  • Connect your work with improvements in the usability and reliability of Open Source Software (OSS) like Apache Airflow, Iceberg, and Spark, and contribute back to the OSS community.

Who you are

We’re looking for someone who:

Minimum requirements

  • Has a strong engineering background and an interest in data
  • Has experience operating or enabling large-scale, high-availability data pipelines, from design to execution and safe change management. Expertise in Spark, Flink, Airflow, Python, Java, SQL, and API design is a plus.
  • Has experience developing, maintaining, and debugging distributed systems built with open source tools
  • Has experience building infrastructure-as-a-product with a strong focus on users' needs
  • Has strong collaboration and communication skills, and can comfortably interact with both technical and non-technical participants.
  • Has the curiosity to continuously learn about new technologies and business processes.
  • Is energized by delivering effective, user-first solutions through creative problem-solving and collaboration.

Preferred qualifications

  • Has experience writing production-level code; expertise in Scala, Spark, Flink, Airflow, Python, Java, and SQL is a plus.
  • Has experience designing APIs or building developer platforms
  • Has experience optimizing the end-to-end performance of distributed systems
  • Has experience with scaling distributed systems in a rapidly moving environment
  • Has experience working with data pipelines
  • Genuinely enjoys innovation and has a deep interest in understanding how things work

This role is available either in an office or a remote location (typically, 35+ miles or 56+ km from a Stripe office).

Office-assigned Stripes spend at least 50% of the time in a given month in their local office or with users. This strikes a balance between bringing people together for in-person collaboration and learning from each other, while supporting flexibility about how to do this in a way that makes sense for individuals and their teams.

A remote location, in most cases, is defined as being 35 miles (56 kilometers) or more from one of our offices. While you would be welcome to come into the office for team/business meetings, on-sites, meet-ups, and events, our expectation is you would regularly work from home rather than a Stripe office. Stripe does not cover the cost of relocating to a remote location. We encourage you to apply for roles that match the location where you currently live or plan to live.

The annual US base salary range for this role is $150,500 - $269,200. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. This salary range may be inclusive of several career levels at Stripe and will be narrowed during the interview process based on a number of factors, including the candidate’s experience, qualifications, and location. Applicants interested in this role and who are not located in the US may request the annual salary range for their location during the interview process.

Additional benefits for this role may include: equity, company bonus or sales commissions/bonuses; 401(k) plan; medical, dental, and vision benefits; and wellness stipends.

Office locations

Seattle, Toronto, New York, South San Francisco HQ, or Chicago

Remote locations

Remote in Canada or the United States

Team

Data Platform

Job type

Full time