We are developing a SaaS product that simplifies financial planning and analysis of cloud billing data for large enterprises with complex cloud spending requirements. We're looking for a data analyst to become our expert in cloud billing data accuracy, diving deep into complex datasets to validate and ensure the integrity of our data processing systems.
This is a unique learning opportunity where you'll develop deep expertise in cloud billing, which informs decisions worth millions of dollars or more. You'll work with large datasets (tables with hundreds of columns and hundreds of millions of rows) and become the person who helps decide what comprehensive cloud billing data looks like.
Write complex SQL queries to validate data accuracy across our processing pipelines
Investigate and fix data quality issues in our SQL processing systems
Develop expertise in cloud billing data, particularly AWS Cost and Usage Reports (CUR): famously inconsistent and counter-intuitive data that requires deep understanding
Work with ClickHouse, Parquet files, and S3 to analyze and validate large-scale datasets
Collaborate with engineering to identify and resolve data pipeline issues
Build monitoring queries and data quality checks using Airflow
Document data patterns, edge cases, and validation procedures
Intermediate to advanced SQL proficiency: you're comfortable with CTEs, window functions, complex joins, and performance considerations
If your idea of SQL is "SELECT * and piping it into pandas for analysis," this won't be a good fit
Some Python experience (helpful, but expert-level skill is not required)
Strong analytical mindset with attention to detail: you enjoy digging into data inconsistencies
Ability to deliver results in hours instead of days
Comfortable learning complex technical domains from scratch
Background from bootcamps, self-taught experience, or academic projects welcome
Interest in industrial engineering, supply chains, or other big systems is a plus
Experience with ClickHouse, Parquet, or columnar databases
Familiarity with AWS services and billing concepts
Previous experience working with large-scale datasets
This is a learning opportunity for the specialized domain of cloud financial data: it's more abstract than the typical data in tech and characterizes the systems underneath. You'll work directly with our head of engineering and have the chance to grow your career toward engineering roles or become a domain expert in cloud cost management.
AWS
Python + Flask
React + TypeScript
PostgreSQL
ClickHouse
Airflow
We are a small and growing team (fewer than 10 people!), which means you get the opportunity to be on the ground floor of building the product and company. Our founders also founded The Duckbill Group, and they bring a wealth of domain expertise and deep industry and customer connections in cloud cost management to the product. Our customers are among the biggest cloud spenders in the world, which means the scale and complexity of the data challenges we solve are truly at the cutting edge. We're currently in semi-stealth mode while we focus on building the initial product.
We work together in the office in San Francisco three days per week, so you must be located in the SF Bay Area and willing to work in the office on a regular basis.