Compute: The compute pod builds and manages our Kubernetes-based platform (https://medium.com/airtable-eng/managing-kubernetes-resources-across-multiple-clusters-ec4b3a58005c) that supports every service at Airtable, including all new AI services such as vector databases, AI evals store, and document extraction and understanding services
- Region level disaster recovery, and bringing up compute platform from 0->1 in a new region
- Building custom Kubernetes operators for reliably managing some of our most critical workloads
- Data Infrastructure: The Data Infrastructure team’s mission is to enable data-driven decision making at Airtable by providing reliable, self-service, high-performance analytics infrastructure
- Observability: The Observability team's mission is to empower Airtable's engineering team with effective monitoring and debugging tools
- We provide actionable insights into errors and crashes, and improve performance through enhanced visibility
- You’ll build tools that are in-use by nearly every engineer at Airtable in our logging, metrics, and tracing pipelines
- Storage: The Storage team’s mission is to accelerate product development at Airtable by providing scalable, reliable, and easy-to-use storage abstractions
- You will own all aspects of building, running, and improving these systems, from the underlying infrastructure all the way to the developer-facing code abstractions
- This will support improved performance in our secondary regions (EU and Australia) as well as other customer-driven projects
- Proactively identify and lead significant improvements to Airtable’s infrastructure, working across teams and product areas to maximize business and engineering impact
- Work on systems-level problems in a complex design space where scalability, efficiency, reliability, and security really matter
- Build clean, reusable, and maintainable abstractions that will be used by Airtable’s engineers for years to come
Take full ownership of components of Airtable’s infrastructure, including responsibility for reliability, performance, efficiency, and observability of our production environment