
Technical Lead, Site Reliability Engineering, Google Cloud
- Sydney, NSW
- Permanent
- Full-time
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 8 years of experience with software development in one or more programming languages.
- 3 years of experience designing, analyzing, and troubleshooting distributed systems.
- 3 years of experience leading projects.
- Experience working in computing, distributed systems, storage, or networking.
- Experience in designing, analyzing, and troubleshooting distributed systems.
- Ability to debug, optimize code, and to automate routine tasks.
- Excellent verbal and written communication and problem-solving skills.
- Engage in and improve the lifecycle of services from inception and design, to deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
- Practice sustainable incident response and blameless postmortems.