Tesla | Senior or Staff Site Reliability Engineer (SRE), Manufacturing Systems | Fremont, CA
The Core Automation Services (CAS) team at Tesla is building applications to enable manufacturing, with an eye towards reliability, availability, scalability, speed and security. We're a diverse team composed of Controls Automation Engineers, Software Engineers, and engineers from various other disciplines who help facilitate automated manufacturing processes. As an SRE on the CAS team you'll be working with the infrastructure, systems and applications that act as the middleware layer between Programmable Logic Controllers (PLCs) and the outside world, such as databases, MES and other services.
Location: Fremont, CA
Responsibilities:
* Support the interim HMI/SCADA vendor application (Ignition from Inductive Automation)
* Build tooling around it, evaluate its usage, and help ensure its reliability, availability and security
* Design software and systems that enable automated manufacturing at Tesla
* Assist Software, Controls, Manufacturing and other types of Engineers with onboarding and integrating services into the Tesla technology stack
* Ensure best practices and observability of the service, such as metrics, logging, tracing, and alerting
* Automate configuration and deployment of services
* Consult on and design infrastructure, systems and application architecture
Apply at: https://www.tesla.com/careers/search/job/site-reliability-en...
=======================
Tesla | Database Site Reliability Engineer, Manufacturing Systems | Fremont, CA
As a Database SRE on the CAS team you'll be setting up and managing the databases, including MySQL, CockroachDB, FoundationDB, ClickHouse, and InfluxDB, that back the software and systems that enable manufacturing in our factories.
Location: Fremont, CA
Responsibilities:
* Evaluate current database deployments and make recommendations for how to improve their reliability, availability, scalability and security
* Design and implement automation for managing the deployment and upgrades of the databases
* Define Disaster Recovery and Business Continuity plans for the various database deployments
* Assist Software, Controls, Manufacturing and other types of engineers with using databases sustainably
* Ensure best practices and observability of the databases, such as metrics, logging, tracing, and alerting
* Consult on and design infrastructure, systems and application architecture
Requirements:
* Experience with running databases on bare-metal or VMs
* Expert skills in Linux and its administration
* Experience in a high-level language such as Go, Python and/or Java
* Understanding of the concepts of Observability and Infrastructure as Code
* Comfortable on an on-call rotation
* Comfortable doing live troubleshooting of issues on NOC bridges/outage calls
* Habitual documenter and spreader of knowledge
* Willing to mentor other team members and engineers with less database knowledge
* Strong bias for action vs endless planning, willing to get hands dirty and make mistakes sometimes
* 3+ years as DBA/SRE
Apply at: https://www.tesla.com/careers/search/job/database-site-relia...