RedwoodCityRecruiter Since 2001
the smart solution for Redwood City jobs

Director of Site Reliability Engineering

Company: Oracle
Location: Redwood City
Posted on: September 15, 2020

Job Description:

As SRE Director, you'll define how to use latest technologies to identify and optimize the operational efficiency. You will be responsible for the infrastructure and reliability of PaaS services striving to continuously improve operations. You will work with a team pushing the boundaries of a scalable, self-healing, autonomous platform built on Kubernetes, Docker, Prometheus, and Grafana. What you will do Serve as a strategic engineering leader and technical expert for the SRE team, creating an environment where managers and engineers are empowered and supported to do their best work Define vision, strategy and roadmaps for SRE and operations teams Drive and own the measuring of SLA/SLO/SLI/uptime and ensure organization is meeting set goals Collaborate with Engineering teams to understand deployment practices and processes and work towards iteratively improving the release pipeline to ensure a highly resilient deployment strategy, ideally with zero downtime Build capabilities to ensure 24/7 platform support Lead efforts to implement multi-region Cloud Infrastructure capabilities Collaborate with engineering leadership across the organization to ensure alignment on infrastructure needs in support of product roadmaps and deliverables Identify and execute on cost strategy and cost projection for platform cloud hosting and observability tooling Ensure best engineering practices through automation, infrastructure as code, robust system monitoring, alerting, auto scaling, self healing, etc... Ensure the timely releases of the team's projects to production and remove any impediments as they arise Analyze system failures and develop rapid response processes to ensure such failures do not reoccur -- Work cross-functionally with product development, product management, program Management and Oracle Cloud infrastructure operations teams Predict and provide notice of potential system vulnerabilities for current and future solutions and implementations. Provide specific recommendations and guidance to address such vulnerabilities Analyze, build and maintain all automation tools and processes to ensure the highest standards of reliability and robustness Fully understand our customers' service needs and ensure we meet these needs Participate in 24x7 site reliability rotations and escalation workflows Preferred Skills and Experience 5+ years of experience in site reliability and technical operations leadership with experience building large and geographically disperse infrastructure supporting business critical cloud services 5+ years of people management and team leadership experience including headcount planning and developing strong and motivated teams Architect-level understanding of one or more of the major public cloud services (AWS & Azure), using them to effectively design secure and scalable services Strong knowledge of modern container platforms like Kubernetes, Docker, etc. Experience building and supporting critical services with a focus on automation, availability and performance Experience working on large-scale cloud based (Azure & AWS is preferred) distributed systems including multi-tiered architecture and micro-services Experience working with monitoring tools (Prometheus, Grafana, NewRelic, ELK stack, etc) and Database technologies (Oracle DB, Elastic Search, Redis and Kafka preferred) Experience with 24/7 site monitoring and ability to own uptime & performance SLA's Has built and managed globally distributed teams to operate a large-scale SaaS platform Strong foundational knowledge and prior experience with implementing SRE practices within a team Excellent people manager who can be a hands-on player/coach. You have experience with agile planning, deployment, integration, test/validation, and configuration across multiple DevOps tooling platforms Experience integrating new technology and platforms with mission critical legacy systems Ability to stitch best of breed tools and practices together to solve business problems, embracing the security (confidentiality, integrity, availability) and complexity challenges that may arise Possesses an automate everything mindset, from CI/CD based deployment to team collaboration Experience with Agile and DevOps methodologies Effective verbal, written communication and interpersonal skills including interfacing with customers on a professional and cooperative level Able to develop and maintain strong relationships with Oracle customers BS degree in Computer Science or related degree or equivalent experience No matter your role in our team, you will find yourself in an exciting and challenging environment where every person is empowered to show initiative, be outspoken, and be proactive and not reactive. Oracle is dedicated to the continual growth and development of its staff, striving constantly to strengthen our expertise as well as develop new skills. Our team is spread all around the world in four continents - we provide a full range of opportunities and challenges to apply your kills and grow your career in this new and exciting arena.

Keywords: Oracle, Redwood City , Director of Site Reliability Engineering, Engineering , Redwood City, California

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Other Engineering Jobs


Cloud Analytics Engineer
Description: Cloud Analytics Engineer The Modernize Analytics team is a group of highly skilled Cloud Analytics Architects and Engineers who will work with customers to develop modern analytic architectures and develop (more...)
Company: Oracle
Location: San Francisco
Posted on: 09/18/2020

Salesforce QA Automation Engineer
Description: Our Banking client in San Francisco is looking for a QA Automation Engineer to join the team on a Contract to Hire basis. MUST HAVE EXPERIENCE WORKING IN A SALESFORCE ENVIRONMENT Only considering local (more...)
Company: Clarity Technology Partners
Location: San Francisco
Posted on: 09/18/2020

Lab Automation and Process Development Engineer
Description: Invitae is a healthcare technology company that leverages genetic information to empower doctors and patients to make informed medical decisions. Our development team works on a variety of projects ranging (more...)
Company: Invitae
Location: San Francisco
Posted on: 09/18/2020


Data Engineer - Analytics
Description: Public social network tech company in San Francisco is hiring a Data Engineer with 1-3 years of professional experience in a data engineering role.
Company: Workbridge Associates
Location: San Francisco
Posted on: 09/18/2020

Senior Chief Engineer
Description: JOB TITLE br br Senior Chief Engineer br br JOB DESCRIPTION SUMMARY br br Senior Chief Engineer is responsible for the effective daily leadership of his/her staff, managing the engineering (more...)
Company: C&W Services
Location: San Francisco
Posted on: 09/18/2020

DevOps Engineer
Description: We are witnessing a massive shift of consumer presence from offline to online. With it, there is a need for technologies that enable online businesses to thrive. Bolt is at the center of this universe (more...)
Company: Bolt
Location: San Francisco
Posted on: 09/18/2020

Senior Front End Engineer
Description: This San Jose based startup is looking to add a Senior Front End Engineer to their growing team. The responsibilities of this role include being the expert on all things about web technologies, building (more...)
Company: Motion Recruitment
Location: San Jose
Posted on: 09/18/2020

Information Security Engineer
Description: Blackstone Talent Group, an award-winning technology consulting and talent agency is seeking another Sr. Info Security Engineer Contract to join our team at our client's site in Hayward,
Company: Blackstone Talent Group
Location: San Francisco
Posted on: 09/18/2020

Mechanical Engineer
Description: San Jose, California based Mountz Inc. provides torque tools and solutions to a variety of industries like aerospace, automotive, electronics, energy, medical, packaging and more. Using our products makes (more...)
Company: Mountz Inc.
Location: San Francisco
Posted on: 09/18/2020

Embedded RF Design Verification Test Engineer for an Embedded Chip Company
Description: JOB DESCRIPTION:Embedded RF Design Verification Test Engineer for an Embedded Chip Company in San Jose, CAThis person will execute tests on Client's world class lineup of WiFi combo modules targeted at (more...)
Company: OSI Engineering
Location: San Jose
Posted on: 09/18/2020

Log In or Create An Account

Get the latest California jobs by following @recnetCA on Twitter!

Redwood City RSS job feeds