Senior Staff High Performance Computing Engineer
Company: Guardant Health
Location: Redwood City
Posted on: June 23, 2022
Job Description:
Company Description:
Guardant Health is a
leading precision oncology company focused on helping conquer
cancer globally through use of its proprietary blood tests, vast
data sets and advanced analytics. The Guardant Health Oncology
Platform leverages capabilities to drive commercial adoption,
improve patient clinical outcomes and lower healthcare costs across
all stages of the cancer care continuum. Guardant Health has
launched liquid biopsy-based Guardant360, Guardant360 CDx and
GuardantOMNI tests for advanced stage cancer patients. These
tests fuel development of its LUNAR program, which aims to address
the needs of early stage cancer patients with neoadjuvant and
adjuvant treatment selection, cancer survivors with surveillance,
asymptomatic individuals eligible for cancer screening and
individuals at a higher risk for developing cancer with early
detection.
Job Description:
Guardant's HPC team builds and operates
the computational technology backbone of the company. This includes
scalable data storage that holds PBs of genomics data, high
performance compute clusters running a custom bioinformatics
pipeline in production and R&D environments, and the software
infrastructure that hosts an ecosystem of services for internal
data processing and external data integration. To facilitate
Guardant Health's fast growth in the next few years, the HPC team
is looking for a strong technical engineer who can help maintain
and grow the HPC infrastructure during its aggressive
expansion, while working with corporate IT, SQA and DevOps/SRE
teams. This role can be worked remotely part of the time, but at a
minimum requires a very hands-on, on-premises presence when on rotation.
In this role, you will primarily:
- Help manage multiple HPC clusters and cluster file
systems.
- Identify, select, qualify, and operationalize the most efficient
and effective server hardware.
- Help research, develop, and implement the next generation HPC
solution.
- Troubleshoot the production system stack down to the source code
level (e.g., shell scripts, Python, and others).
- Collaborate regularly with internal teams and key stakeholders
to understand requirements, cost-optimize solutions, and build
knowledge of major workloads and technologies.
- Maintain, monitor, and support the infrastructure
environment and/or facilities.
- Use and maintain enhanced production monitoring and
additional capabilities.
- Review and validate hardware failure data as part of
qualification process.
- Support improvements for increased system reliability and
performance.
- Support, in a senior role, multiple systems or applications of
medium to high complexity (complexity defined by size, technology
used, and system feeds and interfaces) with multiple concurrent
users, ensuring control, integrity, and accessibility.
- Work with offsite consultants to maintain the
infrastructure.
- Work with vendors to troubleshoot, upgrade and repair systems
as needed.
- Participate in a 24/7 on-call rotation.
About You:
You enjoy an agile, very fast-paced and highly technical environment.
You are a self-driven, accomplished technologist who strives to keep
improving your skills, your value to the company, and the
computational infrastructure. You are dedicated to engineering
excellence yet pragmatic and flexible. You have the ability to
maintain the day-to-day support SLA while running various key
projects that move the business forward.
- 6+ years of Linux/Unix administration, knowledge of Unix
network protocols, TCP/IP network fundamentals, core infrastructure
technologies and virtualization
- 6+ years of large-scale data storage and compute clusters (HPC)
infrastructure
- 4+ years working in and with on-premises and cloud-based (AWS,
Google, IBM and Azure) datacenters.
- 3+ years of building software release and ops processes and
automation toolset
- 5+ years providing documentation of system
administration.
Qualifications:
The following skill sets are preferred:
- Experience administering IBM's General Parallel File
System
- Experience administering Grid Engine scheduler
- Experience with using Bright Cluster Manager
- Experience with cloud bursting technologies
- Experience with wide area file systems
- Experience with Docker and container technologies
- Experience with Kubernetes, preferably with a current Certified
Kubernetes Administrator (CKA) certification
- Experience operating infrastructure compliant with HIPAA and SOX
standards
Education:
B.S. in Computer Science or related field
Additional Information:
Employee may be required to lift routine
office supplies and use office equipment. Majority of the work is
performed in a desk/office environment; however, there may be
exposure to high noise levels, fumes, and biohazard material in the
laboratory environment. Ability to sit for extended periods of
time.
Guardant Health is an Equal Opportunity Employer. All
qualified applicants will receive consideration for employment
without regard to race, color, religion, sex, sexual orientation,
gender identity, national origin, or protected veteran status and
will not be discriminated against on the basis of disability.
All your information will be kept confidential according to EEO
guidelines.
To learn more about the information collected when you
apply for a position at Guardant Health, Inc. and how it is used,
please review our Privacy Notice for Job Applicants.
Please visit our career page at: http://www.guardanthealth.com/jobs
Keywords: Guardant Health, Redwood City, Senior Staff High Performance Computing Engineer, Engineering, Redwood City, California