DevOps / Site Reliability Engineer - GCP - Hypergrowth Healthcare GenAI Startup
Palo Alto, CA
The position is based in the company’s office in Palo Alto. All of the company’s engineers work in the office 5 days/week, and it seeks candidates who will be excited about working in a fast-paced environment where teammates are together in person. Work hours tend to average 40/week in the office and 50/week total. They can occasionally reach 60/week when on call (on-call duty is shared across the entire engineering team) or in the midst of a big deliverable.
The company’s product is B2B2C and involves healthcare, agentic AI, generative AI, and LLMs. The company seeks candidates who would be passionate about improving healthcare by increasing its efficiency, raising its quality, and lowering its cost.
The company was founded about 2 years ago and has raised a very large round of funding. It has many large customers, and its revenue is growing rapidly. The company has about 70 employees, including 30 engineers, and is hiring rapidly.
The company is targeting a salary range of $200k-$275k, plus equity, which can be very lucrative.
Job Responsibilities:
- Working with a team of 3 peers.
- Working with software engineers to design and deploy scalable, fault-tolerant, and secure production systems on cloud platforms such as GCP or Azure. This position will be focused on GCP.
- Designing and implementing infrastructure automation and deployment pipelines using technologies such as Terraform, Ansible, and Jenkins.
- Implementing and maintaining monitoring and logging systems to ensure the reliability and performance of the healthcare AI platform.
- Collaborating closely with software engineers, research scientists, and other cross-functional teams to develop and maintain reliable and scalable infrastructure that enables rapid iteration and deployment of products.
- Developing and maintaining security and compliance policies and procedures for the healthcare AI platform.
- Collaborating with cross-functional teams to troubleshoot and resolve complex issues related to infrastructure, deployment, and operations.
- Implementing and maintaining disaster recovery and business continuity plans.
- Developing and maintaining documentation related to infrastructure, deployment, and operations.
Qualifications:
- A Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent experience.
- At least 5 years of professional experience in DevOps engineering.
- Expertise in infrastructure automation and deployment tools such as Terraform, Ansible, Jenkins, or GitLab CI/CD.
- Skilled with Google Cloud Platform (GCP).
- Skilled with containerization technologies such as Docker and Kubernetes.
- Experience with monitoring and logging tools such as ELK, Grafana, or Datadog.
- Familiarity with security and compliance best practices and tools such as HashiCorp Vault, AWS KMS, or Azure Key Vault.
- Strong problem-solving skills with the ability to work independently and collaboratively in a team environment.
- Excellent communication and interpersonal skills.
Nice to have:
- Experience with HIPAA and SOC2.
- Experience implementing HIPAA and SOC2 compliance.
- Experience working in an HPC environment.
About Skyrocket Ventures
Skyrocket Ventures is a recruiting firm for hundreds of high growth technology companies that range from industry leaders to top-tier startups. This opportunity is with one of our client companies for a full-time permanent hire. Please only apply if you are authorized to work in the U.S.
Please note that even if this job is not a perfect match, we encourage you to apply as long as it is in the ballpark. Companies are often flexible in hiring candidates who do not perfectly fit their written job description, as long as the most important qualifications are there and the candidate is good in general.
Most of the jobs we are recruiting for are not posted online, so if you would like to know of all the opportunities we have that match your interests and qualifications, then please get in touch with us.
After you apply to this job posting, we’ll consider you for this job as well as any other potential matches with our client companies. If we have any potential matches, we’ll share your resume with those companies and contact you about any interview opportunities we can get you.
Thank you, and we wish you a great job search!