Senior Site Reliability Engineer
Science 37 (View all Jobs)
Raleigh, North Carolina (Remote)
Interview Process
1. 30 min call with hiring manager on previous work experience 2. Take-home interview (given 1 week to complete) 3. 1 hour Zoom interview with panel of engineers to discuss homework 4. 30 min informal call with hiring manager to ask questions about role and company
Programming Languages Mentioned
JavaScript, Python, SQL, Golang
This is a fully Remote and Work From Home (WFH) opportunity within the US
Science 37 is accelerating the research and development of breakthrough biomedical treatments by bringing clinical trials to patients' homes. The Science 37 Operating System (OS) enables universal access to patients and providers, leading to faster enrollment, greater retention and a more representative patient population. To help us achieve our goal, we are seeking a Senior Site Reliability Engineer eager to make an impact within a mission-driven organization.
POSITION OVERVIEW
Science 37 is accelerating the research and development of breakthrough biomedical treatments by bringing clinical trials to patients' homes. Backed by venture investors such as Glynn Capital, Google Ventures, Redmile Group, dRx Capital and Lux Capital, we are revolutionizing the clinical trial industry one patient at a time. To help us achieve our goal, we are seeking a Senior Site Reliability Engineer eager to make an impact within a mission-driven organization.
The Senior Site Reliability Engineer will join the Cloud & Release Engineering team and serve a critical role in maintaining the reliability of Science 37 products through the implementation of observability toolsets and creating automated processes to replace previously manual work operations tasks.
DUTIES AND RESPONSIBILITIES
Duties include but are not limited to:
- Ensure the observability of Science 37 systems through full use of observability toolsets.
- Support Development by making it easy to integrate with the observability toolsets.
- Work with Development teams to ensure that logs, traces and metrics are easily consumable and provided at the appropriate level of granularity.
- Create internal and external dashboards to visualize key performance and reliability metrics.
- Work with Development and Product teams to ensure alert definitions and channels are aligned with business and operations requirements.
- Ensure that all logs are retained in accordance with Standard Operating Procedures and Regulatory requirements.
- Work with Development, Product and Cloud teams to conduct root cause analysis on system and application failures.
- Develop automated remediation/healing processes to reduce risk and increase reliability of systems
- Conduct annual Disaster Recovery testing for all Science 37 products.
- Work with Development and Cloud engineering teams to understand changes in products and adapt SRE processes and toolsets accordingly.
- Ensure consistency of AWS environments through Infrastructure as Code and Configuration Management systems.
QUALIFICATIONS & SKILLS
Qualifications
- Bachelor's Degree in Computer Science or equivalent professional experience
- 3-5 years of experience in Site Reliability Engineering, Software Development or Cloud Operations
Skills/Competencies
- Proficient with at least one of the following languages:
- Python
- Golang
- Spring and/or Spring Boot
- NodeJS
- Working knowledge of/experience with NewRelic and/or Splunk
- Working knowledge of/experience with Hashicorp Terraform
- Working knowledge of/experience with Ansible, Puppet, Chef or other Configuration Management tools
- Working knowledge of/experience with Amazon Web Services
- Working knowledge of/experience with RDBMS and/or NoSQL
- Working knowledge of/experience with Linux CLI
- Ability to bridge the communication gap between technical and non-technical stakeholders
- Demonstrated ability to analyze requirements and resolve ambiguity
- Demonstrated ability to write technical documentation in a clear, concise manner
- Ability to mentor and train less experienced team members
- Working knowledge of data manipulation in either RDBMS or NoSQL (or both)
- Working knowledge of Linux CLI for debugging and troubleshooting
- One or more Associate or Professional Certificate
- Background in healthcare or life science is a plus
Capabilities
- Ability to clearly communicate in non technical and technical English (both verbal and written)
- Ability to participate in an on-call rotation
REPORTING
This role will report to the Director of Cloud & Release Engineering, who will also assign projects, provide general direction and guidance.
DIRECT REPORTS
No direct reports
Science 37 is an equal opportunity/affirmative action employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law.
Science 37 values the well-being of its employees and aims to provide team members with everything they need to succeed.
Submit your resume to apply!
To learn about Science 37's privacy practices including compliance with applicable privacy laws, please click here
Please mention No Whiteboard if you apply!
I'm a one-man team looking to improve tech interviews, and could use any support! 😄