Site Reliability Engineering (REMOTE)

As a Site Reliability Engineer at STRATIS, you’ll contribute towards the codebases of our growing IoT ecosystem, enabling scalable smart-access and smart-building deployments. This is a role that will work with our cloud infrastructure (Kubernetes), and support our mostly-Python backend and our React web apps / React Native mobile apps.

Primary Responsibilities
  • Be a team player: actively provide code reviews for team members and mentor less experienced colleagues when able
  • Own end-to-end responsibility on projects of increasing complexity
  • Make steady progress on tasks; know when to ask for help in order to get "unblocked"
  • Fully understand tools used by the team and be able to assist others if needed
  • Able to interpret and follow requirements and be comfortable raising questions or issues with proposed solutions or a clear path to gaining a solution
  • Write clean, correct code incorporating feedback from code reviews; consistently follow best practices
  • Contribute to the technical design of features with teammates and architects
  • Take the initiative to fix small problems before being asked, without compromising progress on assigned tasks
  • Build both deep and broad knowledge about our system
  • Make well-reasoned design decisions, recognizing tradeoffs and considering priorities
  • Identify potential problems/risks in your own work and the work of others. Propose solutions to problems identified
  • Provide on-call support as required by the team
  • Work effectively in a team environment, across the organization, as well as work independently
Required Knowledge
  • Bachelor’s degree in computer science, information technology, or engineering OR equivalent work experience
  • Experience with supporting software in production (preferably with Kubernetes and/or Docker)
  • Knowledge of a dynamic language like Python or Javascript
  • Experience with container orchestration tools, preferably Kubernetes
  • Experience with at least one cloud platform (e.g. AWS, GCP, or Microsoft Azure)
  • Able to determine when to use appropriate cloud platform technologies as they relate to existing infrastructure
 

PREFERRED KNOWLEDGE/SKILLS/ABILITIES

  • Experience in automating releases, continuous integration/delivery systems and relevant tools (e.g. Jenkins or Azure Pipelines)
  • Monitoring and observability systems (e.g. Prometheus, Grafana )
  • Experience with service meshes, such as Linkerd or Istio #