SRE lead ( Python scripting , RUST, c++, Prometheus Grafana and ELK )

Company: SigmaWay
Location: Not Specified, Not Specified, United States
Type: Full-time
Posted: 28.APR.2021

Summary

Apply by Email/Direct Application at SRE Lead Location: SF - 100% remote Duration: 6 month Must haves: RUST or C++ - - if no RUST be op...

Description

Apply by Email/Direct Application at

SRE Lead

Location: SF - 100% remote

Duration: 6 month

Must haves:

RUST or C++ - - if no RUST be open to learning

Python - scripting

5+ years of experience

PrometheGrafana and ELK - monitoring

networking protocols

Product Launch - in May

launching new product, fully open and they can by their block chain products, buy their tokens.

will have an event for the new launch

blockchain company - when engineers want to do coding on internet - its on AWS, AWS gets to decide.

they are reinventing the internet - if they use Dfinity platform they do not need to use AWS

Job Description:

Responsibilities:

  • Implement tools that ensure high availability of DFINITY's product
  • Gain deep knowledge of DFINITY's complex applications
  • Identify opportunities to automate or improve processes and then implement the automation
  • Coordinate incident response across multiple teams -- clearly understanding and communicating what is going on, next steps, who is responsible for what, and so on
  • Implement observability tools to ensure visibility into service stability and performance
  • Be on-call for production services
  • Operating, troubleshooting, and deploying software to Unix systems
  • Thinking about things in a systemic, methodical way, especially when troubleshooting

Required Skills:

  • Expertise in observability and monitoring of applications, services, and networks, using tools such as PrometheGrafana and ELK logging
  • Unix/Linux experience, including application installation, configuration, and maintenance
  • Significant experience with site reliability, developer productivity, devops, or server infrastructure engineering (including on call incident response)
  • Understanding of Internet networking protocols: TCP/IP, TLS, DNS, HTTP/S, SMTP
  • Experience troubleshooting issues across the entire stack (hardware, software, network, etc)
  • Experience writing automation scripts and utilities in a scripting language such as Python, Perl, Shell, PHP, etc
  • Experience with incident and problem management • Strong communication and interpersonal skills

Desired Skills

  • Experience coding in Rust or C++
  • Experience supporting large-scale, mission critical services
  • Experience with CI/CD pipelines
- provided by Dice

 
Apply Now

Share

Free eBook

Flash-bkgn
Loader2 Processing ...