SRE /Site Reliability Engineer /DevOps Engineer - Santa Clara

Company: ITC Infotech
Location: Santa Clara , California, United States
Type: Full-time
Posted: 13.FEB.2019

Summary

About us We are a growing Global IT Consulting Organization 200 mil+ in revenues, 8000+ resourcesrsquo footprint across Americas, Europe, So...

Description

About us We are a growing Global IT Consulting Organization 200 mil+ in revenues, 8000+ resourcesrsquo footprint across Americas, Europe, South Africa, Asia Pacific and India. Presence in 24 states in the US. Plenty of successful use cases across diversified verticals including Banking Financial Services, Healthcare, Mfg, CPG-Retail, Hi-Tech Technology footprint in IoT, ML, AI, AR, RPA, Blockchain, Big Data, Product Engineering (we own a technology agnostic DevOps product), App Dev, Infra, PTC - PLM, ERP, BI, DW, Data Sciencehellip Innovation Lab in San Jose Backed by a parent company with over 50 billion market cap over 24,000 employees. Job Role As a Site Reliability Engineer, you will own and improve service reliability and availability of this revolutionary platform. You will have your hand on the pulse of the service and will play a key role in the DevOps team that deploys, manages and supports the service. Additionally, you will have the opportunity to develop future automation and tooling that will allow us to continuously improve our service as we scale. You will contribute to the full service lifecycle from service development to live-service response, as we continuously deploy new and innovative functionality for our customers Responsibilities bull Development and Operations (DevOps) subject matter expert bull Work hand-in-hand with micro-service software developers, architects, and field integration resources to architect and deliver Client needs. bull Contribute to the development of new tools and automation that ensures the service can be optimized and tuned with minimal human intervention. bull Accountable for working upstream with micro service developers on monitoring, tools and architecture to deliver security, reliability, manageability and availability at scale bull Point of escalationdecision maker on response level of incidents bull Participate in the Core SRE on-call roster and respond with command and control incident management during High-Pri Events while maintaining internal and external SLAs bull Act as Technical Duty Officer who leads resolution effort of the most complex service problems from network layer to the application at scale bull Drive Problem ManagementRetrospectives (ldquopost mortemsrdquo) bull Strong contribution and maintenance of our knowledge base bull Analyze trends and make recommendations in the areas of monitoring, incident and change management, cloud orchestration and support. bull Contribute to the future growth of the team by conducting candidate screenings and assessments bull Accountable for deploying services to production environments Technologies bull Experience with Docker and SaltStackAnsible, Kubernetes orchestration tools, etc. bull Knowledge of MongoDB, Cassandra databases, IIS Servers on GCPAzureAWSOpenStack bull GCP, Azure, OpenStack and AWS concepts and APIs bull Experience designing, setting up and maintaining, refining (noise reduction, auditing) monitoring tools such as Prometheus, Prometheus exporters, Grafana, Alertmanager, etc. bull Demonstrable experience in one or more languages Python, Powershell, BASH, C, .NET bull Strong knowledge of TCPIP networking, DNS, VPNs, HTTP, load-balancers (such as NGINX), highly available microservice architecture bull Team Foundation ServerVisual Studio, Atlassian suite (Jira, Confluence), Git bull Network analysis, performance and application issues using tcpdump, Fiddler and Wireshark. Qualifications bull Bachelor's Degree in CS, MIS, or equivalent experience in infrastructure, systems, engineering or development environment. bull 8+ years of relevant experience with WindowsUnix systems fundamentals, monitoring, Cloud services, networking, storage, database, and application knowledge. If interested please email me anil.goud (at)itcinfotech dot com Thank you, Anil Goud Talent Acquisition Consultant - NA Phone Email anil.gouditcinfotech.com mailtoanil.gouditcinfotech.com Linkedin https https

 
Apply Now

Share

Free eBook

Flash-bkgn
Loader2 Processing ...