Site Reliability Engineer
Office Depot Inc
Contract Boca Raton , Florida, United States Posted 3 years ago
About Position
Site Reliability Engineer (Contract)
$60.00 / Hourly
Boca Raton , Florida, United States
Site Reliability Engineer
Contract Boca Raton , Florida, United States Posted 3 years ago
Skills
1. Independently designs implements productionizes and maintains site reliability guidelines processes and systems 2. Service Level Definition Configuration and Measurement: Define SLIs SLOs & SLAs specific to each application or system: Configuration of monitoring & alerting tools suitable for each product and/or platform team Measure reliability & resilience (through pre-defined SLIs & SLOs) utilizing -monitoring/alerting tools to drive continuous improvement based on data analysis 3. Incident Management Facilitation of incident response through the engagement of various teams and stakeholders while providing robust communication and visibility to the organization during service interruptions Provide Root Cause Analysis for failures Experience with a modern incident management platform (OpsGenie) to effectively drive incident response and problem resolution 4. Monitoring & Alerting Debug defects as well as develop dashboards using modern monitoring tools (e.g. New Relic Splunk AIOPs) to enable a reduction in mttd (detection time) & mttr (resolution time) Build monitors and alerts designed to manage SLAs optimize performance and minimize outages Construct E2E customer journey dashboards and alerts for customized transactions and applications 5. Automates reliability requirements into system and application implementations and updates; including the implementation of self-healing solutions (ansible terraform etc). 6. Work with product management team to contribute to 1) the identification of reliability features & requirements and 2) level of effort estimatesDescription
The ideal candidates should have advanced coding skills in Java, Go, Python, Shell and YAML, preferably with a minimum of 3-5 years of experience in all of these or similar languages.
Candidates should have 3+ years’ experience in SRE and either or both of the following roles: DevOps, Software Engineering, leveraging automation extensively to achieve key deliverables.
Summary:
The role of Sr. Site Reliability Engineer is to support and enforce reliability elements into technological solutions that deliver an exceptional customer experience. As part of Office Depot’s Site Reliability Engineering team, you’ll leverage your development background to promote a framework which will deliver optimal levels of performance and reliability throughout Office Depot’s systems and services. You will collaborate with our product teams and software developers to improve the resiliency of our applications through development based on reliability requirements. You’ll bridge the gap between platform and product teams to ensure deployment consistency throughout our Technology organizations while utilizing your operational excellence to provide stability across our customer-facing sites and services. This is an opportunity to shape and strengthen our SRE practice, serving as a key contributor to a versatile, high velocity team.
By applying to a job using PingJob.com you are agreeing to comply with and be subject to the PingJob.com Terms and Conditions for use of our website. To use our website, you must agree with the Terms and Conditions and both meet and comply with their provisions.