Site Reliability Engineer Job at Interactive Resources - iR, Austin, TX

  • Interactive Resources - iR
  • Austin, TX

Job Description

Our client is seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join their Advisor Platform Engineering team. This critical position focuses on maintaining the availability, performance, and scalability of a mission-critical Azure-hosted platform serving thousands of financial professionals nationwide.

As an individual contributor, you’ll leverage your growing expertise in cloud infrastructure, automation, and observability to improve and support platform reliability. You’ll work hand-in-hand with Agile development teams, integrating reliability practices throughout the application lifecycle and driving meaningful improvements.

This role is ideal for someone who thrives on solving complex infrastructure challenges and enjoys working with cutting-edge cloud technologies.

This is a full-time (FTE), direct-hire opportunity. You will work onsite with the team in Austin, Texas!

What you will do in this role:

  • Azure Infrastructure Management: Oversee the performance, availability, and capacity of key Azure services including VMs, App Services, Function Apps, Container Apps, Azure SQL, Cosmos DB, and more.
  • Enhance Observability: Define and refine SLIs/SLOs, configure monitoring, logging, and alerts using Azure Monitor, Application Insights, and Log Analytics (KQL).
  • Automation & Tooling: Eliminate manual processes by developing automation scripts and tools using PowerShell, Bash, Python, and optionally C#/.NET.
  • Incident Management: Take part in a rotating on-call schedule, leading incident resolution, root cause analysis, and implementing post-incident improvements.
  • Cross-Team Collaboration: Partner with developers, QA, and tech teams throughout the SDLC to ensure performance and reliability goals are met.
  • Capacity & Performance: Contribute to system load testing, performance tuning, and capacity planning, especially within a .NET/React microservices architecture.
  • Documentation & Knowledge Sharing: Maintain system documentation, runbooks, FAQs, and mentor peers in SRE practices.
  • Integration Support: Troubleshoot API integrations, SSO setups, and secure file transfer protocols.
  • Continuous Improvement: Contribute ideas and solutions to improve automation, cost efficiency, security posture, and reliability processes.

What you need to be successful in this role:

  • Strong hands-on experience with Azure cloud services, including IaaS and PaaS (networking, compute, storage, databases, messaging, security).
  • Deep knowledge of Azure observability tools and KQL for metrics/log analysis.
  • Proficient in scripting languages such as PowerShell, Bash, or Python.
  • Experience with CI/CD practices and tools (preferably Azure DevOps).
  • Solid understanding of Git workflows and platforms (e.g., Azure Repos, GitHub).
  • Strong foundation in networking concepts (DNS, TLS, firewalls, etc.).
  • Skilled in diagnosing complex, distributed system issues.
  • Strong communicator with a collaborative mindset; effective independently or in a team.
  • Familiarity with Agile methodologies and Scrum practices.
  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent hands-on experience.
  • 2–5 years of experience in Site Reliability Engineering, DevOps, Systems Administration, or a related field with operational focus.
  • Prior experience supporting production systems in Azure is strongly preferred.
  • Proven ability to implement observability practices and contribute to on-call operations.
  • Track record of successful automation and operational improvement.
  • Background in financial services or other regulated industries is a plus.

Certifications (Preferred but not required)

  • Microsoft Certified: Azure Administrator Associate (AZ-104)
  • Microsoft Certified: DevOps Engineer Expert (AZ-400)
  • ITIL v4 Foundation or equivalent service-management credential
  • Other relevant cloud or infrastructure certifications
