Azure Site Reliability Engineer (SRE)
Azure Site Reliability Engineer (SRE)
Job type:
Skillhouse Contract
Specialization:
Software/Web Development
Language Level:
Japanese Level - High Intermediate (JLPT Level 2),English Level - Advanced (TOEIC 860)
Location:
Sumida-Ku
Salary:
¥750,000.00 - ¥850,000.00 Monthly
Job Reference:
493686
A global and one of the world’s largest Insurance Service provide is seeking an experienced Azure Site Reliability Engineer (SRE) to join its Data Management Office (DMO) – SRE team.
Responsibilities:
- Ensure high availability, resilience, and performance of Azure data platforms and services
- Monitor system health using metrics, logs, and alerts; proactively identify and mitigate risks
- Define and maintain SLIs, SLOs, and error budgets to support service reliability
- Actively participate in incident response, including P1 / P2 production incidents
- Troubleshoot and resolve issues across Azure infrastructure, data pipelines, and services
- Lead or contribute to root cause analysis (RCA) and post-incident reviews
- Implement corrective and preventive actions to avoid recurrence
- Design and implement CI/CD pipelines using Azure DevOps (YAML)
- Manage Azure infrastructure and SQL database environments (Azure SQL, SQL Server)
- Build and enhance automation for monitoring, alerting, recovery, and operational tasks
- Improve runbooks, operational documentation, and on-call readiness
- Partner with engineering teams to design systems with reliability and operability built in
- Support release, deployment, and change activities in collaboration with SRE Program Managers and engineering teams
- Validate operational readiness for new features, platform changes, and data pipelines
- Ensure compliance with enterprise change and governance processes
- Implement system monitoring using Azure Monitor, Log Analytics, and custom dashboards
Required Skills:
- 4~5 years of technical work experience
- Strong hands-on experience as an Azure SRE, Cloud SRE, or Platform Engineer
- Hands-on experience with YAML, Terraform, ARM Templates, or Ansible
- Strong background in Azure DevOps, CI/CD, and Infrastructure as Code (IaC)
- Solid knowledge of Azure resource management, monitoring, and SQL administration
- Experience with Azure Synapse, Azure Log Analytics, and Windows Server environments
- Proven ability to troubleshoot, analyze, and resolve production issues quickly
Why should you apply:
- Opportunity to work with global teams and great Work-Life-Balance
- Great team dynamics and learning opportunity
- Opportunities to work with World’s leading insurance company (fortune 500 company)
Company Details:
A US based world’s leading insurance providers, offering a broad range of life, health, and retirement solutions to individuals, families, and businesses. The company is heavily invested in digital transformation, utilizing advanced technologies like cloud computing, data analytics, AI, and cybersecurity to enhance customer experience and streamline operations. As part of its values, it has a strong focus on creating a diverse environment, and in particular on the appointment of women in high-level position.
Working Hours: 9:00 - 18:00 (Mon-Fri)
Working Style: 3 days’ work in office, and 2 days’ work from home
Holidays: Saturday, Sunday, National Holidays, Year-end and New Year Holidays, Paid Holidays
Services/Benefits: Transportation expenses up to 20,000 yen per month, plus Paid leave, plus social insurance (health insurance, welfare pension, and work-related accident insurance), Periodic health examination, and Employment insurance



