109 Site Reliability jobs in the Philippines

Site Reliability Engineer

Makati City, National Capital Region ₱104000 - ₱130878 Y Drake International Philippines

Posted today

Job Description

Drake International Philippines is actively hiring an IT Observability Engineer / Site Reliability Engineer who is eager to advance their growing career.

ABOUT THE ROLE:

Job Title: IT Observability Engineer / Site Reliability Engineer

Employment Type: 6-month contract (renewable)

Work Set-up and location: Onsite, Makati

Work Schedule: Mondays to Fridays

Here's what we're looking for in an IT Observability Engineer / Site Reliability Engineer:

  • Must have 4+ years of experience in IT operations or a similar role, with a solid foundation in monitoring and observability principles,
  • Must be proficient in Prometheus, Grafana, Splunk, Jaeger, or similar industry-leading tools,
  • Must have solid experience with cloud-based observability platforms like AWS CloudWatch or Azure Monitor,
  • Must have knowledge of security monitoring tools and experience with incident response methodologies and best practices, and;
  • Must have experience in scripting and automation, with proficiency in languages like Python, Bash, or Go for data manipulation and automation tasks.

Do you think you're a perfect fit for the job? Wait no more and let Drake help you with your career as an IT Observability Engineer / Site Reliability Engineer. Apply now!

Site Reliability Engineer

₱900000 - ₱1200000 Y Reed Elsevier Philippines

Posted today

Job Description

Join us and enjoy benefits designed to help you thrive:

  • Flexible hybrid work setup (1-2 days/month onsite reporting)
  • IT Equipment provided
  • HMO coverage starting from Day 1 for you and FOUR FREE dependents
  • Attractive retirement package with company matching
  • Life and Accident Insurance starting Day 1
  • 24 Annual PTOs, additional 6 once you reach your 5th year with us
  • Competitive benefits with annual merit increase and incentives
  • Continuous improvement for our employees (workshops, certification programs, learning sessions, etc.)

Accountabilities:

  • Infrastructure & Cloud Management
    • Design and maintain scalable, secure, and highly available infrastructure (AWS).
    • Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, and Ansible.
    • Manage container orchestration platforms (ECS/ECR).
  • Automation & CI/CD
    • Build and maintain CI/CD pipelines for automated testing, deployment, and rollback.
    • Automate routine operational tasks to reduce manual effort and improve reliability.
    • Integrate security and compliance checks into pipelines (DevSecOps).
  • Monitoring & Observability
    • Set up and maintain monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK).
    • Define and track SLIs, SLOs, and SLAs.
    • Implement distributed tracing and performance profiling.
  • Incident Management & Reliability Engineering
    • Participate in on-call rotations and lead incident response efforts.
    • Conduct root cause analysis and write postmortems.
    • Design self-healing systems and automated recovery mechanisms.
    • Apply chaos engineering principles to test system resilience.
  • Security & Compliance
    • Manage secrets, access controls, and identity policies (IAM, Vault).
    • Ensure infrastructure and deployments meet compliance standards (e.g., SOC 2, ISO
    • Remediate vulnerability issues.
  • Collaboration & Mentorship
    • Work closely with software engineers, QA, and product teams to ensure smooth releases.
    • Mentor junior engineers and contribute to team knowledge sharing.
    • Participate in architectural reviews and technical planning.

Qualifications:

  • Bachelor's degree holder
  • Proven experience in SRE, DevOps, or software engineering roles.
  • Strong scripting skills (Python, Bash).
  • Expertise in Linux systems, networking (TCP/IP, DNS), and cloud platforms (AWS).
  • Experience with CI/CD tools (GitLab, GitHub Actions, Jenkins) and container orchestration (AWS ECS/ECR).
  • Experience with automation and configuration management tools (Ansible, Terraform)
  • Experience with designing highly available, scalable, and secure systems
  • Excellent problem-solving and communication skills.
  • AWS Certified Solutions Architect – Associate/Professional and AWS Certified DevOps Engineer – Professional certifications are an advantage but not required.
  • Preferably with Application Development background

Site Reliability Engineer

₱30000 - ₱150000 Y Braintrust

Posted today

Job Description

Compensation range varies by level of experience:
Jr SRE: $12k-$8k/yr, Intermediate: 20k-30k/yr, Senior: 35k-50k/yr

Some travel may be required.

Card payment domain knowledge/experience is key:
Our client, a global Business Process Outsourcing (BPO) business, is looking for Site Reliability Engineers (SREs) to support their client, a global payment technology company that provides platforms for consumers, businesses and organizations to make electronic payments. The successful candidate will be responsible for ensuring site reliability and performance, monitoring and alerting, and supporting emergency response situations. This requires working closely with software engineers, DevOps and product teams to maintain robust infrastructure and automation that supports mission-critical applications.

The ideal candidate creates a bridge between development and operations by applying a software engineering mindset to service management. We are seeking an individual who is highly motivated, intellectually curious, and seeks out opportunities for improvement.

The Role:
This role involves working with a team of talented SREs/DevOps Engineers to support highly scalable services. Responsibilities include:

  • Build and maintain pipelines in accordance with the client's tooling and conventions.
  • Participate in the software development lifecycle, working closely with the development team to ensure that designed solutions meet non-functional requirements such as availability, performance, security and maintainability standards.
  • Maintain services through monitoring of metrics, system health, and analysis of reports.
  • Provide support for production and in-house systems. Participate in the on-call production support rota.
  • Handle incident management, on-call support and root cause analysis, conducting post-incident reviews and 5-Whys analysis.
  • Remediate system vulnerabilities and implement security and resiliency measures.
  • Improve processes and systems within the Program.
  • Lead incident management efforts by proactively monitoring and analyzing ISO 8583 financial transaction messages across the 4-party payment model (Cardholder, Merchant, Acquirer, Issuer).

Skills & requirements:
Minimum 2+ years of experience

  • Card payment domain knowledge (mandatory)
  • Experience with CI/CD and build pipelines using Jenkins.
  • Experience in public and private cloud offerings (PCF, Azure, AWS, etc.).
  • Knowledge of NoSQL and SQL databases such as Mongo / Oracle / Postgres.
  • Experience and knowledge of managing distributed systems and working with microservices.
  • Familiarity with Unix tooling, with strong scripting skills
  • Exposure to working with monitoring and alerting tools such as Splunk, Dynatrace
  • Proficiency in one of the following: Python, Java, Go or equivalent.
  • Familiarity with defining SLOs and SLAs
  • Prior experience working in an SRE/DevOps team and excellent understanding of SRE/DevOps principles.
  • High degree of initiative and self-motivation, with a willingness to take on challenging opportunities.
  • Excellent communication and relationship building/collaboration skills.

Site Reliability Engineer

₱1500000 - ₱2500000 Y QualityKiosk Technologies Pvt. Ltd.

Posted today

Job Description

Experience: 6 to 10 years

Location: Makati

About QualityKiosk Technologies

QualityKiosk Technologies is one of the world's largest independent Quality Engineering (QE) providers and digital transformation enablers, helping companies build and manage applications for optimal performance and user experience. Founded in 2000, the company specializes in providing quality engineering, QA automation, performance assurance, intelligent automation (IA) and robotic process automation (RPA), customer experience management, site reliability engineering (SRE), digital testing as a service (DTaaS), cloud, and data analytics solutions and services. With operations spread across 25+ countries and a workforce of more than 4000 employees, the organization enables some of the leading banking, e-commerce, automotive, telecom, insurance, OTT, entertainment, pharmaceuticals, and BFSI brands to achieve their business transformation goals. The company is banking on its speed of execution and technology advancement as key factors to drive 5X growth in the next five years, both in revenues and number of employees.

Job Description

  • Firsthand experience implementing or deploying AppDynamics solutions into applications in production environments.
  • Hands-on experience in AppDynamics (Java, .NET agent, EUM, BIQ, Server & Network), Business Transaction Configuration, Dashboard Configuration, Incident/Alert Configuration, Task Scheduling, Plugin Configuration
  • Strong understanding of application platforms, including network, database, runtime, application, and user interface.
  • Excellent communication, collaboration, and conflict resolution skills with the ability to adapt to various business needs. Knowledge of Ansible will be an advantage.
  • Experience in designing and implementing various tools like Dynatrace.
  • Should have worked extensively in implementation, configuration and maintenance of APM tools such as Dynatrace
  • Application performance management tool: Dynatrace (OneAgent, Synthetic Script, DCRUM, Enterprise Synthetic script, client automation).
  • Good to have knowledge of Python and Node.js.
  • OneAgent Certification (Dynatrace Associate) would be an added advantage
  • Relevant experience in Elastic Stack (Elasticsearch, Logstash, Kibana, Filebeat, Ingest Pipeline)
  • Strong experience in installing and configuring ELK, and in designing, building, deploying, maintaining, and enhancing the ELK platform
  • Strong experience in using Elasticsearch indices, Elasticsearch APIs, Kibana dashboards, Logstash, and Beats
  • Good experience in using or creating plugins for ELK like authentication and authorization plugins
  • Troubleshoot and resolve complex issues related to data ingestion, storage, search, and visualization within the ELK stack.
  • Good experience in enhancing open-source ELK for custom capabilities
  • Experience in integrating ELK with enterprise tools and APIs, for example for authentication and authorization
  • Capacity planning of Elasticsearch cluster
  • Fine-tuning

Site Reliability Engineer

₱60000 - ₱81000 Y Cambridge University Press & Assessment

Posted today

Job Description

  • Salary: Php 60,000 to Php 81,000
  • Location: Manila
  • Country: Philippines
  • Business Unit: Technology
  • Vacancy Type: Permanent
  • Closing Date: 9 October 2025

Meet the recruiter

Imee Santos

Work setup: Hybrid (open to 2x a week in the office)

Work schedule: 10AM to 6PM Manila time

Employment type: Permanent

Location: Makati City, Metro Manila

Pay range: Php 60,000 to Php 81,000

We value transparency and encourage applicants comfortable with this range to apply.

Discover a world of endless possibilities with Cambridge University Press & Assessment, a distinguished global academic publisher and assessment organization proudly affiliated with the prestigious University of Cambridge.

We are recruiting Site Reliability Engineers who will be part of our SRE function within the Platform Operations Team. This is a new team of engineers who will work alongside English Technologies' existing Platform Support and Engineering teams.

Why Cambridge?

Cambridge University Press & Assessment is a world-renowned not-for-profit academic publisher and assessment organisation, proudly part of the prestigious University of Cambridge. With a legacy rooted in over 800 years of educational excellence, we are dedicated to unlocking the potential of learners and educators across the globe.

Joining Cambridge's second largest global office in the Philippines, operating for over 22 years with 1,300+ colleagues, means becoming a part of an extraordinary institution renowned worldwide. We have been recognised as a Great Place to Work for three consecutive years, reflecting our inclusive culture, strong sense of purpose, and commitment to the professional growth and well-being of our people. At Cambridge, we don't just publish books or deliver tests; we empower progress, inspire curiosity, and champion the pursuit of knowledge.

What can you get from Cambridge?

At Cambridge, you'll become a part of a vibrant and forward-thinking community that transcends tradition, fostering a culture of continuous growth and personal development. Here, we provide the right environment for you to thrive, supporting your professional journey and empowering you to reach your highest potential. That is why our pay philosophy is tied to your skills and competencies, ensuring that your compensation aligns with the unique value you bring to the role you are applying for.

The organization offers a wide range of benefits and opportunities including:

  • Regular Employment on Day 1
  • HMO Coverage and Life Insurance on Day 1
  • Paid Annual Leaves (Vacation, Well-being, Flexible, Holiday, and Volunteering leaves)
  • Vesting/Retirement package
  • Opportunities for career growth and development
  • Access to well-being programs
  • Flexible schedule, hybrid work arrangement and work-life balance
  • Opportunity to collaborate with colleagues from diverse branches that will expand your horizons and enrich your understanding of different cultures

What will you do as a Site Reliability Engineer?

  • The Site Reliability Engineer will join a new SRE function within the Platform Operations Team working alongside existing Platform Support and Engineering teams.
  • The role will be responsible for support and design aspects of the English Engineers ecosystem (Platforms, Applications, Services and Websites).
  • Responsible for creating and maintaining software and processes that ensure the reliability and availability of the English digital platforms/websites and their software delivery pipelines.

What makes you the ideal candidate for this role?

  • Education & Experience: Degree or equivalent experience with at least 3 years in AWS Cloud Engineering, Architecture, or Infrastructure, combined with 3+ years in a Systems Admin or DevOps role.
  • DevOps & Delivery Model: Experience with DevOps delivery for infrastructure, applications, and configuration, including Infrastructure as Code (Terraform, CDK), CI/CD (GitHub Actions, Bitbucket Pipelines), and containerization/orchestration (Docker, Kubernetes).
  • Monitoring & Logging: Expertise with central logging systems (ELK/EFK stack), monitoring tools (New Relic, Datadog, Grafana, Alertmanager, PagerDuty, Site24x7), and troubleshooting production issues in cloud environments.
  • Cloud Infrastructure: Deep knowledge of AWS services such as Fargate, Route53, CloudWatch, API Gateway, Lambda, CodePipeline, CloudFormation, DynamoDB, and networking.
  • Application & Database: Breadth of experience across Elasticsearch, MySQL, PostgreSQL, Java, Git/GitHub, and Confluent/Kafka.
  • Technical Skills: Strong troubleshooting, debugging, documentation, and communication abilities.
  • Ways of Working: Experience working in Agile product development environments and collaborating with global teams across cultures.

Are you driven by a desire to be part of a globally renowned institution that celebrates innovation, embraces inclusion, and empowers learners? Then we invite you to Pursue your Potential with us.

Applications received through the system will be reviewed on a rolling basis, and we may close the vacancy once sufficient applications are received. Therefore, if you are interested, tailor your CV (submitting it with a cover letter is advantageous) and apply as early as possible.

Site Reliability Engineer

Pasay, National Capital Region ₱1000000 - ₱1500000 Y Vestas

Posted today

Job Description

Are you ready to guide the development of innovative infrastructure solutions for a technology-focused entity in the renewable energy sector? We are seeking a Senior Systems Engineer committed to automation, monitoring, and asset management—someone who takes charge of what happens next and promotes continuous improvement in our digital landscape.
This is a technology leadership role (without resource management), where you will be responsible for designing, implementing, and optimizing critical Linux-based infrastructure solutions, working with large datasets, automation frameworks, and observability platforms.

Who We Are

We are Orchestration and Monitoring Fulfillment, a division within Core Infrastructure & Network, focused on delivering scalable, automated, and reliable IT solutions. Our team ensures operational excellence in system monitoring and asset management while continuously enhancing processes through automation.
As a Senior Systems Engineer, you will work closely with stakeholders and cross-functional teams, making strategic decisions and implementing solutions that will shape the future of our IT landscape.
Enterprise Cyber Security & Infrastructure Services > Infrastructure Services > Core Infrastructure & Network

Responsibilities

Your Role as a Senior Systems Engineer

This is not just another engineering job; it is a chance to contribute to, design, and implement highly effective solutions while collaborating with global teams. You will be responsible for technical decision-making, infrastructure design, and automation strategies to ensure system efficiency and scalability.

Key Responsibilities

  • Monitoring & Observability
    • Design, build, and optimize monitoring solutions tailored for our environment
    • Deploy scalable, automated monitoring frameworks that enhance system visibility
    • Work with stakeholders to define monitoring needs and implement proactive alerting systems
  • Automation
    • Identify opportunities for automation to streamline operations and reduce manual efforts
    • Select effective automation tools and frameworks to support decision-making
    • Develop and maintain automation scripts and workflows to enhance system efficiency
  • Asset Management
    • Work with large datasets in our homegrown asset management environment
    • Ensure accurate tracking, reporting, and lifecycle management of IT assets
    • Optimize asset data integrity and develop solutions for better governance and control

Qualifications

  • 5-7 years of experience in a lead capacity within Linux system administration
  • Skill in database management, emphasizing the value of extensive knowledge
  • Comprehensive background in ETL applications and data integration methodologies
  • Deep understanding of automation frameworks like Foreman, Puppet, and Ansible

Competencies

  • A self-motivated, methodical, and progressive engineer who takes initiative and produces effective solutions
  • Dedicated to automation, efficiency, and data-driven decision-making
  • Someone who thrives in problem-solving and technical decision-making
  • A collaborative individual who can work across teams and engage key stakeholders
  • Dedicated to ongoing enhancements and keeping up with industry advancements

What We Offer
We offer you an exciting role with great professional and personal development opportunities in an inspiring, progressive, international work environment at an established manufacturer of wind turbines. We offer attractive company perks like fitness subsidy, health insurance, pension, life insurance, medical allowance, travel allowance, internet allowance, etc. We have a modern, inspiring office overlooking Manila Bay, conveniently located near public transport. We believe in work-life balance and plan annual off-site outings, team building, and sports events. In Vestas, you will experience an innovative environment where learning and growth are consistent, enhancing your personal and professional development.

Additional Information
We do amend or withdraw our jobs and reserve the right to do so at any time, including before the advertised closing date. Please be advised to apply on or before the 31st of October 2025.

Additional Benefits

  • Wellness Subsidy
  • Retirement Benefit Plan

BEWARE – RECRUITMENT FRAUD
It has come to our attention that there are a number of fraudulent emails from people pretending to work for Vestas.

DEIB Statement
At Vestas, we recognise the value of diversity, equity, and inclusion in driving innovation and success. We strongly encourage individuals from all backgrounds to apply, particularly those who may hesitate due to their identity or feel they do not meet every criterion. As our CEO states, "Expertise and talent come in many forms, and a diverse workforce enhances our ability to think differently and solve the complex challenges of our industry". Your unique perspective is what will help us power the solution for a sustainable, green energy future.

About Vestas
Vestas is the energy industry's global partner on sustainable energy solutions. We are specialised in designing, manufacturing, installing, and servicing wind turbines, both onshore and offshore.

Across the globe, we have installed more wind power than anyone else. We consider ourselves pioneers within the industry, as we continuously aim to design new solutions and technologies to create a more sustainable future for all of us. With more than 185 GW of wind power installed worldwide and 40+ years of experience in wind energy, we have an unmatched track record demonstrating our expertise within the field.

With 30,000 employees globally, we are a diverse team united by a common goal: to power the solution – today, tomorrow, and far into the future.

Vestas promotes a diverse workforce which embraces all social identities and is free of any discrimination. We commit to create and sustain an environment that acknowledges and harvests different experiences, skills, and perspectives. We also aim to give everyone equal access to opportunity.

To learn more about our company and life at Vestas, we invite you to visit our website and follow us on our social media channels. We also encourage you to join our Talent Universe to receive notifications on new and relevant postings.

Site Reliability Engineer

Pasay, National Capital Region ₱1200000 - ₱1500000 Y Vestas

Posted today

Job Description

Are you ready to guide the development of innovative infrastructure solutions for a technology-focused entity in the renewable energy sector? We are seeking a Senior Systems Engineer committed to automation, monitoring, and asset management—someone who takes charge of what happens next and promotes continuous improvement in our digital landscape.
This is a technology leadership role (without resource management), where you will be responsible for designing, implementing, and optimizing critical Linux-based infrastructure solutions, working with large datasets, automation frameworks, and observability platforms.

Who We Are

We are Orchestration and Monitoring Fulfillment, a division within Core Infrastructure & Network, focused on delivering scalable, automated, and reliable IT solutions. Our team ensures operational excellence in system monitoring and asset management while continuously enhancing processes through automation.
As a Senior Systems Engineer, you will work closely with stakeholders and cross-functional teams, making strategic decisions and implementing solutions that will shape the future of our IT landscape.
Enterprise Cyber Security & Infrastructure Services > Infrastructure Services > Core Infrastructure & Network

Responsibilities

Your Role as a Senior Systems Engineer

This is not just another engineering job; it is a chance to contribute to, design, and implement highly effective solutions while collaborating with global teams. You will be responsible for technical decision-making, infrastructure design, and automation strategies to ensure system efficiency and scalability.

Key Responsibilities

  • Monitoring & Observability
    • Design, build, and optimize monitoring solutions tailored for our environment
    • Deploy scalable, automated monitoring frameworks that enhance system visibility
    • Work with stakeholders to define monitoring needs and implement proactive alerting systems
  • Automation
    • Identify opportunities for automation to streamline operations and reduce manual efforts
    • Select effective automation tools and frameworks to support decision-making
    • Develop and maintain automation scripts and workflows to enhance system efficiency
  • Asset Management
    • Work with large datasets in our homegrown asset management environment
    • Ensure accurate tracking, reporting, and lifecycle management of IT assets
    • Optimize asset data integrity and develop solutions for better governance and control

Qualifications

  • 5-7 years of experience in a lead capacity within Linux system administration
  • Skill in database management, emphasizing the value of extensive knowledge
  • Comprehensive background in ETL applications and data integration methodologies
  • Deep understanding of automation frameworks like Foreman, Puppet, and Ansible

Competencies

  • A self-motivated, methodical, and progressive engineer who takes initiative and produces effective solutions
  • Dedicated to automation, efficiency, and data-driven decision-making
  • Someone who thrives in problem-solving and technical decision-making
  • A collaborative individual who can work across teams and engage key stakeholders
  • Dedicated to ongoing enhancements and keeping up with industry advancements

What We Offer
We offer you an exciting role with great professional and personal development opportunities in an inspiring, progressive, international work environment at an established manufacturer of wind turbines. We offer attractive company perks like fitness subsidy, health insurance, pension, life insurance, medical allowance, travel allowance, internet allowance, etc. We have a modern, inspiring office overlooking Manila Bay, conveniently located near public transport. We believe in work-life balance and plan annual off-site outings, team building, and sports events. In Vestas, you will experience an innovative environment where learning and growth are consistent, enhancing your personal and professional development.

Additional Information
We do amend or withdraw our jobs and reserve the right to do so at any time, including before the advertised closing date. Please be advised to apply on or before the 30th of September 2025.

Additional Benefits

  • Wellness Subsidy
  • Retirement Benefit Plan

BEWARE – RECRUITMENT FRAUD
It has come to our attention that there are a number of fraudulent emails from people pretending to work for Vestas.

DEIB Statement
At Vestas, we recognise the value of diversity, equity, and inclusion in driving innovation and success. We strongly encourage individuals from all backgrounds to apply, particularly those who may hesitate due to their identity or feel they do not meet every criterion. As our CEO states, "Expertise and talent come in many forms, and a diverse workforce enhances our ability to think differently and solve the complex challenges of our industry". Your unique perspective is what will help us power the solution for a sustainable, green energy future.

About Vestas
Vestas is the energy industry's global partner on sustainable energy solutions. We are specialised in designing, manufacturing, installing, and servicing wind turbines, both onshore and offshore.

Across the globe, we have installed more wind power than anyone else. We consider ourselves pioneers within the industry, as we continuously aim to design new solutions and technologies to create a more sustainable future for all of us. With more than 185 GW of wind power installed worldwide and 40+ years of experience in wind energy, we have an unmatched track record demonstrating our expertise within the field.

With 30,000 employees globally, we are a diverse team united by a common goal: to power the solution – today, tomorrow, and far into the future.

Vestas promotes a diverse workforce which embraces all social identities and is free of any discrimination. We commit to create and sustain an environment that acknowledges and harvests different experiences, skills, and perspectives. We also aim to give everyone equal access to opportunity.

To learn more about our company and life at Vestas, we invite you to visit our website and follow us on our social media channels. We also encourage you to join our Talent Universe to receive notifications on new and relevant postings.

Site Reliability Engineer

₱900000 - ₱1200000 Y Comrise

Posted today

Job Description

We are seeking a Site Reliability Engineer (Cloud) to join our growing technology team. In this role, you will be responsible for maintaining and enhancing the reliability, performance, and scalability of our cloud infrastructure. You'll apply software engineering principles to operations tasks, helping ensure the continuous availability and resilience of our cloud-based platforms on AWS and/or Azure.

This is a high-impact role for someone with a DevOps mindset, a strong grasp of cloud architecture, and a commitment to building self-healing, scalable, and observable systems.

Key Responsibilities

  • Design, implement, and support highly available and scalable cloud-based infrastructure.
  • Monitor system performance, uptime, and health metrics to proactively identify and resolve issues.
  • Develop and maintain automated tools and scripts for system deployment, configuration, and monitoring using Python, Bash, or PowerShell.
  • Use Infrastructure-as-Code (IaC) tools such as Terraform, CloudFormation, or ARM templates to manage cloud resources.
  • Set up, tune, and manage monitoring, logging, and alerting systems (e.g., CloudWatch, Datadog, ELK Stack, Prometheus, Grafana).
  • Work closely with engineering teams to ensure CI/CD pipelines are efficient, secure, and reliable.
  • Participate in on-call rotations and manage incident response, root cause analysis, and preventive measures.
  • Implement and enforce SLIs, SLOs, and SLAs to track and improve system reliability and performance.
  • Continuously improve system observability, fault tolerance, and disaster recovery strategies.

Minimum Qualifications

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 4+ years of experience in a cloud infrastructure, DevOps, or SRE role.
  • Proficiency with AWS and/or Azure cloud environments.
  • Strong scripting skills in Python, Bash, or PowerShell.
  • Hands-on experience with IaC tools (Terraform, CloudFormation, or similar).
  • Solid understanding of monitoring, metrics, and alerting systems.
  • Proven experience in incident management and system troubleshooting.
  • Strong focus on automation, performance tuning, and reliability engineering.

Job Type: Full-time

Benefits:

  • Company Christmas gift
  • Company events
  • Health insurance
  • Paid training

Application Question(s):

  • How many years of experience do you have in managing cloud infrastructure on AWS and/or Azure?
  • How many years of experience do you have in using Infrastructure-as-Code (IaC) tools like Terraform, CloudFormation, or ARM templates?
  • What is your asking salary in PHP?
  • Viber number:

Work Location: In person

Site Reliability Engineer

Pasig City, National Capital Region ₱600000 - ₱1200000 Y Asia Select, Inc. (ASI)

Posted today

Job Description

Responsibilities:

  • Ensure the client's multiple systems are operating at peak efficiency, performance and uptime.
  • Assist in providing root cause analysis of complex faults in a large distributed system, and work with multiple teams to see the issue through to resolution and improvements.
  • Participate in ongoing technology refresh initiatives or special projects as required.
  • Use best-of-breed tooling to ensure operational stability and minimize customer disruption.
  • Assist in creating metric collection and visualization tools to allow you to assist in capacity-planning and trouble-shooting, and take pre-emptive actions in support of overall system stability.
  • Contribute to monthly reporting on platform cost, capacity, incidents and performance.
  • Work with the team to deploy new releases of the client's SaaS applications to production and other environments with minimal to no impact on customers, and refine and enhance the tools used to achieve this.
  • Identify and automate tasks wherever possible to maintain or increase our high server to engineer ratio moving forward.
  • Participate in the on-call roster to ensure uptime targets are exceeded and platform-owned services are operating effectively.
  • Conduct performance and reliability tests to establish limits, bottlenecks or single points of failure and resolve them.
  • From time to time be called on to work flexible hours to complete tasks that would otherwise disrupt a great customer experience.

Qualification:

  • Experience with the key aspects of a modern SaaS platform provider and comfort in being an active participant in platform design:
      o Kubernetes (AKS or EKS advantageous)
      o Public cloud from Azure or AWS
      o Terraform or similar IaC technology
      o CI/CD via GitHub Actions, Azure DevOps, Concourse or similar
  • Proven, self-motivated ability to continually renew and expand your knowledge, and a keen desire to keep abreast of new tools and technologies.
  • Keen interest in, or experience with, operating and managing complex systems in customer-facing production web environments, including the operation and architecture of multi-tier distributed systems, ideally involving real-time event processing.
  • Practical knowledge of a scripting language (Ruby, Bash, Python, etc.)
  • Exposure to monitoring, alerting and visualisation tools (Grafana, SumoLogic, Datadog etc.)
  • The ability to help identify the right tool for the job and to identify opportunities to make your own and others' lives easier through automation.
  • Understanding of automated provisioning. Knowledge of tooling such as Chef, Ansible or Puppet advantageous.
  • Experience building or using containerisation and PaaS products advantageous.
  • Understanding of relational database systems and their operation.
  • Experience with caching, in-memory databases and NoSQL is a bonus.
  • Passion for the web operations industry. We're doing exciting things and want to work with people who share our passion and vision.

Hybrid: 3x WFO (Tuesday to Thursday), WFH (Monday and Friday)

Shift: starts at 6 AM

Working schedule: Mondays to Fridays, following Philippine holidays

Site Reliability Engineer

Pasay, Camarines Sur ₱900000 - ₱1200000 Y VESTAS SHARED SERVICE A/S

Posted today

Job Description

Are you ready to guide the development of innovative infrastructure solutions for a technology-focused entity in the renewable energy sector? We are seeking a Senior Systems Engineer committed to automation, monitoring, and asset management—someone who takes charge of what happens next and promotes continuous improvement in our digital landscape.

This is a technology leadership role (without resource management), where you will be responsible for designing, implementing, and optimizing critical Linux-based infrastructure solutions, working with large datasets, automation frameworks, and observability platforms.

Who We Are

We are Orchestration and Monitoring Fulfillment, a division within Core Infrastructure & Network, focused on delivering scalable, automated, and reliable IT solutions. Our team ensures operational excellence in system monitoring and asset management while continuously enhancing processes through automation.

As a Site Reliability Engineer, you will work closely with stakeholders and cross-functional teams, making strategic decisions and implementing solutions that will shape the future of our IT landscape.

Responsibilities

Your Role as a Site Reliability Engineer:

This is not just another engineering job; it is a chance to contribute to, design, and implement highly effective solutions while collaborating with global teams. You will be responsible for technical decision-making, infrastructure design, and automation strategies to ensure system efficiency and scalability.

Key Responsibilities

  • Monitoring & Observability
      o Design, build, and optimize monitoring solutions tailored for our environment
      o Deploy scalable, automated monitoring frameworks that enhance system visibility
      o Work with stakeholders to define monitoring needs and implement proactive alerting systems
  • Automation
      o Identify opportunities for automation to streamline operations and reduce manual efforts
      o Select effective automation tools and frameworks to support decision-making
      o Develop and maintain automation scripts and workflows to enhance system efficiency
  • Asset Management
      o Work with large datasets in our homegrown asset management environment
      o Ensure accurate tracking, reporting, and lifecycle management of IT assets
      o Optimize asset data integrity and develop solutions for better governance and control

Qualifications

  • 5-7 years of experience in a lead capacity within Linux system administration
  • Strong database management skills, backed by extensive knowledge
  • Comprehensive background in ETL applications and data integration methodologies
  • Deep understanding of automation frameworks like Foreman, Puppet, and Ansible

Competencies

  • A self-motivated, methodical, and progressive engineer who takes initiative and produces effective solutions
  • Dedicated to automation, efficiency, and data-driven decision-making
  • Someone who thrives in problem-solving and technical decision-making
  • A collaborative individual who can work across teams and engage key stakeholders
  • Dedicated to ongoing enhancements and keeping up with industry advancements