26 Devops Engineers jobs in the Philippines

Senior Site Reliability Engineer

Mandaluyong City, National Capital Region Penbrothers

Posted 25 days ago

Job Viewed

Tap Again To Close

Job Description

About Penbrothers

Penbrothers is an HR & remote talent management partner and one of the fastest-growing companies in the Philippines. We provide talented Filipinos with global opportunities in high-growth startups and dynamic companies, from the comfort of their own homes.

About the Client

The client is a pioneer in medical recruitment, is seeking an experienced Tech Lead to drive their mission to enhance doctors' well-being. This is an opportunity to contribute your unique skills and expertise to create technology that truly matters, impacting lives on a daily basis

About the Role

We are looking for a Senior SRE/DevOps Specialist to play a vital role in ensuring the reliability of our Salesforce and web/mobile application environments. You will work closely with our engineers to continually improve and enhance our platform leaning towards world class best practices. 

Service reliability and observability

  • Analysing resource utilization and forecasting capacity needs to ensure the system can handle expected traffic and workloads without performance issues.

  • Writing code and scripts to automate repetitive operational tasks, configuration management, and deployment processes to reduce human error and increase efficiency.

  • Managing changes to production systems and services, ensuring that new releases and configuration changes are rolled out with minimal disruption and risk.

  • Identifying and addressing performance bottlenecks, optimizing software and infrastructure to improve response times and reduce resource consumption.

  • Maintaining thorough documentation of systems, configurations, and incident response procedures to facilitate knowledge sharing and onboarding of new team members.

  • Defining and maintaining service level objectives that specify the acceptable level of service quality, such as uptime and latency, for a particular system or service.

  • Defining the key performance metrics and indicators that will be used to measure the system's performance and reliability, such as error rates and response times.

  • Designing and implementing monitoring systems to track the SLIs and using alerting mechanisms to notify the team when the system deviates from its defined SLOs.

Incident management & Disaster recovery planning

  • Responding to and mitigating incidents that impact service availability or performance,

  • following an incident management process, and conducting post-incident reviews to learn and improve.

  • Planning and implementing and executing disaster recovery and backup strategies to ensure data and service availability in case of failures or disasters.

Security

  • Ensure systems and infrastructure are securely configured and hardened by default

  • Manage secrets, credentials, and access controls across environments

  • Monitor for security-related events and support incident response efforts

  • Maintain secure CI/CD pipelines and enforce safe deployment practices

  • Planning and implementing disaster recovery and backup strategies to ensure data and service availability in case of failures or disasters.

Continuous Improvement

  • Continuously evaluating and improving system reliability, efficiency, cost optimization and automation to meet our evolving business needs and customer expectations.

  • Rationalizing, evaluating and integrating 3rd party developer tooling and services.

  • Troubleshooting platform issues with development teams

  • Providing tooling support and access management for development teams

  • Stay ahead of the tech curve, bringing new tools and frameworks to the table

This advertiser has chosen not to accept applicants from your region.

Sr Site Reliability Engineer (Project based)

1226 Makati City, National Capital Region iScale Solutions

Posted 2 days ago

Job Viewed

Tap Again To Close

Job Description

This is a remote position.

Core Expertise SRE Foundations & Practices Deep understanding of SRE principles  (SLIs, SLOs, error budgets, toil reduction, reliability vs. velocity trade-offs). Proven experience driving SRE adoption and culture change  across teams and applications. Strong knowledge of incident management on-call practices , and blameless postmortems . Cloud & Infrastructure 5+ years of experience with Google Cloud Platform (GCP)  services Solid expertise with Kubernetes  , including scaling, workload optimization, network policies, service mesh, and troubleshooting. Experience with infrastructure as code Reliability & Observability Strong knowledge of monitoring, logging, and tracing Proven ability to design and implement alerting strategies  aligned with SLOs/SLIs. Hands-on experience optimizing application performance, resiliency, and cost efficiency  in cloud-native environments. Automation & Tooling Proficiency in at least one modern programming language (preferably Python) for automation, reliability tooling, and operational improvements. Familiarity with CI/CD pipelines  and release engineering best practices. Expertise in automating reliability tasks , reducing toil, and scaling best practices across multiple applications. Leadership & Collaboration Ability to evangelize SRE best practices  and influence engineering/product teams in adopting them. Experience mentoring engineers  and establishing communities of practice around reliability. Strong stakeholder management skills to balance product delivery goals with reliability requirements. Excellent communication skills. Requirements Preferred Qualifications Hands-on experience migrating applications to SRE operating models  in multi-team/multi-application settings. Certification(s): Google Cloud Professional DevOps Engineer, Kubernetes CKA/CKS, or equivalent. Benefits ● Full Time Employment with competitive salary and benefits ● Medical, dental, and vision insurance coverage
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer (Supply Chain IT Operations)

Procter & Gamble

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

Job Location
Taguig City
Job Description
Information Technology (IT) at Procter & Gamble is where business, innovation and technology integrate to build a competitive advantage for P&G. Our mission is clear -- you deliver IT to help P&G win with consumers.
Do you love implementing continuous improvement in IT solutions to drive efficiency and agility in meeting constantly evolving business needs? Then this job might be for you!
As a Site Reliability Engineer, you will be instrumental in ensuring the high availability and reliability of our digital IT products in the P&G supply chain. Your primary focus will be on enhancing system performance through faster detection, response, and resolution of issues, while also implementing strategies to prevent recurrence and reduce operational toil. You will use robust Observability and Monitoring tools, automate incident response systems, and optimize IT architecture to create a resilient and reliable infrastructure.
Responsibilities:
+ Implement and lead comprehensive monitoring solutions and tools to provide real-time insights into system performance, enabling proactive incident detection and ensuring accurate, actionable alerts for prompt responses.
+ Continuously refine monitoring strategies and develop automation scripts to address recurring issues, enhancing system visibility, resource optimization, and overall efficiency.
+ Establish and maintain Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to improve service quality and reliability,
+ Collect and share data and insights from observability tools to drive continuous improvement initiatives.
+ Work closely with Software Engineers, Product Teams, and Infrastructure Teams to develop and implement initiatives that enhance IT reliability.
+ Engage with customers to understand their needs and difficulties regarding Observability and Monitoring tools, providing exceptional support in all interactions, including communications, updates, and feedback.
+ Stay updated on industry trends and effective strategies in Site Reliability Engineering while continuously enhancing technical skills in system architecture, automation, cloud technologies, and operational processes.
+ Share knowledge and mentor team members to foster a culture of learning and professional development within the team
+ Lead root cause analysis efforts and implement corrective action plans in a timely manner to achieve permanent resolutions for incidents.
+ Oversee documentation and knowledge management efforts.
Job Qualifications
Candidates must demonstrate strong leadership in the application of technical expertise to drive business results.
We are looking for candidates who possess the following core qualities:
+ A Bachelor's degree in related field such as Engineering, Information Technology and Computer Science discipline.
+ Up to 5 years of relevant experience .
+ Experience or familiarity with monitoring and observability tools (e.g., Prometheus, preferably Grafana)
+ Knowledge and familiarity in system administration, including Linux/Unix environments, cloud platforms (Azure is preferred, but AWS or GCP are acceptable)
+ Experience with configuration management tools and infrastructure-as-code frameworks (e.g., Terraform)
+ Proficiency in at least one programming language (e.g., Python, C#) and a background in scripting for automation tasks
+ Understanding of networking protocols, network infrastructures, load balancing, and DNS management
+ Familiarity with containerization and Orchestration Technologies (e.g., Docker, Kubernetes)
+ Familiarity with databases and proficiency in writing SQL queries
+ Understanding of best practices in security and experience with implementing secure systems
+ Knowledge of incident response methodologies, root cause analysis, and implementing preventive measures (ITIL and/or SRE)
+ Familiarity with ticketing systems and task management (preferably ServiceNow)
+ Problem-solving skills with ability to analyze complex issues and devise effective solutions
+ Learning agility as there will be new topics to learn and new spaces to understand
+ Communication and collaboration skills to work effectively with multi-functional teams, partners, and customers
+ Teamwork and interpersonal skills, with an ability to build relationships and work effectively in a collaborative environment
+ Operational excellence / execution skills as the work requires discipline
Preferred Skills:
+ Understanding or experience in Supply Chain applications and processes, documents or general data flow to understand impact of unplanned IT downtimes and impact of IT changes to business operations
About us
We produce globally recognized brands, and we grow the best business leaders in the industry. With a portfolio of trusted brands as diverse as ours, it is paramount our leaders are able to lead with courage the vast array of brands, categories and functions. We serve consumers around the world with one of the strongest portfolios of trusted, quality, leadership brands, including Always®, Ariel®, Gillette®, Head & Shoulders®, Herbal Essences®, Oral-B®, Pampers®, Pantene®, Tampax® and more. Our community includes operations in approximately 70 countries worldwide.
Visit to know more.
We are an equal opportunity employer and value diversity at our company. We do not discriminate against individuals on the basis of race, color, gender, age, national origin, religion, sexual orientation, gender identity or expression, marital status, citizenship, disability, HIV/AIDS status, or any other legally protected factor.
Job Schedule
Full time
Job Number
R
Job Segmentation
Experienced Professionals (Job Segmentation)
This advertiser has chosen not to accept applicants from your region.

AWS Cloud Engineer

IBM

Posted 10 days ago

Job Viewed

Tap Again To Close

Job Description

**Introduction**
A career in IBM Consulting is rooted by long-term relationships and close collaboration with clients across the globe.
You'll work with visionaries across multiple industries to improve the hybrid cloud and AI journey for the most innovative and valuable companies in the world. Your ability to accelerate impact and make meaningful change for your clients is enabled by our strategic partner ecosystem and our robust technology platforms across the IBM portfolio; including Software and Red Hat.
Curiosity and a constant quest for knowledge serve as the foundation to success in IBM Consulting. In your role, you'll be encouraged to challenge the norm, investigate ideas outside of your role, and come up with creative solutions resulting in ground breaking impact for a wide network of clients. Our culture of evolution and empathy centers on long-term career growth and development opportunities in an environment that embraces your unique skills and experience.
**Your role and responsibilities**
Infrastructure Management:
* Deploy, configure, and manage cloud resources using infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
* Monitor and maintain cloud infrastructure to ensure optimal performance, availability, and security.
* Implement automation and orchestration to streamline deployment and scaling processes.
Cloud Services Administration:
* Manage and administer various cloud services, including compute instances, storage solutions, databases, networking components, and serverless offerings hosted on AWS and Azure.
* Optimize resource utilizations to ensure efficient cloud service delivery and cost management.
Monitoring and Incident Response:
* Set up monitoring and alerting mechanisms to promptly identify and address performance issues and security vulnerabilities.
* Collaborate with the incident response team to troubleshoot and resolve operational incidents, ensuring minimal service disruption.
Security and Compliance:
* Implement security best practices to protect cloud environments from potential threats and vulnerabilities.
* Assist in conducting security audits and assessments to maintain compliance with industry regulations and standards.
Collaboration and Documentation:
* Collaborate with cross-functional teams, including DevOps, development, and security, to understand requirements and align cloud infrastructure accordingly.
* Maintain comprehensive documentation related to cloud infrastructure, processes, and procedures.
Continuous Improvement:
* Stay up to date with industry trends and advancements in cloud technologies.
* Identify opportunities for automation, optimization, and process improvement to enhance cloud operations.
**Required technical and professional expertise**
* Bachelor's degree in computer science, Information Technology, or a related field (or equivalent experience).
* Has working experience in cloud operations, managing cloud-based infrastructure, and deploying cloud services.
* Proficiency in cloud platforms such as Amazon Web Services (AWS) and/or Microsoft Azure.
* Strong understanding of infrastructure-as-code principles and tools.
* Familiarity with scripting and automation using languages like Python, PowerShell, Bash, etc.
* Experience with monitoring and log aggregation tools for proactive issue detection.
* Knowledge of security best practices and experience implementing security controls in cloud environments.
* Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
* Certification in AWS Cloud Practitioner, AWS Architect is a plus
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
This advertiser has chosen not to accept applicants from your region.

Senior Cloud Reliability Engineer - Makati City

Makati City, National Capital Region Avaloq

Posted 6 days ago

Job Viewed

Tap Again To Close

Job Description

Senior Cloud Reliability Engineer - Makati City  Company Description

Founded and headquartered in Switzerland, Avaloq is continuously expanding its global footprint with around 2,500 colleagues in 12 countries, and more than 170 clients in 35 countries. We are an industry-leading provider of wealth management technology and services for financial institutions around the world, including private banks and wealth managers, investment managers, as well as retail and neo banks. Our research led approach and continual innovation is powered by the passion and creativity of our colleagues.
We are always looking for talented people to join us on our mission to orchestrate the financial ecosystem and democratize access to wealth management. Avaloq offers the opportunity to work closely with some of the world’s leading financial institutions as we jointly develop and shape careers. Championing a collaborative, supportive and flexible work environment empowers our colleagues to reach their full potential.

 Job Description

Your key tasks

  • Build & run our cloud SaaS environments by monitoring availability, optimizing system performance and taking a holistic view of system health
  • Provide primary operational support and engineering for multiple large distributed software applications
  • Build software and systems to manage and automate platform infrastructure and applications
  • Balance feature development speed and reliability with well-defined service level objectives
  • Improve reliability, quality, and time-to-market of our suite of software as a service solutions
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning to create sustainable systems and services through automation and uplifts
 Qualifications
  • University degree in Computer Science or related discipline
  • Solid experience in implementing public cloud concepts and models
  • Programming experience with at least one modern language such as Python, Ruby, Go or Java including object-oriented design
  • Solid experience with Infrastructure as Code, e.g. Terraform and using it with a cloud provider such as AWS, AZURE or OCI.
  • Gitops experience, using git as daily activity (Github or Gitlab or Bitbucket)
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
  • Drive to standardize, streamline, and automate processes
  • Experience with Observability (e.g. Cloudwatch, Prometheus / Grafana, Kibana, Elastic Search)

It would be a real bonus if you have 

  • Expert of cloud SaaS security best practices
  • Configuration management tooling (e.g. Puppet, Ansible)
  • Github actions, experience using AI assitance, AI prompt engineering with terraform.
 Additional Information

We realize that managing work life balance is a challenge we all face in our daily lives and in order to support with this we are pleased to offer hybrid and flexible working for most of our Avaloqers to maintain work life balance and still continue our fantastic Avaloq culture in our global offices. 

In Avaloq we are proud to embrace diversity and understand the success of our business is built on the power of different opinions, we are whole heartedly committed to fostering an equal opportunity environment and inclusive culture where you can be your true authentic self. 

We hire, compensate and promote regardless of origin, age, gender identity, sexual orientation or any other fantastic traits that make us all unique, we have done our best to write this advert in an inclusive and neutral way. 

Please be aware that we will not accept speculative CV submissions for any of our roles from recruitment agencies, and any unsolicited candidate submissions will be exempt from any payment expectations.  

#LI-Hybrid

This advertiser has chosen not to accept applicants from your region.

Cloud Tools Engineer

Manila, Metropolitan Manila RELX INC

Posted 3 days ago

Job Viewed

Tap Again To Close

Job Description

The role will be responsible in designing and building Azure DevOps pipelines and automated DevOps processes to empower our product development teams to rapidly deliver applications to our customers. The role is also expected to drive execution and delivery of projects through the use of Agile methodologies.
Accountabilities:
+ Tooling Management: Manage and maintain the complete DevOps toolchain hosted on AWS, including Jenkins, Artifactory, SonarQube, Prisma Cloud, and Azure DevOps.
+ Infrastructure as Code (IaC): Develop, maintain, and enhance infrastructure automation using tools like Terraform and CloudFormation.
+ Security and Compliance: Implement security best practices, maintain compliance with industry standards, and perform regular security assessments of the tooling infrastructure.
+ Performance Optimization: Continuously monitor and optimize the performance of the tooling to ensure efficient software delivery.
+ Integration and Scaling: Integrate tools with other systems, such as source control, containerization, and monitoring tools. Ensure the infrastructure can scale to meet growing demand.
+ Documentation: Maintain comprehensive documentation for the tooling infrastructure, configuration, and procedures.
+ Troubleshooting and Support: Address and resolve issues related to the tools in a timely manner, providing support to development and operations teams.
+ Team Collaboration: Collaborate with DevOps, development, and IT teams to improve processes and practices
+ Facilitate Agile ceremonies (daily stand-ups, sprint planning, retrospectives, etc.) for multiple teams.
+ Partner with DevOps, Observability, and Cloud Infrastructure teams to plan and track deliverables, ensuring alignment with business priorities.
+ Remove blockers, drive continuous improvement, and promote a culture of Agile best practices.
+ Maintain and enhance project roadmaps, backlogs, and sprint boards using tools like Jira, Confluence, or similar platforms.
+ Collaborate with engineering, security, and business teams to define project scope, milestones, and success criteria.
+ Drive cross-functional communication and ensure transparency on project status, risks, and dependencies.
+ Optimize team velocity by proactively identifying inefficiencies and implementing best practices.
+ Champion DevOps and SRE principles, helping teams enhance automation, monitoring, and cloud infrastructure reliability.
+ Track key performance
Qualifications:
+ Bachelor's degree in Computer Science, Information Technology, or a related field.
+ Proven experience in managing DevOps tooling, particularly Jenkins, Artifactory, SonarQube, Prisma Cloud, and Azure AD.
+ AWS certification or equivalent experience in cloud infrastructure management.
+ Proficiency in Infrastructure as Code (IaC) using tools like Terraform or CloudFormation.
+ Strong knowledge of security best practices and experience in maintaining secure tooling environments.
+ Solid experience with scripting languages like Bash, Python, or PowerShell.
+ Familiarity with containerization technologies (e.g., Docker, Kubernetes).
+ Excellent problem-solving and troubleshooting skills.
+ Strong communication and collaboration skills for working with cross-functional teams.
+ DevOps and Agile methodologies understanding.
+ Familiarity with source control systems (e.g., Git, Bitbucket) and build tools.
+ Strong understanding of Agile methodologies (Scrum, Kanban, SAFe, etc.) and hands-on experience in scaling Agile practices.
+ Proficiency in tools like Jira, Confluence, Azure DevOps, or similar project management tools.
+ Strong leadership and facilitation skills with the ability to influence without authority.
+ Excellent communication skills to interact with technical and non-technical stakeholders.
+ Ability to manage multiple projects in a fast-paced, evolving environment.
Nice-to-Haves:
+ Relevant certifications (e.g., AWS Certified DevOps Engineer, Jenkins Certified Engineer).
+ Experience with other DevOps tools like GitLab CI/CD, CircleCI, or Travis CI.
+ Knowledge of continuous testing and continuous deployment best practices.
+ Certification in CSM, PMI-ACP, SAFe, or equivalent Agile frameworks is a plus.
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact .
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here .
Please read our Candidate Privacy Policy .
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers:
EEO Know Your Rights .
RELX is a global provider of information-based analytics and decision tools for professional and business customers, enabling them to make better decisions, get better results and be more productive.
Our purpose is to benefit society by developing products that help researchers advance scientific knowledge; doctors and nurses improve the lives of patients; lawyers promote the rule of law and achieve justice and fair results for their clients; businesses and governments prevent fraud; consumers access financial services and get fair prices on insurance; and customers learn about markets and complete transactions.
Our purpose guides our actions beyond the products that we develop. It defines us as a company. Every day across RELX our employees are inspired to undertake initiatives that make unique contributions to society and the communities in which we operate.
This advertiser has chosen not to accept applicants from your region.

Cloud Security Engineer

IBM

Posted 11 days ago

Job Viewed

Tap Again To Close

Job Description

**Introduction**
We are seeking a highly skilled and experienced Cloud Security Engineers to support the implementation, tuning, and maintenance of cloud security platforms (CSPM, CWPP, SSPM). This role focuses on engineering tasks, posture policy management, reporting, and platform operations.
**Your role and responsibilities**
Key Responsibilities
As a Cloud Security Engineer, you will play a crucial role in assisting in implementing, tuning, and optimizing Prisma Cloud CSPM policies across AWS, Azure, and GCP. Perform baseline and periodic posture assessments to identify configuration drift and highlight risky assets. Collect, parse, and analyze Prisma Cloud audit logs from cloud workloads to detect misconfigurations and threats. Support onboarding and configuration of the SSPM platform for key SaaS applications used by the client. Assist in defining and tuning posture policies and compliance baselines across CSPM, CWPP, and SSPM.
You'll also maintain and update SOPs, RQL templates, and operational documentation. Support monthly knowledge transfer sessions and cloud security framework awareness activities. Administer and monitor CSPM (Cloud Security Posture Management), CWP (Cloud Workload Protection), and SSPM (SaaS Security Posture Management) platforms. Perform cloud misconfiguration analysis, vulnerability detection, and incident triage from cloud-native and third-party security tools.
You will be responsible in automating policy enforcement and remediation scripts in coordination with DevOps/CloudOps. Integrate tools into SIEM, SOAR, ITSM, and CI/CD pipelines as required. Participate in onboarding cloud accounts/projects into security tooling and ensure correct tagging, coverage, and visibility. Provide operational metrics, dashboards, and reporting to stakeholders. Collaborate with cloud architects and app teams to provide security reviews and technical remediation guidance. Assist in the implementation of SOPs for platform and incident management of CSPM/CWP/SSPM. Ensure CSPM alert integration into the client's SIEM and ITSM systems, mapped to SOC workflows.
**Required technical and professional expertise**
Technical Requirements:
* 5+ years in cloud security consulting, architecture, or posture management.
* Proven and extensive experience with Prisma Cloud (CSPM/CWPP) and SSPM platforms.
* Hands-on experience integrating alerts into SIEM/SOAR tools like Google SecOps.
* Familiar with cloud-native and hybrid environment architecture in AWS, Azure, or GCP
* Familiar with compliance frameworks: NIST CSF, CIS, GDPR, PCI DSS.
* Experience integrating alerts and posture signals into SIEM/ITSM (e.g., Chronicle, Splunk, ServiceNow)
Soft Skills:
* Strong analytical and problem-solving abilities with keen attention to detail.
* Excellent communication and collaboration skills, with the ability to interact effectively with stakeholders at all levels.
* Capable of managing multiple priorities in a fast-paced, dynamic environment.
**Preferred technical and professional experience**
Certifications: CCSP, GCSA, CISSP, CRISC, CISA, AWS/Azure/GCP Security Specialty ( or any cloud platform-specific certs), Prisma Cloud Certification (e.g., Palo Alto Networks Certified Cloud Security Engineer-PCCSE), Google Cybersecurity Professional Certificate or SIEM-specific trainings (e.g., Chronicle)
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About the latest Devops engineers Jobs in Philippines !

Cloud Observability Engineer

Manila, Metropolitan Manila RELX INC

Posted 13 days ago

Job Viewed

Tap Again To Close

Job Description

The Observability Engineer will play a critical role in designing, implementing, and managing our observability solutions. This role involves collaborating with development, DevOps, and operations teams to ensure high availability, performance, and reliability of our applications and services.
Accountabilities:
+ Observability and Monitoring:
+ Develop and maintain observability solutions using tools like Datadog, Splunk, New Relic, AWS CloudWatch, and Azure Application Insights.
+ Instrument applications and infrastructure to provide comprehensive monitoring and alerting.
+ Create and manage dashboards, alerts, and reports to support operational needs.
+ Infrastructure as Code (IaC):
+ Implement infrastructure as code (IaC) using CloudFormation and Terraform.
+ Ensure that observability tools and configurations are managed through IaC for consistency and reliability.
+ Collaboration and Support:
+ Work closely with development, DevOps, and operations teams to ensure seamless integration of observability solutions.
+ Participate in root cause analysis and incident management to quickly resolve issues and improve system reliability.
+ Provide guidance and mentorship to junior engineers.
+ Documentation and Process Improvement:
+ Document observability configurations, procedures, and best practices.
+ Continuously evaluate and improve existing processes and tools to enhance efficiency and effectiveness.
Qualifications:
+ Bachelor's degree in Engineering, Computer Science.
+ Extensive experience in application observability and performance management.
+ Proficiency with observability tools such as Datadog, Splunk, New Relic, AWS CloudWatch, and Azure Application Insights.
+ Strong expertise in AWS and Azure cloud environments.
+ Hands-on experience with infrastructure as code (IaC) using CloudFormation and Terraform.
+ Proven track record in working with operational teams and managing processes.
+ Excellent troubleshooting skills and the ability to implement effective solutions.
+ Strong communication skills and the ability to convey information clearly to diverse audiences.
+ Exceptional organizational skills and the ability to manage multiple priorities
LexisNexis, a division of RELX, is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form: , or please contact .
Please read our Candidate Privacy Policy ( .
RELX is a global provider of information-based analytics and decision tools for professional and business customers, enabling them to make better decisions, get better results and be more productive.
Our purpose is to benefit society by developing products that help researchers advance scientific knowledge; doctors and nurses improve the lives of patients; lawyers promote the rule of law and achieve justice and fair results for their clients; businesses and governments prevent fraud; consumers access financial services and get fair prices on insurance; and customers learn about markets and complete transactions.
Our purpose guides our actions beyond the products that we develop. It defines us as a company. Every day across RELX our employees are inspired to undertake initiatives that make unique contributions to society and the communities in which we operate.
This advertiser has chosen not to accept applicants from your region.

Cloud Systems Engineer

Iloilo, Iloilo RELX INC

Posted 15 days ago

Job Viewed

Tap Again To Close

Job Description

The Cloud Systems Engineer is accountable for the delivery of LexisNexis Legal & Professional services by supporting application portfolio and services within the New Lexis Platform. This includes Lexis+ AI US, Lexis US, Lexis+ US, Lexis+ Canada, Lexis+ UK, Lexis+ South Africa, Lexis+ APAC, gNS, Decisis and other cloud-based products. The Cloud Systems Engineer is primarily responsible in ensuring service availability, quality, performance and cost effectiveness.
Accountabilities:
+ Responsible in working closely with All other teams across the Platform to lead in incident management in Production including Certification and Development environments to optimize performance and security on our infrastructure. Responsible for both maintaining site reliability and Service deployments of Business products. Facilitate service monitoring, application upgrades, building infrastructure enhancements, and managing ongoing tasks.
+ Create processes designed to measure system effectiveness and identify areas for improvement. Stay abreast of new technologies in the field and provide recommendations to organizational management on new solutions. Oversee the selection of orchestration tooling, as well as compliance audits and reporting. May be responsible for identifying, correcting, and enhancing important software tools. Seek ways to enhance systems operations, with a focus on automation and minimizing cost.
+ 24/7/365 Site Operations supporting New Lexis Platform Applications (Cloud and On-Premise) including upcoming Services and Project that will be migrated to NLP.
+ Primary on call for any New Lexis Platform related incidents (PROD, CERT and DEV Environments)
+ Service Restoration
+ Real-time and proactive monitoring of logs and application performance
+ System Administration and Operations in Production
+ Incident and Change Management, Service Catalog / Service Task Fulfillment
+ Responsible for Problem Management - Problem identification, predictability, prevention and detection that will help to improve Availability and Reliability of the application
+ Status Reporting
+ Build / Bake / Deployments and Releases in CERT and PROD Environment for New Lexis Platform Applications and Services in Minor, Major and Emergency Releases
+ Adaptable in fast phase changing environment and new technologies.
+ Responsible for installation, maintenance, security, performance and tuning of New Lexis Platform and related software and services.
+ Ensures close technical interchanges on Platform related issues with SRE, Application Developers, Shared Services and Operations personnel as necessary.
+ Recommends and aids in the definition of New Lexis Platform strategies, policies, standards, and procedures which are consistent with the Company mission.
+ Actively participates and often leads team meetings and activities.
+ Responsible for research, risk assessment, design and validation of emerging and/or improved infrastructure technologies and services related to New Lexis Platform management.
+ Follow security guidelines for the proper delegation of accounts and privileges.
+ Participate in Continuous Improvements initiatives using Lean Six Sigma methodologies
+ Actively participates in team meetings and activities.
+ Build a solid, positive relationship with development, peers, colleagues and vendors.
+ 3Rs - Respond, React, Resolve
+ Other duties as assigned.
Qualifications:
+ A bachelor's degree (Information Systems, Computer Science/Engineering)
+ Minimum 2-3 year experience in IT industry or related field
+ Must have an experience in supporting huge cloud infrastructure (numerous cloud-hosted applications and servers)
+ Knowledgeable in Cloud concepts and technologies (AWS, Kubernetes and Azure).
+ Familiar in Cloud application performance monitoring (Splunk, New Relic, Datadog and alike)
+ Knowledgeable in Continuous Integration / Continuous Delivery - CI/CD tools (Jenkins, Artifactory and alike)
+ Knowledge and understanding of basic Unix/Linux, Windows server operating systems
+ Understanding of basic JAVA middleware and web server concepts and technologies.
+ Knowledgeable in network concepts, configuration, and routing.
+ Knowledge and understanding of database concepts including basic troubleshooting, high availability, clustering and disaster recovery, including no-SQL database.
+ Knowledgeable with the management of virtual machines hosted on VMware ESX.
+ Knowledgeable in using IT Service Management systems (ServiceNow, Freshservice and the likes)
+ Basic Unix/Linux, Windows, Database and Middleware troubleshooting, and analysis required.
+ Solid interpersonal, proactive, teamwork, communication and follow up skills (verbal and written) required with different levels of hierarchy.
+ Ability to monitor, define, analyze and resolve issues both effectively and efficiently in a high-pressure production environment.
+ Preferred candidates whose location is in close proximity to REPH office or satellite offices but will also consider candidates with backup power supply (able to power up laptop for 8-10h hours) and internet connection (~ 20-30 Mbps)
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact .
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here .
Please read our Candidate Privacy Policy .
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers:
EEO Know Your Rights .
RELX is a global provider of information-based analytics and decision tools for professional and business customers, enabling them to make better decisions, get better results and be more productive.
Our purpose is to benefit society by developing products that help researchers advance scientific knowledge; doctors and nurses improve the lives of patients; lawyers promote the rule of law and achieve justice and fair results for their clients; businesses and governments prevent fraud; consumers access financial services and get fair prices on insurance; and customers learn about markets and complete transactions.
Our purpose guides our actions beyond the products that we develop. It defines us as a company. Every day across RELX our employees are inspired to undertake initiatives that make unique contributions to society and the communities in which we operate.
This advertiser has chosen not to accept applicants from your region.

Cloud Systems Engineer

Manila, Metropolitan Manila RELX INC

Posted 15 days ago

Job Viewed

Tap Again To Close

Job Description

The Cloud Systems Engineer is accountable for the delivery of LexisNexis Legal & Professional services by supporting application portfolio and services within the New Lexis Platform. This includes Lexis+ AI US, Lexis US, Lexis+ US, Lexis+ Canada, Lexis+ UK, Lexis+ South Africa, Lexis+ APAC, gNS, Decisis and other cloud-based products. The Cloud Systems Engineer is primarily responsible in ensuring service availability, quality, performance and cost effectiveness.
Accountabilities:
+ Responsible in working closely with All other teams across the Platform to lead in incident management in Production including Certification and Development environments to optimize performance and security on our infrastructure. Responsible for both maintaining site reliability and Service deployments of Business products. Facilitate service monitoring, application upgrades, building infrastructure enhancements, and managing ongoing tasks.
+ Create processes designed to measure system effectiveness and identify areas for improvement. Stay abreast of new technologies in the field and provide recommendations to organizational management on new solutions. Oversee the selection of orchestration tooling, as well as compliance audits and reporting. May be responsible for identifying, correcting, and enhancing important software tools. Seek ways to enhance systems operations, with a focus on automation and minimizing cost.
+ 24/7/365 Site Operations supporting New Lexis Platform Applications (Cloud and On-Premise) including upcoming Services and Project that will be migrated to NLP.
+ Primary on call for any New Lexis Platform related incidents (PROD, CERT and DEV Environments)
+ Service Restoration
+ Real-time and proactive monitoring of logs and application performance
+ System Administration and Operations in Production
+ Incident and Change Management, Service Catalog / Service Task Fulfillment
+ Responsible for Problem Management - Problem identification, predictability, prevention and detection that will help to improve Availability and Reliability of the application
+ Status Reporting
+ Build / Bake / Deployments and Releases in CERT and PROD Environment for New Lexis Platform Applications and Services in Minor, Major and Emergency Releases
+ Adaptable in fast phase changing environment and new technologies.
+ Responsible for installation, maintenance, security, performance and tuning of New Lexis Platform and related software and services.
+ Ensures close technical interchanges on Platform related issues with SRE, Application Developers, Shared Services and Operations personnel as necessary.
+ Recommends and aids in the definition of New Lexis Platform strategies, policies, standards, and procedures which are consistent with the Company mission.
+ Actively participates and often leads team meetings and activities.
+ Responsible for research, risk assessment, design and validation of emerging and/or improved infrastructure technologies and services related to New Lexis Platform management.
+ Follow security guidelines for the proper delegation of accounts and privileges.
+ Participate in Continuous Improvements initiatives using Lean Six Sigma methodologies
+ Actively participates in team meetings and activities.
+ Build a solid, positive relationship with development, peers, colleagues and vendors.
+ 3Rs - Respond, React, Resolve
+ Other duties as assigned.
Qualifications:
+ A bachelor's degree (Information Systems, Computer Science/Engineering)
+ Minimum 2-3 year experience in IT industry or related field
+ Must have an experience in supporting huge cloud infrastructure (numerous cloud-hosted applications and servers)
+ Knowledgeable in Cloud concepts and technologies (AWS, Kubernetes and Azure).
+ Familiar in Cloud application performance monitoring (Splunk, New Relic, Datadog and alike)
+ Knowledgeable in Continuous Integration / Continuous Delivery - CI/CD tools (Jenkins, Artifactory and alike)
+ Knowledge and understanding of basic Unix/Linux, Windows server operating systems
+ Understanding of basic JAVA middleware and web server concepts and technologies.
+ Knowledgeable in network concepts, configuration, and routing.
+ Knowledge and understanding of database concepts including basic troubleshooting, high availability, clustering and disaster recovery, including no-SQL database.
+ Knowledgeable with the management of virtual machines hosted on VMware ESX.
+ Knowledgeable in using IT Service Management systems (ServiceNow, Freshservice and the likes)
+ Basic Unix/Linux, Windows, Database and Middleware troubleshooting, and analysis required.
+ Solid interpersonal, proactive, teamwork, communication and follow up skills (verbal and written) required with different levels of hierarchy.
+ Ability to monitor, define, analyze and resolve issues both effectively and efficiently in a high-pressure production environment.
+ Preferred candidates whose location is in close proximity to REPH office or satellite offices but will also consider candidates with backup power supply (able to power up laptop for 8-10h hours) and internet connection (~ 20-30 Mbps)
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact .
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here .
Please read our Candidate Privacy Policy .
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers:
EEO Know Your Rights .
RELX is a global provider of information-based analytics and decision tools for professional and business customers, enabling them to make better decisions, get better results and be more productive.
Our purpose is to benefit society by developing products that help researchers advance scientific knowledge; doctors and nurses improve the lives of patients; lawyers promote the rule of law and achieve justice and fair results for their clients; businesses and governments prevent fraud; consumers access financial services and get fair prices on insurance; and customers learn about markets and complete transactions.
Our purpose guides our actions beyond the products that we develop. It defines us as a company. Every day across RELX our employees are inspired to undertake initiatives that make unique contributions to society and the communities in which we operate.
This advertiser has chosen not to accept applicants from your region.
 

Nearby Locations

Other Jobs Near Me

Industry

  1. request_quote Accounting
  2. work Administrative
  3. eco Agriculture Forestry
  4. smart_toy AI & Emerging Technologies
  5. school Apprenticeships & Trainee
  6. apartment Architecture
  7. palette Arts & Entertainment
  8. directions_car Automotive
  9. flight_takeoff Aviation
  10. account_balance Banking & Finance
  11. local_florist Beauty & Wellness
  12. restaurant Catering
  13. volunteer_activism Charity & Voluntary
  14. science Chemical Engineering
  15. child_friendly Childcare
  16. foundation Civil Engineering
  17. clean_hands Cleaning & Sanitation
  18. diversity_3 Community & Social Care
  19. construction Construction
  20. brush Creative & Digital
  21. currency_bitcoin Crypto & Blockchain
  22. support_agent Customer Service & Helpdesk
  23. medical_services Dental
  24. medical_services Driving & Transport
  25. medical_services E Commerce & Social Media
  26. school Education & Teaching
  27. electrical_services Electrical Engineering
  28. bolt Energy
  29. local_mall Fmcg
  30. gavel Government & Non Profit
  31. emoji_events Graduate
  32. health_and_safety Healthcare
  33. beach_access Hospitality & Tourism
  34. groups Human Resources
  35. precision_manufacturing Industrial Engineering
  36. security Information Security
  37. handyman Installation & Maintenance
  38. policy Insurance
  39. code IT & Software
  40. gavel Legal
  41. sports_soccer Leisure & Sports
  42. inventory_2 Logistics & Warehousing
  43. supervisor_account Management
  44. supervisor_account Management Consultancy
  45. supervisor_account Manufacturing & Production
  46. campaign Marketing
  47. build Mechanical Engineering
  48. perm_media Media & PR
  49. local_hospital Medical
  50. local_hospital Military & Public Safety
  51. local_hospital Mining
  52. medical_services Nursing
  53. local_gas_station Oil & Gas
  54. biotech Pharmaceutical
  55. checklist_rtl Project Management
  56. shopping_bag Purchasing
  57. home_work Real Estate
  58. person_search Recruitment Consultancy
  59. store Retail
  60. point_of_sale Sales
  61. science Scientific Research & Development
  62. wifi Telecoms
  63. psychology Therapy
  64. pets Veterinary
View All Devops Engineers Jobs