86 Site Reliability jobs in the Philippines
Site Reliability Engineer
Posted 23 days ago
Job Viewed
Job Description
▪ In-depth knowledge of CI/CD pipelines, automation tools, and infrastructure management. br>▪ Proficiency in scripting languages (e.g., Python, Bash) and automation frameworks. < r>▪ Familiarity with cloud computing platforms (e.g., AWS, Azure, Google Cloud). < r>▪ Strong problem-solving and analytical abilities. < r>▪ Detail-oriented and proactive mindset. < r>▪ Effective communication and collaboration skills
Senior Site Reliability Engineer
Posted 14 days ago
Job Viewed
Job Description
About Penbrothers
Penbrothers is an HR & remote talent management partner and one of the fastest-growing companies in the Philippines. We provide talented Filipinos with global opportunities in high-growth startups and dynamic companies, from the comfort of their own homes.
About the Client
The client is a pioneer in medical recruitment, is seeking an experienced Tech Lead to drive their mission to enhance doctors' well-being. This is an opportunity to contribute your unique skills and expertise to create technology that truly matters, impacting lives on a daily basis
About the Role
We are looking for a Senior SRE/DevOps Specialist to play a vital role in ensuring the reliability of our Salesforce and web/mobile application environments. You will work closely with our engineers to continually improve and enhance our platform leaning towards world class best practices.
Service reliability and observability
Analysing resource utilization and forecasting capacity needs to ensure the system can handle expected traffic and workloads without performance issues.
Writing code and scripts to automate repetitive operational tasks, configuration management, and deployment processes to reduce human error and increase efficiency.
Managing changes to production systems and services, ensuring that new releases and configuration changes are rolled out with minimal disruption and risk.
Identifying and addressing performance bottlenecks, optimizing software and infrastructure to improve response times and reduce resource consumption.
Maintaining thorough documentation of systems, configurations, and incident response procedures to facilitate knowledge sharing and onboarding of new team members.
Defining and maintaining service level objectives that specify the acceptable level of service quality, such as uptime and latency, for a particular system or service.
Defining the key performance metrics and indicators that will be used to measure the system's performance and reliability, such as error rates and response times.
Designing and implementing monitoring systems to track the SLIs and using alerting mechanisms to notify the team when the system deviates from its defined SLOs.
Incident management & Disaster recovery planning
Responding to and mitigating incidents that impact service availability or performance,
following an incident management process, and conducting post-incident reviews to learn and improve.
Planning and implementing and executing disaster recovery and backup strategies to ensure data and service availability in case of failures or disasters.
Security
Ensure systems and infrastructure are securely configured and hardened by default
Manage secrets, credentials, and access controls across environments
Monitor for security-related events and support incident response efforts
Maintain secure CI/CD pipelines and enforce safe deployment practices
Planning and implementing disaster recovery and backup strategies to ensure data and service availability in case of failures or disasters.
Continuous Improvement
Continuously evaluating and improving system reliability, efficiency, cost optimization and automation to meet our evolving business needs and customer expectations.
Rationalizing, evaluating and integrating 3rd party developer tooling and services.
Troubleshooting platform issues with development teams
Providing tooling support and access management for development teams
Stay ahead of the tech curve, bringing new tools and frameworks to the table
Site Reliability Engineer (Supply Chain IT Operations)
Posted 22 days ago
Job Viewed
Job Description
Taguig City
Job Description
Information Technology (IT) at Procter & Gamble is where business, innovation and technology integrate to build a competitive advantage for P&G. Our mission is clear -- you deliver IT to help P&G win with consumers.
Do you love implementing continuous improvement in IT solutions to drive efficiency and agility in meeting constantly evolving business needs? Then this job might be for you!
As a Site Reliability Engineer, you will be instrumental in ensuring the high availability and reliability of our digital IT products in the P&G supply chain. Your primary focus will be on enhancing system performance through faster detection, response, and resolution of issues, while also implementing strategies to prevent recurrence and reduce operational toil. You will use robust Observability and Monitoring tools, automate incident response systems, and optimize IT architecture to create a resilient and reliable infrastructure.
Responsibilities:
+ Implement and lead comprehensive monitoring solutions and tools to provide real-time insights into system performance, enabling proactive incident detection and ensuring accurate, actionable alerts for prompt responses.
+ Continuously refine monitoring strategies and develop automation scripts to address recurring issues, enhancing system visibility, resource optimization, and overall efficiency.
+ Establish and maintain Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to improve service quality and reliability,
+ Collect and share data and insights from observability tools to drive continuous improvement initiatives.
+ Work closely with Software Engineers, Product Teams, and Infrastructure Teams to develop and implement initiatives that enhance IT reliability.
+ Engage with customers to understand their needs and difficulties regarding Observability and Monitoring tools, providing exceptional support in all interactions, including communications, updates, and feedback.
+ Stay updated on industry trends and effective strategies in Site Reliability Engineering while continuously enhancing technical skills in system architecture, automation, cloud technologies, and operational processes.
+ Share knowledge and mentor team members to foster a culture of learning and professional development within the team
+ Lead root cause analysis efforts and implement corrective action plans in a timely manner to achieve permanent resolutions for incidents.
+ Oversee documentation and knowledge management efforts.
Job Qualifications
Candidates must demonstrate strong leadership in the application of technical expertise to drive business results.
We are looking for candidates who possess the following core qualities:
+ A Bachelor's degree in related field such as Engineering, Information Technology and Computer Science discipline.
+ Up to 5 years of relevant experience .
+ Experience or familiarity with monitoring and observability tools (e.g., Prometheus, preferably Grafana)
+ Knowledge and familiarity in system administration, including Linux/Unix environments, cloud platforms (Azure is preferred, but AWS or GCP are acceptable)
+ Experience with configuration management tools and infrastructure-as-code frameworks (e.g., Terraform)
+ Proficiency in at least one programming language (e.g., Python, C#) and a background in scripting for automation tasks
+ Understanding of networking protocols, network infrastructures, load balancing, and DNS management
+ Familiarity with containerization and Orchestration Technologies (e.g., Docker, Kubernetes)
+ Familiarity with databases and proficiency in writing SQL queries
+ Understanding of best practices in security and experience with implementing secure systems
+ Knowledge of incident response methodologies, root cause analysis, and implementing preventive measures (ITIL and/or SRE)
+ Familiarity with ticketing systems and task management (preferably ServiceNow)
+ Problem-solving skills with ability to analyze complex issues and devise effective solutions
+ Learning agility as there will be new topics to learn and new spaces to understand
+ Communication and collaboration skills to work effectively with multi-functional teams, partners, and customers
+ Teamwork and interpersonal skills, with an ability to build relationships and work effectively in a collaborative environment
+ Operational excellence / execution skills as the work requires discipline
Preferred Skills:
+ Understanding or experience in Supply Chain applications and processes, documents or general data flow to understand impact of unplanned IT downtimes and impact of IT changes to business operations
About us
We produce globally recognized brands, and we grow the best business leaders in the industry. With a portfolio of trusted brands as diverse as ours, it is paramount our leaders are able to lead with courage the vast array of brands, categories and functions. We serve consumers around the world with one of the strongest portfolios of trusted, quality, leadership brands, including Always®, Ariel®, Gillette®, Head & Shoulders®, Herbal Essences®, Oral-B®, Pampers®, Pantene®, Tampax® and more. Our community includes operations in approximately 70 countries worldwide.
Visit to know more.
We are an equal opportunity employer and value diversity at our company. We do not discriminate against individuals on the basis of race, color, gender, age, national origin, religion, sexual orientation, gender identity or expression, marital status, citizenship, disability, HIV/AIDS status, or any other legally protected factor.
Job Schedule
Full time
Job Number
R000136489
Job Segmentation
Experienced Professionals (Job Segmentation)
System Administrator
Posted 6 days ago
Job Viewed
Job Description
br>We are looking for a highly organized and tech-savvy System Administrator to join our remote team. The successful candidate will provide day-to-day support and administration of Microsoft Dynamics D365 and other third-party systems. This role also involves order processing, inventory management, and creating process documentation to support business operations. We are seeking someone who can start immediately.
Key Responsibilities
Provide operational and functional support for Microsoft Dynamics 365 (D365) and other related software systems
Assist in Sales Order and Purchase Order creation and posting
Process warehouse transactions including inbound/outbound and inventory adjustments
Manage inventory control within the system
Handle Sales Credit/Adjustment Notes creation and posting
Create and update user manuals, and deliver user coaching/training (especially within Microsoft Dynamics – FinOps) < r>Assist in developing and maintaining policies, procedures, and documentation for business processes
Conduct master file maintenance and ensure data accuracy
Identify and recommend process improvements to increase efficiency
Perform other administrative or ad-hoc tasks as needed
System Administrator
Posted 16 days ago
Job Viewed
Job Description
- Minimum of 3 years IT related experience br>
Principal Duties and Responsibilities
1. Establish and control systems access and security.
2. oordinate protocols for operation within multi-user IT networks that provide voice, data and text transmissions.
3. Check systems in order to optimize performance and to initiate recovery action after system failures
4. Implement regular housekeeping procedures, including data backup.
5. Manage the distribution and retention of data on various storage devices.
6. Coordinate system updates and replacement of 'old' versions.
7. Provide assistance in testing new equipment and systems.
8. Prepare technical reports on the operation of systems.
9. Prepare systems cost estimates and cost-benefit analyses.
10. Provide training in systems use and access.
11. Oversee the daily performance of computer systems
12. Prepare evaluations of software or hardware, and recommend improvements or upgrades
13. Coordinate workers engaged in problem-solving, monitoring, and installing data communication equipment and software
14. Conducts preventive and corrective maintenance of all server-related hardware.
System Administrator
Posted 17 days ago
Job Viewed
Job Description
Join Intelligent Technical Solutions, a dynamic and growing company, as our System Administrator. We're looking for a collaborative problem-solver with excellent customer service skills and a detail-oriented mindset. In this role, you'll be pivotal in providing remote IT support, ensuring our clients receive top-notch technical assistance. You'll handle a range of tasks, from system updates to hands-on troubleshooting, playing a critical role in maintaining our high standards of IT service. br>
Job Responsibilities:
This role is expected to be able to resolve the following technical issues with little or no assistance:
Workstation operating system issues of any kind.
Printer issues of any kind.
Standard business application (Office, etc.) issues of any kind.
Line-of-business application issues for common applications (SAGE ERP, QuickBooks), and for more niche applications with vendor assistance.
Server issues of any kind.
Basic and intermediate networking issues.
Escalation point for technicians
Job Qualifications:
Experience with Windows and Mac OS troubleshooting.
Experience with Server 2012 to 2022.
Experience with VMWare and Hyper V Virtualization.
Experience supporting M365 and Azure.
Minimum of three to five years experience in IT Support or a similar role.
Basic knowledge of Firewalls, Switches, Network architecture, and troubleshooting.
Proficient in supporting MS Office, Adobe, Active Directory, backup software, endpoint cybersecurity, mobile devices, and various business applications.
Experience with Managed Service Providers (MSPs) is highly desirable.
Active Server Certifications are preferred.
Job KPIs:
First-touch closed: Each ticket that is completed with only one time entry contributes to this number
Utilization: The percentage of time that you are on the clock and billing time to client tickets
CSAT: Scores filled out by clients using the rating system in tickets
Job Requirements:
A home office set up that meets the following:
A working computer with minimum of 8gb RAM, i5 Windows10 or macOS Big Sur.
Stable internet connection with at least 5mbps download and upload.
A quality headset with noise cancellation feature for meetings.
A mobile phone that has the capacity to install the Microsoft Authentication app.
HD webcam
2nd Monitor
UPS (Uninterruptible Power Supply) if using a desktop (Not needed if using a laptop)
A very strong cultural fit the ITS way (Our core values will be discussed during the interview)
Excellent command of the English language both spoken and written
Able to work full time 40 hours a week - 8:00AM to 5:00PM, Pacific Standard Time (PST) from Monday to Friday.
Able to pass our online behavior and cognitive assessments.
Preferably, at least two of the following certifications: CompTIA A+, CompTIA Network+, CompTIA Security+, 3cx Basic Certified Engineer.
Able to provide NBI Clearance (for PH applicants).
Compensation:
Pay rate starts at $8.70/hr and may vary by experience and location.
Benefits:
Comprehensive Health Plans / HMO (Employee & Dependent/s)
Life Insurance
Flexible Benefit Reimbursement (Dental/Vision/Mental Health/Wellness/Government Contributions)
Paid Time Off (starts at 15 days per year)
Maternity/Paternity Leave
Paid US Holiday
Night Shift Bonus
Salary Advancement/Loan
Health & Wellness Program
Company-paid training and certification
System Administrator
Posted 1 day ago
Job Viewed
Job Description
- Bachelor’s Degree in Information Technology, Electronics & Communication Engineering or any related field/ Completed vocational course in Information Technology.
- 2-3 years of proven relevant experience in IT.
- 1-2 years enterprise level administration and troubleshooting experience with Windows Server.
- 1-2 years' experience deploying, supporting, and administering VMware 5.0+ server infrastructure environments.
- Understanding and experience of networking technologies including AD, DNS, WINS, and DHCP.
- PCI DSS
- Experience with HP or Intel server hardware, firmware, BIOS, RAID, server backups and server security along with SAN
- Must possess good problem solving and troubleshooting skills in supporting enterprise level clients, logical and methodical problem-solving skills, and the ability to communicate and justify the resolution process where necessary.
- Exceptional communication and people skills
- Highly professional approach and positive attitude
- VCP, MCSA or MCSE certifications preferred but not required
**Duties & Responsibility**:
1. Daily tasks include AD configuration, and deployment of VM
- VMware Infrastructure
- Windows Servers 2008 R2, 2012, 2016, & 2019
- Domain Controller, DNS, DHCP, GPO
- DFS Fileserver
- IIS
- MS SQL
- Anti-Virus
- WSUS
- FTP
- Backup
- Syslog
- Office 365
2. Shared responsibility and daily operational deployment of entire server environment
3. Patch management and Server management
4. Working with a team to help maintain the vSphere infrastructure
5. Work with vendors / cross team engineering group/s to provide solutions to complex issues.
6. Ability to work flexible hours including on call and after hours work for scheduled and unscheduled maintenance.
**Enjoy the following perks**:
- Basic salary plus ₱ 3,000.00 Non - taxable Allowance (upon permanency)
- HMO with 1 free dependent
- SL, VL, EL, Mental Health Day leave, and Bereavement leave (upon permanency)
- Transportation allowance *terms and conditions apply.
APPLY NOW!
**Job Types**: Full-time, Permanent
**Salary**: Php25,000.00 - Php30,000.00 per month
**Benefits**:
- Additional leave
- Company Christmas gift
- Company events
- On-site parking
- Opportunities for promotion
- Paid training
- Promotion to permanent employee
Schedule:
- Holidays
- Night shift
- Overtime
- Weekends
Supplemental pay types:
- 13th month salary
- Overtime pay
- Yearly bonus
Be The First To Know
About the latest Site reliability Jobs in Philippines !
System Administrator L2
Posted 6 days ago
Job Viewed
Job Description
br>Work Arrangement: Remote
Schedule: Rotating Shift
Location: Cebu
Employment Type: Full-time
Salary: ₱80,000 – ₱100,000
br>About the Role:
The System Administrator L2 is responsible for resolving Level 1 and Level 2 incidents, ensuring timely and exceptional customer service, maintaining accurate documentation, and continuously developing technical skills. This role requires strong technical expertise, proactive problem-solving, and the ability to provide efficient support in a dynamic IT environment.
Key Responsibilities:
Resolve Level 1 and Level 2 incidents and follow escalation protocols
● Continuously review all tickets and statuses in your queue. < r> ● Respond promptly to colleagues, end users, and partners. < r> ● Follow company, partner, and customer security protocols at all times. < r> ● Assess tickets, escalate when necessary, and document steps thoroughly. < r> ● Ensure issues are fully resolved and verified with customers before closing tickets. < r>
Provide timely and exceptional customer service
● Attend appointments and meetings on time. < r> ● Set clear expectations with colleagues, partners, and end users. < r> ● Communicate clearly via chat, email, and phone using professional language. < r> ● Keep users updated frequently and provide proactive follow-ups. < r> ● Deliver warm, friendly, and solution-focused customer service. < r>
Maintain accurate records and documentation
● Record all time spent on issues against relevant tickets. < r> ● Update notes with clear, detailed, and secure information. < r> ● Ensure consistent communication with end users, partners, and colleagues. < r> ● Avoid disclosing sensitive internal information. < r>
Proactive learning and development
● Enhance knowledge by reviewing escalated tickets and documentation. < r> ● Attend webinars and training sessions to strengthen technical skills. < r> ● Share knowledge, tips, and best practices with colleagues. < r> ● Engage in self-paced learning to develop expertise. < r>
Level 1 Incident Examples
● Credential management (password resets, account unlocks). < r> ● Microsoft Windows, Office, and Office 365 desktop support. < r> ● User connectivity and application support. < r> ● New user setup and deactivation in AD/Azure AD and Office 365. < r> ● VPN account setup and email configuration. < r> ● Device provisioning (workstations, laptops, smartphones). < r> ● Virus/malware scanning and general performance optimization. < r>
Level 2 Incident Examples
● Partial service outages affecting subsets of staff. < r> ● Server-level support impacting limited users. < r> ● Group permissions and printer support. < r> ● Troubleshooting desktop/terminal server applications. < r> ● Proactive server maintenance and monitoring. < r>
Qualifications:
● Minimum IT diploma level education and relevant Microsoft IT certification. < r> ● Minimum 1 year in a Level 2 role or 2+ years in Level 1 with proven L2 skills. < r> ● Minimum 2 years in a B2B IT support role. < r> ● Strong knowledge of Microsoft desktop OS, Microsoft 365/Office 365, virtualization, and networking. < r> ● Experience with FortiGate Firewalls, remote desktop tools (e.g., TeamViewer), and documentation/IT diagrams. < r> ● Strong customer service skills and fluent verbal/written English. < r> ● Certifications (e.g., Microsoft, Cisco) are a plus. < r>
Candidates with attached CV and relevant experience will be considered for a phone interview.
System Administrator L2
Posted 8 days ago
Job Viewed
Job Description
br>Work Arrangement: Remote
Schedule: Rotating Shift
Location: Cebu
Employment Type: Full-time
Salary: ₱80,000 – ₱100,000
br>About the Role:
The System Administrator L2 is responsible for resolving Level 1 and Level 2 incidents, ensuring timely and exceptional customer service, maintaining accurate documentation, and continuously developing technical skills. This role requires strong technical expertise, proactive problem-solving, and the ability to provide efficient support in a dynamic IT environment.
Key Responsibilities:
Resolve Level 1 and Level 2 incidents and follow escalation protocols
● Continuously review all tickets and statuses in your queue. < r>● Respond promptly to colleagues, end users, and partners. < r>● Follow company, partner, and customer security protocols at all times. < r>● Assess tickets, escalate when necessary, and document steps thoroughly. < r>● Ensure issues are fully resolved and verified with customers before closing tickets. < r>
Provide timely and exceptional customer service
● Attend appointments and meetings on time. < r>● Set clear expectations with colleagues, partners, and end users. < r>● Communicate clearly via chat, email, and phone using professional language. < r>● Keep users updated frequently and provide proactive follow-ups. < r>● Deliver warm, friendly, and solution-focused customer service. < r>
Maintain accurate records and documentation
● Record all time spent on issues against relevant tickets. < r>● Update notes with clear, detailed, and secure information. < r>● Ensure consistent communication with end users, partners, and colleagues. < r>● Avoid disclosing sensitive internal information. < r>
Proactive learning and development
● Enhance knowledge by reviewing escalated tickets and documentation. < r>● Attend webinars and training sessions to strengthen technical skills. < r>● Share knowledge, tips, and best practices with colleagues. < r>● Engage in self-paced learning to develop expertise. < r>
Level 1 Incident Examples
● Credential management (password resets, account unlocks). < r>● Microsoft Windows, Office, and Office 365 desktop support. < r>● User connectivity and application support. < r>● New user setup and deactivation in AD/Azure AD and Office 365. < r>● VPN account setup and email configuration. < r>● Device provisioning (workstations, laptops, smartphones). < r>● Virus/malware scanning and general performance optimization. < r>
Level 2 Incident Examples
● Partial service outages affecting subsets of staff. < r>● Server-level support impacting limited users. < r>● Group permissions and printer support. < r>● Troubleshooting desktop/terminal server applications. < r>● Proactive server maintenance and monitoring. < r>
Qualifications:
● Minimum IT diploma level education and relevant Microsoft IT certification. < r>● Minimum 1 year in a Level 2 role or 2+ years in Level 1 with proven L2 skills. < r>● Minimum 2 years in a B2B IT support role. < r>● Strong knowledge of Microsoft desktop OS, Microsoft 365/Office 365, virtualization, and networking. < r>● Experience with FortiGate Firewalls, remote desktop tools (e.g., TeamViewer), and documentation/IT diagrams. < r>● Strong customer service skills and fluent verbal/written English. < r>● Certifications (e.g., Microsoft, Cisco) are a plus. < r>
Candidates with attached CV and relevant experience will be considered for a phone interview.
Junior System Administrator
Posted 9 days ago
Job Viewed
Job Description
Bachelor’s Degree in Information Technology, Computer Science, or a related field. br>
Must have minimum 6 months of experience in System Administration. (Non negotiable)
Must have minimum 6 months of experience in Windows Server (2016/2019/2022). (Non negotiable)
Must have experience in Linux Server (RHEL/OEL/CentOS). (Non negotiable)
Must have experience in Windows and Linux system administration tasks. (Non negotiable)
Must have understanding of networking principles, including LAN/WAN configuration, Active Directory, email systems, and web-based applications. (Non negotiable)
Must have knowledge in virtualization environments (VMWare, Hyper‑V, Oracle VM (OVM)) (Non negotiable) < r>
Exposure to server and storage hardware setup, backup solutions, and system monitoring tools. (Non negotiable)
Knowledge of basic database administration using Microsoft SQL Server or Oracle is an advantage.
Experience or familiarity with scripting and automation tools such as PowerShell, Cron, Visual Basic, or batch scripts.
Awareness of ITIL framework and compliance standards such as PCI-DSS is preferred
Exposure to banking systems or financial IT environments is an advantage.
Strong analytical and problem-solving skills.
Ability to work independently and as part of a team.
Willingness to learn new technologies and adapt to evolving IT environments
Good communication and documentation skills.
Job Description
Installing, configuring, maintaining, and operating server hardware, operating systems, and system utilities
Handling routine tasks such as provisioning, patching, monitoring, capacity planning, and performance tuning
Maintaining infrastructure performance and availability
Collaborating with cross‑functional teams across IT infrastructure, applications, and support groups < r>
Working within enterprise environments, including virtualization, clustering, system upgrades, and cloud transition frameworks