Who We Are

Located in the heart of Bangkok’s Phrom Phong District, Sertis is ASEAN's leading Data and AI engineering and solutions company. Since 2014, our advanced solutions and products have powered over 400 enterprise Data and AI deployments at the region’s leading companies and conglomerates. We are also a member firm of Andersen Consulting, a global consulting practice integrating business strategy, digital transformation, and AI-driven technology solutions with Andersen Global’s world-class expertise.

What We Do

Sertis provides both productized and bespoke AI and Data solutions for our Customers, optimizing and commercializing their data in ways that activate real business results. Our 140+ team have developed product offerings and IP ranging from advanced Computer Vision applications accredited Global Top 20 by NIST, to automated insights monetization for Retailers, eKYC for financial institutions, AI-driven agricultural safety assurance, precision steel cutting, trading algorithms for hedge funds, and enterprise knowledge management systems based on AI.

Our Aspiration

We are data and AI pioneers, dedicated to enhancing the economic and social lives of our customers via technology. We are not just living in history, we are making history everyday. In becoming one of the world’s leading Data and AI companies, we always double-down on remaining a place where a diverse mix of talent wants to come, do their best work, and stay. We pride ourselves on bringing the best talent worldwide into a culture that encourages learning, growth opportunities, innovative contributions, and a sense of ownership. As part of Andersen Consulting, we are committed to delivering best-in-class Data and AI solutions—aligned with a global platform known for innovation, integration, and impact—while continuing to set benchmarks in the region and beyond.

For more information, please visit: sertiscorp.com

Overview of the job

Our Senior-Lead Site Reliability Engineer will be responsible for improving the efficiency and reliability of our software development and deployment processes, as well as ensuring the availability, performance, and scalability of our systems and services. You will work closely with our Machine Learning, Software Engineering, Quality Assurance, and Data Engineering teams to automate and streamline the build, test, and deployment of our systems and services through automation. Additionally, you will be responsible for designing and building new infrastructure, as well as supporting pre-sales activities and continuously improving our CI/CD pipeline, monitoring and processes.

In this role, you will get to:

Automate infrastructure provisioning, configuration management, and deployment processes
Ensure the availability, performance, and scalability of our systems and services by continuously monitoring and maintaining them.
Implement and maintain SLAs to meet or exceed customer expectations and to ensure that the systems are operating effectively and efficiently
Design, build, and maintain the CI/CD pipeline to ensure the efficient and reliable deployment of software releases
Develop and maintain runbooks/playbooks, and procedures for responding to incidents and for performing regular maintenance activities
Work closely with the different engineering teams to identify and resolve production issues, establish and implement best practices for reliability and performance, improve the overall quality and efficiency of the systems
Conduct incident response and post-mortem analysis to identify root causes and prevent future incidents
Share your expertise across the team and mentor the junior/mid-level via code reviews, 1:1 sessions, workshops or knowledge sharing sessions, to enhance their technical skills and understanding of best SRE practices
Participate in the recruitment in order to evaluate and interview candidates, as well as improving our recruitment processes
Ensure and advocate for DevOps practices in the company

You'll be successful if you have:

5-8 years of hands-on experience in designing, building, maintaining cloud infrastructure, and applying DevOps and SRE practices in large-scale systems
In-depth knowledge about container orchestration principles and techniques, including hands-on experience with Docker and platforms such as Kubernetes, to effectively manage and deploy containerized applications at scale
In-depth knowledge of cloud infrastructure and its components, including virtual machine, serverless, storage, networking, and security, with hands-on experience in deploying and managing applications in cloud environments, to ensure optimal utilization and cost-effectiveness of cloud resources
Ability to design and build new infrastructure and continuously improve the CI/CD pipelines
Strong automation and IaC skills, including experience with tools such as Terraform, AWS CDK, Flux/ArgoCD, Helm, and Gitlab CI
Ability to scope projects, define architectures, and choose technologies based on project requirements
Experience with monitoring (Prometheus/Grafana preferred) and defining SLAs
A secure by design mindset and understanding of the importance of security in the development and deployment process
Strong problem-solving skills and experience in troubleshooting production issues
Ability to work collaboratively with different teams
Leadership skills and ability to mentor junior and mid-level Site Reliability Engineers
Familiarity with Agile, DevOps and SRE best practices and methodologies
Proactiveness in keeping up to date with the latest technology and industry trends
Excellent communication (written/verbal) skills, and the ability to effectively communicate with technical and non-technical stakeholders

It's a plus if you have:

Gathering customer requirements and estimating project scope during pre-sales interactions
Working on multiple projects simultaneously
Penetration testing tools such as Nessus, Nikto, nmap, etc. and have been using them before
Holding certifications of cloud providers or CNCF (such as CKA/CKAD, AWS/GCP/Azure).
Optimizing cloud costs through various strategies
Utilizing service mesh technologies, such as Istio or Linkerd
Implementing canary deployment strategies for testing and deploying new releases in a controlled and safe manner

What are some benefits working at Sertis?

Hybrid working environment, up early or slow starter in the morning? We have flexible office hours
Get to work and learn from the best in the industry, and share your ideas with like-minded individuals
We cultivate intelligence and learning so that our experts can become community leaders in their respected fields in the tech industry
Amazing colleagues to enjoy company social outings, parties, and events
Result-oriented workplace; We provide direction, not orders and give you the autonomy to deliver your best work
We work at the frontier of innovation in the AI industry
Work on meaningful solutions that solve and improve real-life problems and challenges
We run like a startup, and embrace the adventure; we focus on getting things done, while still having a down-to-earth and informal culture

This is your chance to build your career in a growing data-driven and AI industry.

APPLY NOW!

Sertis may collect, use, or disclose your personal data or personal data of other persons provided by you in order to carry out your recruitment process. For more information, please refer to our Recruitment Privacy Notice

Site Reliability Engineer Lead / SRE Lead

Who We Are

What We Do

Our Aspiration

Overview of the job

In this role, you will get to:

What are some benefits working at Sertis?

Site Reliability Engineer

Engineer, Reliability/Test

Engineer Sr, Reliability/Test

Enterprise Sales Lead (IT Sales Lead)

Lead/Senior lead, Partner Program - Supply (Bangkok - Based, Relocation provided)