Who We Are
Located in the heart of Bangkok’s Phrom Phong District, Sertis is ASEAN's leading Data and AI engineering and solutions company. Since 2014, our advanced solutions and products have powered over 400 enterprise Data and AI deployments at the region’s leading companies and conglomerates. We are also a member firm of Andersen Consulting, a global consulting practice integrating business strategy, digital transformation, and AI-driven technology solutions with Andersen Global’s world-class expertise.
What We Do
Sertis provides both productized and bespoke AI and Data solutions for our Customers, optimizing and commercializing their data in ways that activate real business results. Our 140+ team have developed product offerings and IP ranging from advanced Computer Vision applications accredited Global Top 20 by NIST, to automated insights monetization for Retailers, eKYC for financial institutions, AI-driven agricultural safety assurance, precision steel cutting, trading algorithms for hedge funds, and enterprise knowledge management systems based on AI.
Our Aspiration
We are data and AI pioneers, dedicated to enhancing the economic and social lives of our customers via technology. We are not just living in history, we are making history everyday. In becoming one of the world’s leading Data and AI companies, we always double-down on remaining a place where a diverse mix of talent wants to come, do their best work, and stay. We pride ourselves on bringing the best talent worldwide into a culture that encourages learning, growth opportunities, innovative contributions, and a sense of ownership. As part of Andersen Consulting, we are committed to delivering best-in-class Data and AI solutions—aligned with a global platform known for innovation, integration, and impact—while continuing to set benchmarks in the region and beyond.
For more information, please visit: sertiscorp.com
Overview of the job
Our Senior-Lead Site Reliability Engineer will be responsible for improving the efficiency and reliability of our software development and deployment processes, as well as ensuring the availability, performance, and scalability of our systems and services. You will work closely with our Machine Learning, Software Engineering, Quality Assurance, and Data Engineering teams to automate and streamline the build, test, and deployment of our systems and services through automation. Additionally, you will be responsible for designing and building new infrastructure, as well as supporting pre-sales activities and continuously improving our CI/CD pipeline, monitoring and processes.
In this role, you will get to:
- Automate infrastructure provisioning, configuration management, and deployment processes
- Ensure the availability, performance, and scalability of our systems and services by continuously monitoring and maintaining them.
- Implement and maintain SLAs to meet or exceed customer expectations and to ensure that the systems are operating effectively and efficiently
- Design, build, and maintain the CI/CD pipeline to ensure the efficient and reliable deployment of software releases
- Develop and maintain runbooks/playbooks, and procedures for responding to incidents and for performing regular maintenance activities
- Work closely with the different engineering teams to identify and resolve production issues, establish and implement best practices for reliability and performance, improve the overall quality and efficiency of the systems
- Conduct incident response and post-mortem analysis to identify root causes and prevent future incidents
- Share your expertise across the team and mentor the junior/mid-level via code reviews, 1:1 sessions, workshops or knowledge sharing sessions, to enhance their technical skills and understanding of best SRE practices
- Participate in the recruitment in order to evaluate and interview candidates, as well as improving our recruitment processes
- Ensure and advocate for DevOps practices in the company
You'll be successful if you have:
- 5-8 years of hands-on experience in designing, building, maintaining cloud infrastructure, and applying DevOps and SRE practices in large-scale systems
- In-depth knowledge about container orchestration principles and techniques, including hands-on experience with Docker and platforms such as Kubernetes, to effectively manage and deploy containerized applications at scale
- In-depth knowledge of cloud infrastructure and its components, including virtual machine, serverless, storage, networking, and security, with hands-on experience in deploying and managing applications in cloud environments, to ensure optimal utilization and cost-effectiveness of cloud resources
- Ability to design and build new infrastructure and continuously improve the CI/CD pipelines
- Strong automation and IaC skills, including experience with tools such as Terraform, AWS CDK, Flux/ArgoCD, Helm, and Gitlab CI
- Ability to scope projects, define architectures, and choose technologies based on project requirements
- Experience with monitoring (Prometheus/Grafana preferred) and defining SLAs
- A secure by design mindset and understanding of the importance of security in the development and deployment process
- Strong problem-solving skills and experience in troubleshooting production issues
- Ability to work collaboratively with different teams
- Leadership skills and ability to mentor junior and mid-level Site Reliability Engineers
- Familiarity with Agile, DevOps and SRE best practices and methodologies
- Proactiveness in keeping up to date with the latest technology and industry trends
- Excellent communication (written/verbal) skills, and the ability to effectively communicate with technical and non-technical stakeholders
It's a plus if you have:
- Gathering customer requirements and estimating project scope during pre-sales interactions
- Working on multiple projects simultaneously
- Penetration testing tools such as Nessus, Nikto, nmap, etc. and have been using them before
- Holding certifications of cloud providers or CNCF (such as CKA/CKAD, AWS/GCP/Azure).
- Optimizing cloud costs through various strategies
- Utilizing service mesh technologies, such as Istio or Linkerd
- Implementing canary deployment strategies for testing and deploying new releases in a controlled and safe manner
What are some benefits working at Sertis?
- Hybrid working environment, up early or slow starter in the morning? We have flexible office hours
- Get to work and learn from the best in the industry, and share your ideas with like-minded individuals
- We cultivate intelligence and learning so that our experts can become community leaders in their respected fields in the tech industry
- Amazing colleagues to enjoy company social outings, parties, and events
- Result-oriented workplace; We provide direction, not orders and give you the autonomy to deliver your best work
- We work at the frontier of innovation in the AI industry
- Work on meaningful solutions that solve and improve real-life problems and challenges
- We run like a startup, and embrace the adventure; we focus on getting things done, while still having a down-to-earth and informal culture
This is your chance to build your career in a growing data-driven and AI industry.
APPLY NOW!
Sertis may collect, use, or disclose your personal data or personal data of other persons provided by you in order to carry out your recruitment process. For more information, please refer to our Recruitment Privacy Notice