Join DevFest Scotland Online for an insightful session on Site Reliability Engineering! Discover essential SRE practices for cloud platforms like Google Cloud, emphasizing resilience, scalability, and security. Key topics include chaos engineering, observability, and automation, with practical insights into tools like Terraform and Kubernetes for boosting reliability.
Site Reliability Engineering (SRE) plays a crucial role in ensuring the performance, scalability, and security of mission-critical cloud systems. This talk will explore current SRE practices and how they drive reliability in cloud-native environments. We'll examine leading platforms, including AWS, Google Cloud, Microsoft Azure, and Oracle Cloud Infrastructure (OCI), with particular attention to infrastructure-as-code, serverless computing, and container orchestration.
Leveraging real-world case studies, this session will dive into essential SRE methodologies such as chaos engineering, capacity planning, and advanced observability. Attendees will gain insights into optimizing database performance, networking, and security using state-of-the-art tools like Terraform, Kubernetes, and Prometheus. The talk will also address industry-specific challenges in sectors like finance, e-commerce, and healthcare, showcasing how to define and manage key metrics like error budgets, service-level indicators (SLIs), and service-level objectives (SLOs).
Finally, we will explore the future of SRE, powered by automation, AI-driven anomaly detection, and next-gen monitoring solutions. Attendees will leave with actionable insights to enhance system reliability and align SRE practices with the specific needs of their organizations and industries. This session offers a comprehensive roadmap for building resilient, scalable systems that meet the demands of modern cloud infrastructure.
This session will be led by Nagarjuna Malladi, Principal Site Reliability Engineer at Oracle America, Inc., who brings over 12 years of experience in system reliability, monitoring optimization, and operational efficiency.
Oracle America
Principal Software Engineer SRE
Charles River Laboratories
Tech Director | AI/ML GDE
Charles River Laboratories
Organizer
JPMorgan Chase & Co.
Organizer
JPMorgan Chase & Co.
Organizer
Glasgow Caledonian University
Organiser
Barclays
Software Developer - Organizer
Jordanhill School
GDG Glasgow Youth Team Leader
University of Strathclyde
Team member
University of Glasgow
Team member
Charles River Laboratories
Team member
JP Morgan
Technical BA
PGT Student