Striking a balance between pager volume and system reliability is an ongoing challenge for teams in the software industry. By focusing on KPI-based alerts and implementing better monitoring and alerting practices, teams can work towards reducing pager fatigue while ensuring system reliability.
Author: Mridul Verma
Atomically Updating Multi-Node Cache
In this blog post, we will go over a simple intuitive approach on how to atomically update a hashmap which has been distributed across multiple nodes.
Proxy Server – Proxying at Different TCP / IP Model Levels
Understanding Basics of the Network Proxy - Part II
Proxy Server – TCP / IP Model Internals
Understanding Basics of the Network Proxy - Part I
Introduction to Proxyview
Proxyview helps you to proxy HTTP Requests to all your internally deployed services on k8s pods or ec2 machines via a single endpoint.
Debugging K8s Connection Refused
Getting Connection Refused Exceptions while sending connection requests to an k8s pod's application binding to 127.0.0.1
API Best Practices
Learn about the design practices we followed and fixes we made to improve the page load performance of our product
Evaluating Product Architecture
Different dimensions for evaluating Product Architecture.
Things to do: K8s Master going down
When was the last time when one of yours K8s master went down and what did you do about it and also do we know why did the k8s master went down. In this blog post, we will go through one of the same scenarios which happened within our k8s cluster and understand the details … Continue reading Things to do: K8s Master going down
Whitelisting IPs with NGINX Ingress K8s
Understand how to whitelist IPs with NGINX via Ingress Objects and securing your kubernetes infrastructure