23 Oct
Kwal
Junagadh
Job Title: Senior Distributed Systems Engineer – Kubernetes, Azure, API Gateways, and Distributed Queuing
Location: Remote
Timezone: IST
About Us:
Kwal AI is a pioneering company in the voice AI space, focusing on scalable, cloud-native solutions tailored for the HRTech industry. Our technology stack leverages Kubernetes, Azure cloud services, Supabase, and Firebase to build distributed, high-performance systems. We integrate cutting-edge voice AI with recruitment platforms, optimizing interactions with seamless API gateways and robust queuing architectures. Our solutions are designed to handle massive data throughput while maintaining reliability and scalability across global infrastructures.
Job Description:
We are seeking an experienced Distributed Systems Engineer with expertise in designing and scaling cloud-native, distributed systems, primarily on Azure. You’ll work with Kubernetes, API gateways, distributed queuing architectures,databases and backend like PostgreSQL and Supabase to build scalable, reliable systems that can handle massive traffic and data throughput. You will collaborate with cross-functional teams to create systems that are performant, secure, and scalable across multiple geographies and cloud regions.
Responsibilities:
•Architect, design, and implement highly scalable, distributed systems leveraging Azure services , Kubernetes , and API gateways (e.g., Azure API Management, Zuplo).
•Design, implement, and scale distributed queuing systems (e.g., Azure Service Bus ) for large-scale message processing.
•Build and optimize PostgreSQL clusters and manage distributed databases using services like Postgres for Supabase .
•Integrate cloud-native databases like Supabase for data storage and synchronization at scale, ensuring high availability and performance.
•Utilize Azure Kubernetes Service (AKS) for container orchestration, scaling services automatically based on load, and ensuring high availability across zones and regions.
•Work with Azure Functions , Azure Logic Apps , Azure Service Bus and Event Grids to create highly scalable serverless architectures.
•Implement API gateways for routing, throttling, and monitoring API traffic, ensuring secure and scalable communication between microservices and external clients.
•Leverage Azure Monitor , Application Insights , and Log Analytics for monitoring, tracing, and ensuring the reliability of distributed services.
•Implement load balancing, autoscaling,
and failover strategies across multiple Azure regions to ensure resilience and disaster recovery capabilities.
•Utilize Azure Redis Cache for fast, globally distributed data access.
•Work with CI/CD pipelines using Github Actions , ensuring automated testing, deployment, and scaling across multiple environments.
•Continuously optimize system performance by conducting load testing , capacity planning, and tuning PostgreSQL , Kubernetes , and Azure resources .
•Ensure data security and compliance by implementing Azure Key Vault , Azure AD , and secure networking practices such as VNETs and Private Endpoints .
Required Skills and Qualifications:
•Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
•5+ years of experience in distributed systems, cloud architecture,
and large-scale system design.
• Proficiency: 5+ years of professional experience in software development, with a focus on Python and Node.js
•Strong expertise in Azure Cloud Services , including AKS , Service Bus , Event Queues , PostgreSQL , and Azure Functions .
•Extensive experience with Kubernetes , particularly in scaling microservices , managing multi-cluster environments , and ensuring high availability through pod autoscaling , load balancing , and service mesh technologies like Istio .
•Deep experience with distributed queuing systems such as Azure Service Bus , Kafka, or RabbitMQ .
•Strong knowledge of PostgreSQL , Supabase , or Firebase , with experience in optimizing distributed databases for low-latency, high-throughput workloads.
•Experience building and managing API gateways (e.g., Azure API Management , Zuplo ) for secure, scalable API interactions.
•Proven experience with distributed data storage and caching solutions like Azure Redis , Cosmos DB , or Supabase .
•Strong familiarity with Azure networking components (e.g., VNETs , VPN Gateway , Private Link ) for building secure, scalable cloud architectures.
•Experience with Infrastructure-as-Code (IaC) using Terraform or ARM templates to automate infrastructure provisioning and scaling on Azure .
•Strong understanding of serverless architectures using Azure Functions and Logic Apps .
•Experience with CI/CD pipelines and automated deployments using Azure DevOps or similar platforms.
•Familiarity with monitoring and observability tools such as Azure Monitor , Prometheus , Grafana , and Jaeger for distributed systems.
•Solid understanding of security practices, including OAuth , JWT , Bearer tokens , and integrating with Azure AD for identity and access management.
Nice-to-Have Skills:
•Experience with multi-region scaling and geo-replication on Azure , ensuring fault-tolerant, globally distributed services.
•Knowledge of service mesh architectures (e.g., Istio , Linkerd ) for managing complex microservice interactions in Kubernetes.
•Familiarity with real-time streaming technologies (e.g., Azure Event Hub , Apache Kafka ) for processing large-scale event data.
•Experience with Supabase or Firebase in building scalable, modern back-end solutions with integrated authentication, real-time data, and cloud functions.
•Exposure to chaos engineering and zero-downtime deployment strategies using Kubernetes and Azure services .