LogoLanguage
REIZEND PRIVATE LIMITED

SBC Module 15, -2 Floor, Thejaswini Building, Technopark phase 1, Trivandrum , 695581

DevOps Technical Support (Junior Role)

Closing Date:03,Apr 2025
Job Published: 17,Feb 2025

Brief Description

We are looking for a proactive &  highly skilled Technical Support Engineer to provide exceptional technical support to overseas projects, working in rotational shifts to ensure 24/7 availability. This role involves troubleshooting complex issues across cloud platforms, networking, application architectures, and DevOps toolchains. The ideal candidate should be self motivated, a collaborator, agile and a continuous learner.

Key Responsibilities

  1. Provide technical support and troubleshoot issues related to cloud platforms and services such as FargateECSDynamoDBBigQuerySNS etc.
  2. Understand the problems by consuming logs and metrics from various sources using the services such as CloudWatchPrometheusGrafanaLokiAlert Managers and Splunk etc.
  3. Analyze and resolve networking challenges, including load balancers, API gateways, reverse proxies, ingress controllers, and service-to-service communications.
  4. Work on issues related to client-server communications, firewalls, and virtual machines.
  5. Collaborate with DevOps teams to manage and troubleshoot toolchains like Docker, Kubernetes, Jenkins, Ingress Controllers etc.
  6. Act as the first point of contact for technical queries and escalate issues when necessary.
  7. Liaise with development and operations teams to identify root causes and resolve incidents effectively.
  8. Document troubleshooting steps, solutions, and maintain a knowledge base for recurring issues.
  9. Collaborate with cross-functional teams to implement best practices for monitoring and incident response.
  10. Participate in shift handovers and provide timely updates on ongoing issues.

Preferred Skills

Technical Skills

Cloud Platforms and Services

  1. Hands on knowledge working with Fargate and ECS for managing and troubleshooting containerized workloads.
  2. Proficiency with DynamoDB and BigQuery for analyzing data and take decisions based on the analysis.
  3. Hands-on knowledge of SNS for debugging message delivery issues and integration workflows.

Monitoring and Logging Tools

  1. Proficiency in CloudWatch LogsLoki, and Splunk for consuming and analyzing logs to identify and resolve issues.
  2. Hands-on knowledge with Prometheus and Grafana for analysing metrics using dashboards and monitoring system health.
  3. Knowledge of Alert Manager for configuring and managing alert escalation.
  4. Ability to interpret metrics from various sources and create actionable insights.

Networking and Security

  1. Understanding of load balancers (e.g., ALB, NLB) for distributing traffic and troubleshooting connectivity issues.
  2. Knowledge in API Gateways like AWS API Gateway or NGINX for managing API traffic.
  3. Knowledge of reverse proxies and ingress controllers (e.g., NGINX IngressTraefik) for managing internal/external traffic.
  4. Understanding service-to-service communications, including DNS, HTTP/HTTPS, and gRPC protocols.
  5. Hands-on knowledge with firewalls, security groups, and IAM roles for secure communications.
  6. Troubleshooting skills for VM-related issues in platforms like AWS EC2 or equivalent.

DevOps Toolchains

  1. Proficiency with Docker for managing container images and runtime debugging.
  2. Understanding of Kubernetes concepts of managing deployments, ingress setups, and pod-related issues and related troubleshooting commands and mechanisms.
  3. Knowledge of CI/CD pipeline building tools such as Jenkins, GitHub Actions, ArgoCD for building, deploying, and managing automated pipelines.
  4. Understanding of Ingress controllers (e.g., NGINX, Traefik) and SSL termination for secure routing.

Troubleshooting and Incident Management

  1. Strong problem-solving skills to identify root causes using logs, metrics, and system-level debugging.
  2. Ability to document detailed troubleshooting steps and solutions for recurring issues.

Collaboration and Communication

  1. Ability working with cross-functional teams (DevOps, development, and operations) to resolve incidents.
  2. Skills in effective and proactive communication to escalate issues and provide updates during shift handovers.
  3. Proficiency with tools like Slack, JIRA, Confluence, or Google Workspace for collaboration and issue tracking.

Salary Package- 4-6 LPA

Experience Required

Technical Support Engineer with 0.5 years of experience