-
Manage, configure, maintain Linux servers (e.g., RHEL, CentOS, Ubuntu)
-
Monitor Performance and Plan Capacity
-
Implement backup and disaster recovery processes and Fix Network Issues
-
Create Automation Scripts and Maintain Automation Playbooks in Ansible, Chef for seamless environment setup
-
Create shell scripts to simplify tasks like system monitoring and software installation
-
Keep CI/CD pipelines running smoothly for quick and reliable builds, testing, and deployments.
-
Create and maintain pipelines using tools like Jenkins, GitLab CI/CD, and Redgate
-
Add automated tests and quality checks in the pipeline to improve code quality.
-
Set up notifications and error logging to identify build failures and deployment issues
-
Install and configure monitoring, alerting, and logging tools like Prometheus, Grafana, and the ELK Stack to ensure services run smoothly and efficiently.
-
Develop dashboards and alerts in tools like Zabbix, Nagios, and Grafana to visualize infrastructure health and spot performance issues.
-
Address system and application alerts promptly to diagnose and fix problems quickly
-
Apply security measures like firewall rules, user access controls, and patch management and vulnerability scans.
-
Set up secure authentication and authorization across the infrastructure and Conduct regular audits on system configurations to ensure they meet security standards.
-
Build Docker images and manage containerized applications for secure and consistent deployments.
-
Work with orchestration tools like Kubernetes or Docker Swarm to manage containers and Troubleshoot container-related issues and optimize performance
-
Document infrastructure configurations, deployment processes, and troubleshooting steps.
-
Regularly update the knowledge base to keep the team informed of environmental changes and create clear documentation for scripts, tools, and workflows.
-
Quickly address and resolve critical incidents in production to reduce downtime and Use monitoring tools to identify potential system failures before they affect users.