Job Purpose:
The
Senior Manager, Cloud Operations, in a support capacity, is tasked with
overseeing the day-to-day management and running of cloud infrastructure and
services. Their primary purpose is to ensure the reliability, availability, and
performance of cloud-based systems to meet business needs.
Key
Deliverables:
- Lead the Cloud operations team in monitoring, maintaining, and
troubleshooting cloud environments to ensure smooth and uninterrupted
service delivery.
- Establish processes for incident response and problem
resolution, coordinating with internal teams and cloud service providers
to minimize downtime and service disruptions.
- Implement monitoring tools and processes to continuously
monitor the performance and health of cloud infrastructure, identifying
and addressing performance bottlenecks and optimization opportunities.
- Collaborate with stakeholders to forecast resource
requirements and plan capacity expansion or optimization strategies to
accommodate growth and changing business demands.
- Manage changes to cloud environments, including configuration
changes, updates, and patches, ensuring changes are implemented safely and
with minimal impact on operations.
- Liaise with cloud service providers to escalate support
issues, coordinate maintenance activities, and stay informed about product
updates and new features.
- Enforce security policies and compliance standards within
cloud environments, conducting regular audits and implementing remediation
measures as necessary to mitigate risks.
- Maintain documentation of cloud configurations, procedures,
and best practices, and provide training to operations team members to
ensure they are equipped to effectively manage cloud infrastructure.
- Track and report on key performance indicators related to
incident response, including mean time to resolution (MTTR) and incident
recurrence rates.
- Generate regular performance monitoring reports, highlighting
trends, anomalies, and areas for optimization.
- Provide recommendations for capacity planning based on usage
trends, anticipated growth, and resource utilization metrics.
- Maintain a record of all changes made to cloud environments,
documenting the change process and outcomes for audit and compliance
purposes.
- Produce security compliance reports, detailing adherence to
regulatory requirements and internal security policies.
- Develop and maintain a knowledge base of standard operating
procedures (SOPs) and troubleshooting guides for common cloud-related
issues.
- Design and deliver training programs for operations team
members, covering cloud technologies, best practices, and procedural
guidelines.
- Identify opportunities for process improvement and automation
to enhance the efficiency and effectiveness of cloud operations.
Qualifications
Professional
Experience & Certifications:
- 5 Yeasrs + of extensive experience in managing and leading
cloud operations teams.
- In-depth knowledge of cloud infrastructure, particularly
Azure, AWS, or Google Cloud.
- Proficiency in cloud monitoring tools and incident management
systems.
- Strong understanding of disaster recovery planning and
execution.
- Expertise in performance tuning, system stability, and
capacity planning.
- Excellent troubleshooting and problem-solving abilities.
- Strong leadership, team management, and communication skills.
- Experience with IT service management (ITSM) and best
practices.
- Bachelor’s degree in IT, Computer Science, Engineering, or a
related field.
- Professional Qualifications in Cloud Administrator/Architect
certification (e.g., Azure, AWS, or Google Cloud)
How
To Apply