Design, build, and maintain multi‑cloud landing zones, environments, and guardrails in AWS, Azure, and Alibaba Cloud using Infrastructure‑as‑Code (IaC).
Standardize reusable Terraform modules and policy‑as‑code to enforce security, compliance, and configuration baselines.
Implement GitOps and CI/CD pipelines for infrastructure and application workloads. Use scripting to automate common infrastructure management and operations tasks
Develop proficiency in the scripting tools and software that are essential in the design, build, management, and operation of infrastructure solutions and services
Drive cloud infrastructure platform operation with cloud native and serverless technology including Kubernetes, Docker, AWS Lambda, Azure Function, Ali Function Compute, API gateway, etc.
Design and prototype AI-driven automation workflows.
Integrate intelligent alerting and anomaly detection. Correlate events across telemetry sources to reduce alert noise and prioritize incidents
Build auto‑remediation workflows (functions, runbooks, chatbots) and ChatOps integrations (Teams) for operations tasks.
Partner with data/ML teams to operationalize simple models for forecasting, capacity planning, or anomaly detection where appropriate.
Design, implement, administer, and support infrastructure technologies and solutions. These technologies and solutions can include computing, storage, networking, physical infrastructure, software, commercial-of-the-shelf (COTS), and open source packages and solutions. They can also include virtual and cloud computing such a Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS)
Identify information security risks and the controls that can be used to mitigate threats within solutions and services
Competently apply a modern standards approach and guide others to do so
Own an issue until a new owner has been found or the problem has been mitigated or resolved
Take inputs from stakeholders and establish solutions that facilitate the achievement of business objectives
Design systems characterized by medium levels of risk, impact, and business or technical complexity
Select appropriate design standards, methods, and tools, and ensure they are applied effectively
Review the systems designs of others to ensure the selection of appropriate technology, efficient use of resources, and integration of multiple systems and technology
Define the integration build
Co-ordinate build activities across systems
Understand how to undertake and support integration testing activities
Develop a thorough understanding of the technical concepts required and effectively communicate how these concepts apply to the wider technical landscape of IT
Review requirements and specifications and define test conditions
Identify issues and risk associated with work
Analyze and report test activities and results
Break a problem down into its component parts to identify and diagnose root causes
Troubleshoot and identify problems across different technology capabilities