The Cloud Engineer â Networking focuses on the design, operation, and troubleshooting of network services that underpin Rhapsodyâs AWSâhosted platforms (RaaS, CaaS, Envoy, Identity/NGS). You will build and support secure, resilient connectivity VPC/VPCe, Transit Gateway, Direct Connect, siteâtoâsite VPNs (including Sophos XG or similar), routing, DNS, and load balancing while partnering with CloudOps/SRE, Security, Product Support, and customer teams across US/UK/APAC time zones. Success in this role requires strong networking fundamentals, handsâon AWS networking, crisp incident handling, and a serviceâoriented mindset.
Key Responsibilities
- Design, configure, and operate AWS networking: VPC/VPCe, Subnets, Route Tables, NACLs, Security Groups, Transit Gateway, PrivateLink, NAT, IGW, Route 53, and hybrid connectivity patterns.
- Build and maintain siteâtoâsite VPNs (IPsec) and Direct Connect (with BGP), including failover and HA designs; administer Sophos XG (or equivalent) virtual firewalls.
- Manage Layerâ4/7 traffic using ALB/NLB, AWS WAF, TLS termination, and client/server certificate workflows (PKI).
- Lead deepâdive troubleshooting for network connectivity (AWS â customer DC/cloud), packet flow, NAT, routing asymmetry, MTU/fragmentation, TCP/TLS, DNS, and identityâadjacent issues.
- Instrument and monitor network health (CloudWatch, VPC Flow Logs, Datadog, firewall logs); respond to alerts, drive rapid mitigation, and provide clear RCA inputs.
- Execute network changes and environment builds using Terraform and AWS CLI following change controls and maintenance windows.
- Develop scripts (Bash/Python/PowerShell) for validation checks, log parsing, and configuration hygiene; reduce toil via automation and golden patterns.
- Enforce leastâprivilege network access, segmentation standards, and encryption in transit; collaborate with Security on detections and guardrails.
- Maintain auditable documentation (diagrams, SOPs/runbooks, firewall rulesets, cert inventories) and support patching/compliance activities.
- Work directly with customer IT/network teams to set up connectivity (VPN/DCX), perform cutovers, and resolve issues; explain decisions and tradeâoffs clearly.
- Partner with SRE/Engineering to improve observability, resiliency, and performance; assist Support with networkâcentric cases.
- Participate in the global onâcall rotation for P1/P2 incidents; own clean shift handoffs and accurate ticket hygiene.
- Contribute to postâincident reviews, knowledge base articles, and continuous improvement initiatives.