Meta is looking for a forward-thinking engineering leader with technical skills in systems, tooling/automation, and network operations to join the Edge and Network Services (ENS) Operations organization. In this role, you will lead initiatives to improve efficiency, reliability, and risk management through process, systems, and data in one of the largest-scale networks in the world. You will work within a fast-moving operations organization and should enjoy digging into complex operational and reliability challenges in order to implement process and technical system solutions at a global scale.
Own and shape the operations of Meta’s next-generation edge and CDN infrastructure. This role has direct influence over how we design, deploy, and operate the environments that connect our global infrastructure.
- Scope and outcomes: Work that is immediately felt by partner teams.
- Blend of strategic and hands-on technical leadership: You will drive long-term process and tooling direction while staying close enough to the ground to unblock restoration issues, optimize workflows, and ensure operational performance expectations are met.
- Cross-functional reach across Product Engineering, Security, Compliance, and Investment teams: This role interfaces with a broad group of stakeholders, providing the opportunity to influence standards, champion automation, and extend best practices across the subsea ecosystem.
- Opportunity to build and scale a team: You’ll develop talent, define operational guardrails, and shape a team centered around ownership, precision, and continuous improvement.
Responsibilities
- Be a key Subject Matter Expert in operations management with a focus on reliability and business process engineering, networking, and AI tooling/automation
- Formulate the right metrics and definitions of success to report and drive quality, efficiency, cost, and timeliness, and evolve these over time to match changes to the infrastructure and business requirements; manage related programs, project budget plans, and budgets
- Develop operations process improvement plans and work with partner teams to turn those improvements into scalable, automated AI-powered workflows by writing and reviewing code that improves operational efficiency
- Perform deep dives on complex operations and technical issues across programs and networks (e.g., business process defects, opportunities for AI-automated tooling, operations compliance nonconformance, troubleshooting and resolving time-sensitive network issues)
- Support detailed planning with management, and the reporting of all regulated controls throughout the relevant project lifecycles
- Participate in team oncall rotation and improve issue/event escalation and emergency/incident response, including detailed after-action reviews to prevent future recurrences
- Build cross-functional relationships and deliver results with internal and external partner teams on all coordination, colocation operations, and compliance issues, including with vendors, contractors, and stakeholders/partners
- Support real-world operations and production challenges that affect network capacity and reliability, and apply the lessons learned to improve current and future generation coordination products and business processes
- Up to 15 percent travel (domestic and international)
Minimum Qualifications
- Lean Six Sigma or related certification (green belt or higher)
- Experience working within a global team and collaborating with cross-functional teams in a fast-paced and dynamic environment with limited supervision
- Experience implementing and maintaining monitoring, alerting, and repair systems for production networks in a DevOps environment
- Strong organization/multitasking skills, ability to adapt to quickly changing priorities, and excellent verbal and written communication skills
- 7+ years of experience with network operations and/or operations leadership while supporting large-scale operations in heterogeneous network environments
- Project Management Professional (PMP) or related certification
- Experience with networking protocols and concepts, including BGP/OSPF, TCP/IP, and IPv6
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- Familiarity and experience with the creation and management of Standard Operating Procedures (SOPs), Methods of Procedure (MOPs), and policies, alongside comprehensive technical and operations guidance and knowledge management practices
- Some coding experience in Python, or a willingness to learn
- Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies
- Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
- M.S. in Computer Science/Information Technology, Computer Engineering, Operations leadership, or a related discipline, or an equivalent mix of business process and technical experience
- Familiarity with physical infrastructure design: rack elevations, cable types, connector types, optic types, patch panels, power/cooling, and facility infrastructure
- Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
- Understanding of AI methods and tools, as well as training workloads and the demands they exert on the network
- Knowledge of data-driven analysis and analytics applied to a full project lifecycle
- Experience in providing technical and operational guidance to internal partners and external vendors/MSPs
- Working knowledge of project and ticket management tools
- Certificates and/or demonstrated experience in technical operations management, compliance, and auditing