Fast Facts
The College Board is seeking a Lead Engineer for its Enterprise Incident & Change Management team to enhance incident and change processes, focusing on automation and operational excellence in a remote role.
Responsibilities: Key responsibilities include designing and implementing incident and change management frameworks, automation solutions, and KPIs, collaborating across teams to align technology strategies with business objectives, and mentoring team members in best practices.
Skills: Required skills include software development experience, proficiency in Infrastructure as Code (IaC) tools, cloud infrastructure (AWS), coding/scripting for automation, and strong problem-solving capabilities.
Qualifications: Preferred qualifications include 7+ years of relevant experience, familiarity with ITIL frameworks, incident management systems, and monitoring tools, as well as strong communication skills and a collaborative mindset.
Location: Remote - Virginia, United States of America
Compensation: $168,000 - $183,000 annually
Lead Engineer, Enterprise Incident & Change Management
College Board–Technology, InfoSec & Infrastructure
Location: Remote
Type: This is a full-time position
About the Team
The Enterprise Incident Management team at College Board is dedicated to minimizing the impact of incidents on business operations and swiftly restoring normal service. The team is also dedicated to ensuring changes are effectively tracked across the enterprise and supporting observability of College Board applications and services. Leading these efforts, our team handles policies, integration, automation, and collaboration, ensuring seamless operations, effective communication, and rapid resolution to maintain operational excellence. We establish effective protocols, manage tools and resources, thoughtfully distribute tasks, and guide teams through the resolution process. But we don’t stop there: we continuously strive to improve excellence with an eye towards leveraging opportunities for automation and process improvement, ensuring our deliverables are more effective and efficient in the future. As excellent communicators and collaborators, we work seamlessly with all facets of the College Board, both in technology and business. We celebrate individual contributions while ensuring our success as a cohesive team.
About the Opportunity
As a Lead Engineer on the Enterprise Incident Management team, you are a seasoned technical leader and problem solver. You understand cloud software delivery, tools, and processes that empower efficient and resilient delivery that adheres to top-notch development practices. You thrive in an environment with a strong mix of creativity and productivity. You are technologically curious and seek out opportunities to apply your knowledge to improving processes and operations, as well as opportunities to research emerging technologies and trends, standards, and products. Your eagerness and vision enable you to learn, create, and improve complex solutions. Your excellent communication and mentoring skills allow you to effectively articulate solutions to the team while bringing them up to speed and enabling all to make contributions that improve delivery.
In this role, you will:
Design and Implementation (60%)
- Evaluate incident and change management frameworks using data-driven insights to identify opportunities for improvement that will provide value to the EIM team and engineering teams.
- Design and implement automation solutions for incident response and management, change management, and observability, leveraging input and feedback from domain SMEs and end users.
- Develop and maintain scripts, tools, and integrations to reduce manual processes and operational overhead.
- Define key performance indicators (KPIs) and metrics to measure the success of automation and improvement efforts, and develop and enhance dashboards and reporting mechanisms to measure KPIs as well as incident and change management performance.
- Ensure compliance with governance, risk, and change control policies while promoting agility and innovation.
- Lead cross-functional initiatives and partner with domain SMEs (delivery team software engineers, security, infrastructure, network, observability, and operations) to analyze, design, and deliver powerful features, capabilities, and automation strategies that align with engineering best practices.
- Serve as a subject matter expert (SME) for cloud operations, infrastructure automation, and CI/CD pipelines.
Strategy, Operations Support, and Communication (25%)
- Collaborate with the EIM team’s director and other technology leaders to understand business objectives and team goals and to align solutions and process improvement efforts with those goals.
- Contribute to the long-term technology strategy by researching emerging trends, evaluating new tools (especially AI-driven tools that support observability), and recommending technologies or automations that improve cost-effectiveness, metrics delivery to evaluate performance, and system and process efficiency.
- Participate in weekly on-call and incident response rotations responsible for monitoring alerts to identify potential issues, ensuring timely triage and escalation of incidents, collaborating with impacted teams, and supporting assessment, response, and communication to bring the incident to resolution.
- Play an active role in agile scrum ceremonies (e.g., sprint planning, grooming, daily scrum meetings) while contributing to high-quality team deliverables.
Team Coordination (15%)
- Provide technical direction and guidance to team members, ensuring alignment with architectural standards, best practices, and organizational objectives.
- Review designs, automation scripts, and implementation plans, offering constructive feedback to improve quality, efficiency, and maintainability.
- Foster a culture of continuous learning and collaboration by mentoring engineers in modern automation, cloud infrastructure, and operational excellence.
About You
- 7+ years of software development experience with Infrastructure as Code (IaC), CI/CD frameworks, immutable infrastructure, automation, orchestration, and other modern DevOps patterns.
- Strong proficiency in IaC tools (e.g., Terraform, CloudFormation, Ansible) and experience with CI/CD pipeline design and automation using platforms such as Jenkins, GitLab CI, or GitHub Actions is a plus.
- Strong knowledge and experience with distributed cloud infrastructure, including AWS resources such as Lambda, SNS, SQS, S3, Step Functions, EC2, ECS, VPC, IAM, CloudWatch, and DynamoDB.
- Experience building event-driven cloud-based serverless applications, with technical knowledge of cloud computing, DevOps, and microservices.
- Strong coding/scripting experience for automation and integration tasks using tools (e.g., JavaScript, TypeScript, React.js, and Node.js) and proficiency in scripting languages (Python, Bash, PowerShell, etc.).
- Familiarity with AI tools used for observability (e.g., AWS Resilience Hub).
- Familiarity with incident and change management systems (e.g., Jira Service Management).
- Deep understanding of ITIL frameworks, especially incident, change, and problem management.
- Experience integrating monitoring and alerting tools (e.g., Datadog, Prometheus, CloudWatch, Grafana).
- Strong troubleshooting, analytical, and problem-solving skills.
- Proven ability to lead technical initiatives, influence cross-functional teams, and prioritize and execute tasks in a high-pressure environment.
- Excellent communication skills, with the ability to translate technical details into business outcomes.
- Ability to take a weekly on-call shift every month and a half.
- Authorization to work in the U.S.
All roles at College Board require:
- A passion for expanding educational and career opportunities and mission-driven work
- Authorization to work in the United States for any employer
- Curiosity and enthusiasm for emerging technologies, with a willingness to experiment with and adopt new AI-driven solutions and a comfort learning and applying new digital tools independently and proactively.
- Clear and concise communication skills, written and verbal
- A learner's mindset and a commitment to growth: welcoming diverse perspectives, giving and receiving timely, respectful feedback, and continuously improving through iterative learning and user input.
- A drive for impact and excellence: solving complex problems, making data-informed decisions, prioritizing what matters most, and continuously improving through learning, user input, and external benchmarking.
- A collaborative and empathetic approach: working across differences, fostering trust, and contributing to a culture of shared success.
About Our Process
- Application review will begin immediately and will continue until the position is filled. This role is expected to accept applications for a minimum of 5 business days.
- While the hiring process may vary, it generally includes: resume and application submission, recruiter phone/video screen, hiring manager interview, performance exercise such as live coding, a panel interview, a conversation with leadership, and reference checks.
What We Offer
At College Board, we offer more than just a paycheck: we provide a meaningful career, a supportive team, and a comprehensive package designed to help you thrive. We’re a self-sustaining nonprofit that believes in fair and competitive compensation, grounded in your qualifications, experience, impact, and the market.
A Thoughtful Approach to Compensation
- The hiring range for this role is $168,000–$183,000.
- Your exact salary will depend on your location, experience, and how your background compares to others in similar roles at the College Board.
- We aim to make our best offer upfront, rooted in fairness, transparency, and market data.
- We adjust salaries by location to ensure fairness, no matter where you live.
You’ll have open, transparent conversations about compensation, benefits, and what it’s like to work at College Board throughout your hiring process. Check out our careers page for more.