Sr. Site Reliability Engineer
Company: Delta Air Lines, Inc.
Posted on: June 12, 2021
Delta Air Lines is looking for a talented IT System Engineer to
work on its technology environment. Your primary functions will
be to support new and existing software systems, applications,
and associated databases used by service providers and consumers,
and to provide application monitoring and detailed performance and
reliability analysis and reporting for the Revenue framework (flight
and ancillary product services/applications) and the Retail platform
(the experience layer that interfaces with consumers/clients).
As a key player in this role, you will create, fix, extend, and
scale the code to keep it working and to harden it against
loopholes. We drive reliability and performance at massive
scale. You will work on software development projects to keep
important revenue-critical systems up and running, from code-level
troubleshooting of traffic anomalies to maintenance of our most
cutting-edge services; from monitoring and alerts to building new
Overview of Responsibilities:
Provide application monitoring and detailed performance and
reliability analysis and reporting for all the services that the
Revenue framework and Retail platform support, using tools
including, but not limited to: Dynatrace, Sumo Logic, Prometheus, Grafana,
Enhance and maintain synthetic monitoring to capture performance
for new functionality and new requirements for performance and
Support the operationally critical environment using monitoring
tools and scripts, data feeds and associated scripts, research and
analysis of production issues, and log capture.
Participate in application load tests and assist with
Update support documentation and wiki pages, operating plans,
infrastructure diagrams, and assist in PCI audits as required.
Influences and monitors technical application performance for
large-scale, technical initiatives and/or projects requiring
integration of cross-functional systems.
Provides technical guidance in evaluating applications systems
or evaluating customer reports through performance and reliability
Participate in deployment meetings and consult with team to
refine, test, and debug programs to meet technical needs.
Be flexible in supporting release planning, go/no-go meetings,
stakeholder sign-offs, and launch activities, creating the
implementation plan and executing it with POs, SMs, ADs and
Understands servers and databases and related architecture
requirements and ensures those requirements can be achieved and
maintained through high-quality deliverables.
Develop proofs of concept and propose solutions to
architecture and tech leads.
Enhance and maintain role definitions in partnership with
business team owners as needs change
Operationally support the addition and/or removal of software
from the defined role(s)
Coordinate renewal of vendor licenses
Influences and evaluates software for inclusion into the role(s)
Provides technical guidance in evaluating hardware systems or
evaluating timing of updates
Coordinate patching across multiple onshore and offshore IT teams
for enterprise updates and Microsoft and security patches
What you need to succeed (minimum qualifications)
Consistently makes safety and security, of self and others, the
Embraces diverse people, thinking and styles.
At least 3-5 years of experience in IT Desktop Role
Management; thorough understanding of desktop-based technology,
utilizing network systems related to the enterprise view.
Good working knowledge of shell scripting, DevSecOps concepts and
Working knowledge of different authentication methods
Ability to think of end to end solutions rather than within a
Experience working with AWS cloud services and legacy on-prem
setups (WAS, JBoss, etc.).
Necessary interpersonal skills include: the ability to work in an
environment with minimal supervision while forging relationships
with key stakeholders, and the ability to clearly articulate an
approach and the mindset behind having alternate approaches
Necessary analytical and process skills include: a clear
understanding of Agile processes and experience working in an Agile
environment, experience with user story management tools like
VersionOne, and a good understanding of OpenShift PaaS
Major Skills and Competencies:
Communication Skills - The ability to communicate verbally and
in writing with all levels of employees and management, capable of
successful formal and informal communication, speaks and writes
clearly and understandably at the right level
Integrity and Trust - Involves being widely trusted, being seen
as a direct, truthful individual, can present the unvarnished truth
in an appropriate and helpful manner, keeps confidences, admits
mistakes, and doesn't misrepresent him/herself for personal
Teamwork - Involves working well in a collaborative setting,
supporting work team by volunteering for and completing
assignments, acting as a positive team member by contributing to
discussions, developing and maintaining both formal and informal
relationships enterprise-wide, defines success in terms of the
entire team through mentoring and knowledge transfer
Technical Expertise - Involves demonstrating a commitment to
increasing knowledge and skills in current technical/functional
area, keeping up to date on technical developments, staying
informed as to industry practices, knowing how to apply relevant
technical processes to appropriate business needs
Dedication - Involves demonstrating a desire to dedicate time
and energy to accomplish goals, tasks, assignments, etc. Will do
what it takes to get things done.
Flexibility - Is open to change, enjoys the challenge of
unfamiliar tasks, anticipates and adjusts to problems and
roadblocks, is not thrown off when things change, can flex to
future consequences and trends appropriately.
Problem Solving - Uses rigorous logic and methods to solve
difficult problems with effective solutions, probes all fruitful
sources for answers, can see hidden problems, is excellent at
honest analysis, looks beyond the obvious and doesn't stop at the
Self-Development - Is actively committed to continuously improve
him/herself, understands that different situations and levels may
call for different skills and approaches, knows personal strengths,
weaknesses, opportunities and limits, works on compensating for
weaknesses and limitations, seeks feedback, gains insights from
mistakes, is open to criticism without being defensive.
Task Management - Delivers quality work on time, translates
planning into action by following applicable established procedures
or methodologies, proactively monitors and controls task status by
collecting and analyzing task data to anticipate and address
barriers, appropriately communicates and resolves or escalates any
problems that arise.
Ability to clearly articulate approach and mindset behind having
Experience in supporting applications teams and troubleshooting
in DEV, SI and production environments
Clear understanding of Agile processes and experience working in an
Agile environment; experience with user story management tools like
The successful candidate must be a self-motivator and be able to
perform work with little guidance or instructions. Multiple tasks
may be assigned which will require managing/prioritizing
Proven problem-solving skills required
Candidate must have attention to detail, and be methodical in
carrying out responsibilities
Collaborate on scalability issues involving access to data and
Utilize exposure to large-scale production software
Help maintain mission critical services
Must have received or be willing to receive the COVID-19
vaccine by date of hire to be considered for U.S.-based job
What will give you a competitive edge (preferred qualifications)
Bachelor's degree in Computer Science, Information Systems, or a
related field is preferred.
Keywords: Delta Air Lines, Inc., Atlanta, Sr. Site Reliability Engineer, Other, Atlanta, Georgia