Site reliability engineer - Production Support Lead

Spar Information Systems
Frisco, TX Full Time
POSTED ON 1/15/2025
AVAILABLE BEFORE 3/11/2025

Job Details

Hello Everyone,

I hope you are doing well.

My name is Pavan, and I work with SPAR Information Systems. I have a great opportunity for you; please find the job details below. If you are interested in applying, please send me your updated resume and the best time to discuss this opportunity in detail.

Senior Production Support Lead
Location - Atlanta, GA / Frisco, Texas

Duration: Long term contract


Onsite Requirement - Prefer onsite only
Number of days onsite - 3 days

Must Have Skills:
Skill 1 (5 yrs of exp): IT support, production support, or system administration in a complex environment, with at least 2 years in a leadership or supervisory role.
Skill 2 (6 yrs of exp): ITIL processes, particularly incident management, change management, and problem management.
Skill 3 (6 yrs of exp): Monitoring tools (e.g., Splunk, AppDynamics, New Relic, or similar) and experience in log analysis for troubleshooting.
Skill 4 (6 yrs of exp): Scripting skills (e.g., Python, Shell scripting) for automating routine tasks and improving operational efficiency.
Skill 5 (6 yrs of exp): Database systems (SQL, Oracle, etc.) and experience with database troubleshooting in a production environment.
Skill 6 (6 yrs of exp): Familiarity with cloud platforms (AWS, Azure, Google Cloud Platform) and containerization technologies (Docker, Kubernetes) is a plus.

Job Summary: We are seeking a Senior Production Support Lead to oversee the daily operations of our production environments and ensure smooth and efficient functionality of critical applications and systems. The ideal candidate will have strong technical troubleshooting skills, a solid understanding of production environments, and a proven ability to lead a team in resolving complex production issues quickly and effectively. As a Senior Production Support Lead, you will be responsible for managing escalations, overseeing incident management, driving performance improvements, and working closely with other teams such as development, infrastructure, and business operations.

Key Responsibilities:

Lead Production Support Operations: Manage and lead the production support team to ensure the stability and availability of critical production systems. Oversee monitoring, incident management, and resolution of production issues.

Incident and Problem Management: Coordinate and lead efforts to resolve production incidents promptly, ensuring minimal business impact. Manage the root cause analysis (RCA) process for recurring issues and work towards preventive solutions.

System Monitoring and Optimization: Continuously monitor system performance, identify potential bottlenecks or issues, and take proactive measures to improve system performance and reliability.

Escalation Handling: Serve as the point of escalation for complex production issues, providing guidance and expertise in troubleshooting and resolution.

Collaboration with Development and Infrastructure Teams: Work closely with the development, QA, and infrastructure teams to ensure smooth production deployments, patch management, and post-deployment monitoring.

SLA Adherence: Ensure that SLAs are met for all production issues, including response and resolution times. Track and report on SLA performance metrics regularly.

Team Leadership: Provide mentorship and guidance to junior team members, conduct regular team meetings, and facilitate knowledge-sharing sessions to build a high-performing support team.

Documentation & Knowledge Management: Maintain up-to-date knowledge base articles, troubleshooting guides, and standard operating procedures (SOPs). Ensure proper documentation of all incidents, changes, and resolutions.

Change Management: Assist in managing changes in production environments by ensuring thorough testing and validation of changes, and providing post-implementation support.

Continuous Improvement: Drive continuous improvement initiatives within the production support process. Identify opportunities to automate repetitive tasks, enhance system reliability, and optimize operational workflows.

Qualifications & Skills:

Bachelor's Degree in Computer Science, Information Technology, Engineering, or a related field (or equivalent experience).

5 years of experience in IT support, production support, or system administration in a complex environment, with at least 2 years in a leadership or supervisory role.

Strong technical troubleshooting skills in areas such as application monitoring, databases, network, and server infrastructure.

Experience with ITIL processes, particularly incident management, change management, and problem management.

Proficiency with monitoring tools (e.g., Splunk, AppDynamics, New Relic, or similar) and experience in log analysis for troubleshooting.

Scripting skills (e.g., Python, Shell scripting) for automating routine tasks and improving operational efficiency.

Strong understanding of database systems (SQL, Oracle, etc.) and experience with database troubleshooting in a production environment.

Familiarity with cloud platforms (AWS, Azure, Google Cloud Platform) and containerization technologies (Docker, Kubernetes) is a plus.

Strong communication skills, with the ability to explain technical issues to non-technical stakeholders and produce clear incident reports.

Leadership and Team Management: Proven ability to mana

Pavan Raikhelkar

LEAD TALENT ACQUISITION SPECIALIST

Phone: x 323


(An E-verify Company)

NOTE: We respect your online privacy. This is not an unsolicited mail. Under Bill 1618 Title III, passed by the 105th US Congress, this mail cannot be considered spam as long as we include contact information and a method to be removed from our mailing list. If you are not interested in receiving our emails, please reply with "REMOVE" in the subject line. We apologize for any inconvenience caused by this mail.

