Site Reliability Engineer

Edible Arrangements, LLC
Atlanta, GA Full Time
POSTED ON 1/22/2025
AVAILABLE BEFORE 3/22/2025

Senior Site Reliability Engineer (SRE) 

 

Who are we and what do we do?   

  

Fruit was just the beginning. Since our founding in 1999, we’ve evolved over 25 years into an industry leader and modern gifting destination for celebrating the moments that matter. In addition to a robust online e-commerce hub, our vast retail footprint includes nearly 1,000 locally owned and operated franchise locations globally.   

 

With offerings that go beyond our iconic fresh fruit bouquets to include baked treats, fresh flowers, dessert boards, platters, and more, our vast collection of delicious treats and innovative gifts is perfect for treating yourself and others.

No matter the occasion or moment, there’s an edible® for that.  

  

Through all our incredible years, we’ve remained committed to our 5Ps:     

• Our promise– Experiences that WOW.  

• Our products– Remarkably fresh.

• Our places– Interactive and creative.  

• Our people– Create special memories.     

• Our purpose– To celebrate what’s good in life.

 

Purpose:  

As a Senior Site Reliability Engineer (SRE), you will be responsible for ensuring the resilience and reliability of our e-commerce applications through monitoring, automation, and proactive site maintenance. You will leverage Datadog, Azure Application Insights, and other industry-standard tools to develop robust monitoring systems that enhance site awareness, detect and respond to incidents, and maintain high availability. You will also drive collaboration across engineering teams to build a proactive approach to system health, site reliability, and incident management. 

Location Requirements: Onsite at our Sandy Springs, GA Corporate Office 4 days a week, working from home Fridays.

Responsibilities: 

  • Develop, implement, and manage monitoring and alerting systems using Datadog, Azure Application Insights, and other related technologies to gain real-time awareness of system health and potential issues.
  • Ensure integration of Datadog with .NET, Node.js and React-based applications for comprehensive monitoring of application performance and health.
  • Establish proactive monitoring practices to reduce site outages, gain insight into system performance, and identify blockers within Azure DevOps pipelines.
  • Design and implement Standard Operating Procedures (SOPs) to effectively respond to and resolve incidents, minimizing downtime and ensuring prompt recovery.
  • Collaborate with engineering and product teams to establish and execute comprehensive incident response plans, focusing on improving the availability, performance, and reliability of e-commerce platforms.
  • Optimize Azure DevOps pipelines to ensure blockers, errors, and any build issues are proactively addressed, enhancing site deployment efficiency and reliability.
  • Maintain and improve application performance and resilience through enhancements in Azure Application Services, Azure Front Door, and Azure Application Gateway.
  • Execute SQL queries to assess and troubleshoot database performance and availability issues related to the operational health of the site.
  • Work closely with developers to ensure that monitoring tools are embedded effectively into the development cycle and are aligned with the business needs.
  • Create detailed documentation, including SOPs, best practices, incident management guides, and monitoring configurations.
  • Stay current with emerging monitoring technologies and identify opportunities to apply them to enhance the platform's reliability and scalability.
  • Promote a culture of learning and proactive improvement through root cause analysis and post-incident reviews to prevent repeat occurrences.

Requirements: 

  • 5 years of experience in Site Reliability Engineering, preferably within an e-commerce or high-traffic web application environment.
  • Strong expertise with Datadog, including setting up integrations, creating custom metrics, dashboards, and alerts, specifically in .NET, Node.js, and React applications.
  • Proven experience with Azure Application Insights, Azure DevOps, and the ability to implement monitoring and alerting solutions in cloud environments.
  • Hands-on experience managing and optimizing Azure App Services, Azure Front Door, Azure Application Gateway, and SQL databases from a resilience and performance standpoint.
  • Familiarity with SOP development for incident management, proactive monitoring, and site reliability.
  • Knowledge of CI/CD pipelines in Azure DevOps, and experience in identifying and resolving build blockers and pipeline issues.
  • Strong skills in writing SQL queries to diagnose and resolve issues.

Essential Competencies: 

  • Excellent interpersonal skills, with an emphasis on collaboration, clear communication, and the ability to explain technical concepts to non-technical stakeholders.
  • Ability to work in a fast-paced environment, with strong analytical and problem-solving skills, and a proactive mindset towards automation and improvement.

What will set you apart: 

  • Advanced certifications in Azure (e.g., Azure DevOps Engineer Expert, Azure Solutions Architect).
  • Extensive experience with high-traffic e-commerce applications and a track record of ensuring uptime and resilience.
  • Experience with other monitoring and observability tools (e.g., Grafana, Prometheus) is a plus.

 

What We Offer:    

  • Onsite work environment with work-from-home flexibility, fostering collaboration and relationship building with peers, cross-functional partners and leadership.    
  • The stability and resources of an industry-leading company successfully operating for 25 years, with the agility and innovation of a startup, allowing you to make a significant impact and shape our future.  
  • Growth & Development – Each team member has a visible and immediate impact on the business, offering abundant opportunities for personal and professional growth as we scale in size and sophistication.  
  • Benefits that include health/dental/vision insurance, a 401(k) plan, company-paid life insurance and short-term disability, flexible spending account options and more.
  • Paid time off, including sick days & holidays to support work-life balance.  

   

We are proud to be an EEO/AA employer. Applicants for employment are considered without regard to race, creed, color, religion, sex, sexual orientation, marital status, national origin, age, disability, or status as a veteran, Vietnam Era Veteran, or member of the Reserves or National Guard.

 

 

 

 

