Recent Searches

You haven't searched anything yet.

49 Senior Site Reliability Engineer, Observability Jobs in New York, NY

SET JOB ALERT
Details...
CGI Technologies and Solutions, Inc.
New York, NY | Full Time
$131k-153k (estimate)
3 Days Ago
Capital One
New York, NY | Full Time
$146k-172k (estimate)
1 Week Ago
CoreWeave
New York, NY | Full Time
$125k-146k (estimate)
5 Days Ago
CoreWeave
New York, NY | Full Time
$126k-146k (estimate)
2 Days Ago
StubHub
New York, NY | Full Time
$125k-146k (estimate)
1 Month Ago
AlphaSense
New York, NY | Full Time
$124k-140k (estimate)
4 Months Ago
BULL-IT SOLUTIONS LTD
New York, NY | Full Time
$125k-146k (estimate)
2 Weeks Ago
Stellar Development Foundation
New York, NY | Full Time
$101k-125k (estimate)
3 Weeks Ago
soho square solutions
New York, NY | Full Time
$125k-146k (estimate)
1 Month Ago
GlossGenius
New York, NY | Full Time
$119k-143k (estimate)
1 Month Ago
Crisis Text Line
New York, NY | Full Time
$139k-164k (estimate)
1 Month Ago
Wells Fargo
NEW YORK, NY | Full Time
$141k-162k (estimate)
1 Month Ago
CIRCLE
New York, NY | Full Time
$133k-162k (estimate)
1 Month Ago
Fastly
New York, NY | Full Time
$109k-132k (estimate)
2 Months Ago
Cherre
New York, NY | Full Time
$144k-166k (estimate)
3 Months Ago
TekNavigators Staffing
New York, NY | Contractor
$136k-154k (estimate)
4 Days Ago
ZocDoc
New York, NY | Full Time
$130k-149k (estimate)
4 Days Ago
TekNavigators Staffing
New York, NY | Contractor
$122k-153k (estimate)
1 Week Ago
Radley James
New York, NY | Full Time
$103k-122k (estimate)
1 Week Ago
Oakland Search
New York, NY | Full Time
$111k-128k (estimate)
1 Week Ago
Quanta Search
New York, NY | Full Time
$114k-139k (estimate)
1 Month Ago
Quanta Search
New York, NY | Full Time
$114k-139k (estimate)
1 Month Ago
Quanta Search
New York, NY | Full Time
$114k-139k (estimate)
1 Month Ago
Edward Daniels Group
New York, NY | Full Time
$122k-138k (estimate)
3 Months Ago
AYR Global IT Solutions Inc
New York, NY | Full Time
$125k-141k (estimate)
3 Months Ago
Justworks
New York, NY | Full Time
$105k-123k (estimate)
3 Months Ago
Virtu Financial Inc.
New York, NY | Full Time
$106k-124k (estimate)
4 Months Ago
Sesame Workshop – Temporary
New York, NY | Temporary
$105k-120k (estimate)
6 Months Ago
Senior Site Reliability Engineer, Observability
CoreWeave New York, NY
$126k-146k (estimate)
Full Time 2 Days Ago
Save

CoreWeave is Hiring a Senior Site Reliability Engineer, Observability Near New York, NY

CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry's fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at www.coreweave.com.
The Observability Team performs a critical role in enabling CoreWeave to understand, troubleshoot, and optimize complex systems by providing comprehensive insights into their behavior and performance. This team is responsible for the development, integration, and operation of observability platforms with the ultimate objective of enabling engineers across CoreWeave to do more, better. Central to the Observability Teams mission is the operation of our observability stack which leverages CoreWeave's deep investment in the Kubernetes ecosystem.
We are seeking a senior engineer with specialization in the observability stack who can help us execute on the mission of providing a comprehensive logging and metrics ecosystem that is deeply integrated with CoreWeave's Kubernetes platform. Integrating logging, metrics, tracing, and monitoring tools for proactive insights into system performance. This individual will work with a team of 6-8 engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Observability Team you will have the opportunity to:
  • Design and implement the platform that improves visibility into how the services are performing and operating.
  • Improve the performance, security, reliability, and scalability of our observability, and related services and participate in the teams on-call rotation.
  • Assist engineers in maximizing the observability stack to gain insights into the service's functionality and operation.
  • Develop dashboards, alerts, and insights into the customer experience using Grafana-ecosystem tools such as Mimir and Loki.
  • Develop meaningful insights by analyzing the gathered data.
  • Enable and evangelize the best practices around alerting. Collaborate with teams to establish observability standards.
  • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.
Wondering if you're a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we've found compatible with our team. If a portion of this resonates with you, we'd love to talk.
  • You have four or more years of experience in a software or infrastructure engineering industry.
  • You enjoy helping your colleagues achieve more with less effort.
  • You have experience operating services in production and at scale and are versed in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
  • You have experience using Kubernetes with a conceptual understanding of its major components, and/or have operated Kubernetes clusters at scale for both event-driven and stateful orchestration.
  • You're familiar with various logging and metrics systems like Prometheus, ELK, Victoria Metrics, Thanos or Grafana. You have experience with designing and operating these systems at scale.
  • You are familiar with PromQL, any other querying language and enjoy understanding the data model for observability systems.
  • You're comfortable with the idea of using Go as your primary programming language.
  • You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
  • You can transform problems in elastic solutions, decompose them into achievable tasks, and socialize both to your teammates.
  • You're excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $175,000-$210,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.
Hybrid WorkplaceSuccessful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.
If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.
Why CoreWeave?
At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:
  • Be Curious at your Core
  • Act like an Owner
  • Empower Employees
  • Deliver Best In-Class Client Experience
  • Achieve More Together
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!
BenefitsWe offer a competitive salary and benefits, including:
  • Medical, dental and vision insurance - 100% paid for the employee
  • Company paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Tuition Reimbursement
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our offices
  • Weekly massages in NJ office
  • A casual work environment
  • Work culture focused on innovative disruption
California Consumer Privacy Act - California applicants only
CoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.

Job Summary

JOB TYPE

Full Time

SALARY

$126k-146k (estimate)

POST DATE

06/26/2024

EXPIRATION DATE

07/19/2024

WEBSITE

coreweave.com

HEADQUARTERS

New York, NY

Show more

CoreWeave
Remote | Full Time
$155k-187k (estimate)
1 Day Ago
CoreWeave
Remote | Full Time
$158k-190k (estimate)
1 Day Ago
CoreWeave
Full Time
$65k-82k (estimate)
1 Day Ago