What are the responsibilities and job description for the Senior Software Engineer, Systems Engineering and Administration position at Microsoft Power Platform Community?
Overview
Microsoft is looking for a Senior Software Engineer, Systems Engineering and Administration to join the System Operations team, part of Xandr’s Real Time Bidding platform. The platform operates a hybrid environment of global datacenters and Azure. Software engineers on the System Operations team ensure this platform operates efficiently and error-free with automation, troubleshooting, monitoring, and scales the toolset and resources to match platform growth.
In this role, you are responsible for ensuring the operational integrity of tens of thousands of systems distributed globally. These systems include DNS, DHCP, Puppet, Kubernetes, Docker, CDN properties, NGINX load balancer clusters, and all the automation and APIs in between that allow us to manage them. The System Operations team ensures all components on the platform work together seamlessly to handle hundreds of thousands of requests per second. With such a diverse technology landscape, automation and consistency is key here. You will develop and utilize tools that ensures the right changes are deployed to the intended systems in an automated fashion. There will be plenty of opportunities leverages your ability in finding the most efficient solutions to manage our growing technology stack, as well as expand your experience working with, among other things, data center management tooling, provisioning automation, configuration management, orchestration, and cybersecurity. This opportunity allows flexible hours and remote work.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
Required Qualifications:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until February 21, 2025.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#MicrosoftAI
Microsoft is looking for a Senior Software Engineer, Systems Engineering and Administration to join the System Operations team, part of Xandr’s Real Time Bidding platform. The platform operates a hybrid environment of global datacenters and Azure. Software engineers on the System Operations team ensure this platform operates efficiently and error-free with automation, troubleshooting, monitoring, and scales the toolset and resources to match platform growth.
In this role, you are responsible for ensuring the operational integrity of tens of thousands of systems distributed globally. These systems include DNS, DHCP, Puppet, Kubernetes, Docker, CDN properties, NGINX load balancer clusters, and all the automation and APIs in between that allow us to manage them. The System Operations team ensures all components on the platform work together seamlessly to handle hundreds of thousands of requests per second. With such a diverse technology landscape, automation and consistency is key here. You will develop and utilize tools that ensures the right changes are deployed to the intended systems in an automated fashion. There will be plenty of opportunities leverages your ability in finding the most efficient solutions to manage our growing technology stack, as well as expand your experience working with, among other things, data center management tooling, provisioning automation, configuration management, orchestration, and cybersecurity. This opportunity allows flexible hours and remote work.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
- Engineer solutions for various projects and operational needs required by our customers. Perform code reviews to changes to the infrastructure configuration files and updates to tools libraries and APIs (Application Programming Interface).
- Manage remote servers and configure hardware, peripherals, services, settings, storage, etc. in accordance with standards and project/operational requirements.
- Provide support to technical and non-technical customers from various constituencies. Investigate and troubleshoot issues with hardware failures, application performance and software debugging. Coordinate and engage with other teams during incidents or planned maintenance. Handle escalations on technical issues and operations from global team members.
- Maintain ZFS (Zettabyte File System) storage clusters to ensure data redundancy and business continuity.
- Provide mentorship, coaching and guidance to teammates.
- Maintain documentation for the infrastructure software stack.
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 4 years writing scalable tools using scripting languages like Python, Perl, Go, and Shell
- OR equivalent experience.
- 4 years of experience in supporting Debian-based Linux distributions such as Ubuntu and operating container technologies, especially Kubernetes and Docker, in a distributed systems environment.
- 4 years of working with Dell hardware such as the PowerEdge C and R systems.
- 3 years of developing Infrastructure as code, execution plans, resource graph and change automation using Terraform, Puppet, and/or Ansible.
- 3 years of experience in configuring and using observability and monitoring tools such as Nagios, Graphite, Prometheus, Grafana, tcpdump, Azure Data Explorer, and/or Splunk.
- 3 years of configuring and managing different implementations NGINX and NGINX-plus.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Bachelor's Degree in Computer Science or related technical field or equivalent experience and 8 years writing scalable tools using scripting languages like Python, Perl, Go, and Shell.
- 5 years of experience in supporting Debian-based Linux distributions such as Ubuntu and operating container technologies, especially Kubernetes and Docker, in a distributed systems environment.
- 5 years of working with Dell hardware such as the PowerEdge C and R systems. Familiarity with out-of-band management tools such as iDRAC or OpenIPMI is a plus.
- 5 years of developing Infrastructure as code, execution plans, resource graph and change automation using Terraform, Puppet, and/or Ansible.
- 5 years of experience in configuring and using observability and monitoring tools such as Nagios, Graphite, Prometheus, Grafana, tcpdump, Azure Data Explorer, and/or Splunk.
- 5 years of configuring and managing different implementations NGINX and NGINX-plus.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until February 21, 2025.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#MicrosoftAI
Salary : $117,200 - $250,200