[Remote] Full Stack ML Efficiency & Observability
Note: The job is a remote job and is open to candidates in USA. Microsoft AI is looking for a Member of Technical Staff - Full Stack Engineer, ML Efficiency & Observability to help efficiently manage compute capacity. The role involves designing and developing features for capacity management and model performance visibility while collaborating with ML researchers and product managers to create intuitive user experiences.
Responsibilities
- Design and develop features for our capacity management portal
- Design and develop features to provide visibility into model performance and quality across our fleet
- Partner with ML researchers and PMs to translate functional requirements into highly functional, intuitive and appealing interfaces
- Integrate with backend APIs from schedulers to training frameworks to build visibility across the training life cycle
- Explore, develop, and adapt new innovations to the software development process
- Contribute to the development of internal tooling and infrastructure
- Implement best software development practices to ensure code quality. Hold a high quality bar
- Embody our culture and values
Skills
- Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
- 4+ years experience in business analytics, data science, software development, data modeling or data engineering work
- Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
- OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years of business analytics, data science, software development, data modeling or data engineering work experience
- OR equivalent experience
- Experience with Capacity Management, Efficiency Management, ML Training and/or Inference
- Solid expertise in JavaScript / TypeScript, React, HTML, CSS and browser internals
- Solid understanding of web performance, accessibility, and cross‑browser compatibility
- Experience with Development & Debugging with dev environments like Visual Studio or Visual Studio Code
- Software development experience with Generative AI tools
- Experience in leading technical projects and supporting architectural decisions with data
Benefits
- Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Company Overview
Company H1B Sponsorship