Principal Machine Learning Engineer
bunny.net is where individual imaginations gather together to solve next-generation internet challenges. Join a team of experts and, together, help shape the future of the internet. Here, you'll do more than join something — you'll add something.
We're not just saying that, but our culture really hops. We have established values that enable our fluffle to grow stronger with each new team member who joins us: Relentless Innovation. Integrity. Impact. Kindness. Diversity. Transparency & open communication. We also believe "good" is not good enough, and we continue striving to keep pushing things to the next level by learning and improving every single day. Oh, and did we mention that all our positions are 100% remote with a flexible working schedule? There's no need to relocate if you would like to join us, and you can carve out your own work schedule as well!
Principal Machine Learning Engineer
As a Principal Machine Learning Engineer, you will be at the forefront of building our next-generation, highly available global GPU cloud computing service. This service will power a diverse range of environments, including containerization and bare metal compute, providing a reliable platform for running AI workloads at scale. In this role, you will act as a key technical leader within our organization, driving innovation, solving complex problems, and influencing the technical direction of our AI infrastructure. Your expertise in infrastructure-as-code, software development, and systems knowledge will be critical in enabling bunny.net to continue its rapid growth and innovation in the AI space.
What will you do?
- Develop and Implement Technical Strategy: Collaborate with the Engineering Leadership team to design and implement the next-generation AI platform. Ensure that your contributions align with the company’s overall goals and objectives, driving innovation in global GPU cloud computing and AI infrastructure.
- Lead Technical Initiatives: Take ownership of key projects and initiatives, driving the design and implementation of robust, scalable AI systems. Act as a technical leader, setting the standard for excellence within the team.
- Drive Innovation: Stay at the cutting edge of AI and machine learning technologies. Identify and implement new tools, frameworks, and methodologies to enhance the performance, scalability, and reliability of our AI infrastructure.
- Ensure Reliability and Performance: Design and implement high-performance, fault-tolerant AI systems that can scale globally. Utilize your deep understanding of operating systems, networking, and distributed systems to optimize our platform for AI workloads.
- Collaborate with Cross-Functional Teams: Work closely with other engineering teams to integrate AI services into the broader bunny.net platform. Provide expert guidance and support to ensure seamless integration and operation.
- Solve Complex Technical Problems: Tackle some of the most challenging problems in AI infrastructure, from low-level optimization to large-scale distributed system design. Ensure that our platform can handle the demands of modern AI workloads efficiently.
- Mentor and Guide Junior Engineers: Provide mentorship and technical guidance to less experienced engineers, helping them develop their skills and advance in their careers. Share your knowledge and experience to foster a culture of continuous learning and improvement.
What will you add?
- Technical Expertise: Leverage extensive experience in high-performance computing, distributed systems, and AI infrastructure. Expertise in Python, containerization, and backend programming is crucial. Familiarity with frameworks such as TensorFlow, JAX, and open-source inference stacks like VLLM and TGI is a plus.
- Leadership Through Technical Excellence: Lead by example, demonstrating deep technical knowledge and a commitment to best practices. Inspire your peers by consistently delivering high-quality, innovative solutions to complex problems.
- Customer-Centric Approach: Ensure that our AI platform meets the needs of our internal and external customers. Focus on delivering robust, scalable services that enhance the overall developer experience.
- Data-Driven Decision Making: Use data analytics and performance metrics to guide decisions on system design and optimization. Continuously monitor and refine our AI infrastructure to ensure it meets the highest standards of performance and reliability.
- Strong Project Management Skills: Effectively manage complex projects, balancing technical demands with business objectives. Prioritize tasks, manage resources, and ensure that projects are delivered on time and within scope.
- Strategic Vision: Bring a strategic perspective to the development of our AI platform. Align your technical contributions with bunny.net's broader business goals, ensuring that our AI infrastructure supports our long-term growth and success.
- Adaptability: Thrive in a fast-paced, dynamic environment. Adapt quickly to new challenges and opportunities in the rapidly evolving field of AI and machine learning.
- Innovative Thinking: Continuously seek out new technologies and methodologies to improve our AI infrastructure. Challenge the status quo and drive innovation across the team.
- Collaborative Mindset: Work effectively across teams, fostering a culture of collaboration and shared success. Ensure that your technical contributions are integral to the overall success of bunny.net.
What will make us extra happy?
- Experience with Edge Computing: Hands-on experience with developing and deploying edge computing solutions, particularly in the context of AI workloads.
- Experience in a Fast-Paced Startup Environment: Proven ability to thrive in a dynamic, rapidly changing environment. Experience scaling systems in a startup setting is highly valued.
- Expertise in Agile Methodologies: Proficiency in Agile development practices, including Scrum and Kanban, with a track record of delivering complex projects on time.
- Strong Problem-Solving Skills: Demonstrated ability to solve complex technical challenges, particularly in high-performance or distributed systems. A knack for identifying root causes and implementing effective solutions.
- Excellent Communication Skills: Ability to communicate complex technical concepts to both technical and non-technical stakeholders. Strong written and verbal communication skills are essential for effective cross-team collaboration.
- Experience with Developer Experience (DevEx): A passion for improving developer tools, workflows, and productivity, especially in the context of AI and machine learning.
What is in it for you?
At bunny.net, we invest in you. We want to provide significant benefits while also helping you maintain a healthy work & personal life.
- A competitive salary. You can expect above-average industry salaries.
- 100% flexible working hours. Work from anywhere, at your own schedule.
- Medical, dental, and vision insurance covers you all over the world.
- Paid Time Off. Love visiting the seaside? No problem - us, too. We offer 25 days of paid vacation (plus your local public holidays on top).
- Parental Leave. Bonding with the little ones is super important, so we offer parental leave and flexible scheduling to ease your way into parenthood.
- Home Office Budget. We believe everyone is more efficient working in a comfortable office, and we're happy to help.
- Paid Sick Leave. Feeling ill? Take time off and get back to 100% without worrying about your finances.
- Mental Health Support. Feeling down? We've got your back. bunny.net provides mental health resources and help via our external partner.
- Educational Budget. Dedicated yearly learning and educational budget for books, seminars, and online courses.
- Wellbeing Budget: A yearly budget to invest in your personal health and wellness.
- Department
- Engineering
- Role
- Backend Engineer
- Locations
- ✽
- Remote status
- Fully Remote
About bunny.net
We are on a mission to help build and accelerate the internet of tomorrow. We obsess about customer experience and strive towards constant innovation, with the goal of helping companies and developers build a faster, safer, and more reliable internet.
Principal Machine Learning Engineer
Loading application form