Senior LLMOps Engineer - AI Enabler Team

Cast AI, Bucharest, Romania
Posted more than 30 days ago
Job description

Why Cast AI?

Cast AI is the leading Application Performance Automation (APA) platform, enabling customers to cut cloud costs, improve performance, and boost productivity – automatically.

Built originally for Kubernetes, Cast AI goes beyond cost and observability by delivering real-time, autonomous optimization across any cloud environment. The platform continuously analyzes workloads, rightsizes resources, and rebalances clusters without manual intervention, ensuring applications run faster, more reliably, and more efficiently.

Headquartered in Miami, Florida, Cast AI has employees in more than 32 countries worldwide and supports some of the world’s most innovative teams running their applications on all major cloud, hybrid, and on-premises environments. Over 2,100 companies already rely on Cast - from BMW and Akamai to Hugging Face and NielsenIQ.

What’s next? Backed by our $108M Series C, we’re doubling down on making APA the new standard for DevOps and MLOps, and everything in between.

Core values that hold us all together:

PRACTICE CUSTOMER OBSESSION. Focus on the customer journey and work backwards. Strive to deliver customer value and continuously solve customer problems. Listen to customer feedback, act, and iterate to improve customer experience.

LEAD. Take ownership and lead through action. Think and act on behalf of the entire company to build long-term value across team boundaries.

DEVELOP AND HIRE THE BEST. Strive to raise the performance bar by continuously investing in yourself, the team and by hiring the best possible candidates for every position. Drive towards personal development and professional growth, and mentor others to raise the collective bar.

EXPECT AND ADVOCATE CHANGE. Strive to innovate and accept the inevitable change that comes with innovation. Constantly welcome new ideas and opinions. Share insights responsibly with unwavering openness, honesty, and respect. Once a path is chosen, be ready to disagree and commit to a direction.

What does the AI Enabler team do?

In the AI Enabler team, our day is usually full of R&D challenges. Have you ever encountered a situation where you need to expand your AI infrastructure so that applications can automatically pick the large language models (LLMs) that are both more cost-efficient and better performing? Most of us probably have by now, or at least understand the complexity of making such decisions while keeping track of a cloud budget.

One of the team's responsibilities is ensuring that whenever a customer makes AI-related decisions regarding their K8s infrastructure, they are implemented automatically without unnecessary costs or hassle. This is just one small piece of a bigger puzzle. For a more detailed picture, ask yourself the following questions:

  • How often do you use LLMs?
  • What is the least expensive LLM you can pick for a given prompt without degrading the quality of the response?
  • How much do your applications cost per 1 million tokens and how can you improve it?
  • Which API keys have the biggest waste?
  • How can you improve your frequently running prompt to use fewer tokens?
  • What is fine-tuning, and how do you do it efficiently?
  • What is a transformer?

These are just a few of the many questions that are part of the daily work of this team.
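
As a rough illustration of the cost questions above, here is a minimal Python sketch. It is not Cast AI's actual implementation. The model names, prices per 1 million tokens, and quality scores are hypothetical placeholders, and the sketch assumes that input and output tokens are priced separately and that a per-model quality score is available from some offline evaluation.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class ModelPricing:
    """Hypothetical pricing and quality record for one LLM."""
    name: str
    input_per_million: float   # assumed USD price per 1M input tokens
    output_per_million: float  # assumed USD price per 1M output tokens
    quality_score: float       # assumed offline evaluation score in [0, 1]


def request_cost(model: ModelPricing, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a single request with the given token counts."""
    return (
        input_tokens * model.input_per_million
        + output_tokens * model.output_per_million
    ) / 1_000_000


def blended_cost_per_million(
    model: ModelPricing, input_tokens: int, output_tokens: int
) -> float:
    """Effective USD cost per 1M tokens for a workload with this input/output mix."""
    total_tokens = input_tokens + output_tokens
    if total_tokens == 0:
        raise ValueError("workload must contain at least one token")
    return request_cost(model, input_tokens, output_tokens) / total_tokens * 1_000_000


def cheapest_acceptable(
    models: Sequence[ModelPricing],
    input_tokens: int,
    output_tokens: int,
    min_quality: float,
) -> Optional[ModelPricing]:
    """Cheapest model whose quality score meets the threshold, or None if none does."""
    candidates = [m for m in models if m.quality_score >= min_quality]
    if not candidates:
        return None
    return min(candidates, key=lambda m: request_cost(m, input_tokens, output_tokens))


# Made-up example catalogue; the names and numbers are illustrative only.
EXAMPLE_MODELS = [
    ModelPricing("small-model", 0.5, 1.5, 0.7),
    ModelPricing("large-model", 5.0, 15.0, 0.9),
]

if __name__ == "__main__":
    choice = cheapest_acceptable(EXAMPLE_MODELS, 1000, 500, min_quality=0.8)
    print(choice.name if choice else "no model meets the quality bar")
```

The quality threshold is what keeps the selection from always falling back to the cheapest model. In practice the hard part is measuring quality per prompt, which this sketch takes as given.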

Being part of this team involves end-to-end design and decision-making while collaborating with colleagues from other teams. Because Cast AI is a technical product, we encourage you not only to code what is written in the JIRA ticket but also to come up with new features and potential solutions to customers' problems. Given that the team is working on a technical greenfield project, you will have the opportunity to make a positive impact on it in many ways.

Responsibilities for the role:

  • Evaluate and analyze LLM performance
  • Fine-tune LLMs
  • Optimize AI models for cost efficiency
  • Develop and implement data science solutions
  • Architect and build inference and training pipelines, contributing directly through hands-on design, model training pipelines, and deployment strategies
  • Stay up to date with industry trends.

Here are some of the tools we use daily:

  • Python
  • ClickHouse and PostgreSQL for persistence
  • GCP Pub/Sub for messaging
  • gRPC for internal communication
  • REST for public APIs
  • Kubernetes, which our product revolves around
  • AWS, GCP, and Azure cloud providers, which are currently supported in our platform
  • GitLab CI, with ArgoCD as our GitOps CD engine
  • Prometheus, Grafana, Loki, and Tempo for observability.

Requirements

  • Experience with designing a production-grade machine learning system
  • Strong software engineering skills in Python
  • Ability to move fast in an environment where things are sometimes loosely defined and may have competing priorities or deadlines
  • Experience with the design and implementation of efficient model training and inference pipelines end to end
  • 5+ years of hands-on experience in Data Science and Machine Learning, with a proven track record, demonstrated through a robust portfolio of projects
  • You must be physically located in a European country in a time zone between GMT+0 and GMT+3
  • Strong English skills
  • Strong verbal and written communication skills
  • Ability to work independently and collaborate in a group.

What's in it for you?

  • Competitive salary (€6,500 - €9,000 gross, depending on the level of experience) with equity options
  • Direct impact on the product in a cutting-edge company that’s reshaping cloud automation and optimization
  • Collaborate with a global team of top cloud experts and innovators passionate about pushing the boundaries of Kubernetes technology
  • Fast development cycles with a short feedback loop and direct customer impact
  • Transparent work environment
  • Focused work with minimal meetings and bureaucracy
  • 10% of your time dedicated to self-improvement and personal projects.