Talent.com
Senior LLMOps Engineer- AI Enabler Team

Senior LLMOps Engineer- AI Enabler Team

Cast AIBucharest, Romania
30+ days ago
Job description

Why Cast AI?

Cast AI is the leading Application Performance Automation (APA) platform , enabling customers to cut cloud costs, improve performance, and boost productivity – automatically.

Built originally for Kubernetes, Cast AI goes beyond cost and observability by delivering real-time, autonomous optimization across any cloud environment. The platform continuously analyzes workloads, rightsizes resources, and rebalances clusters without manual intervention, ensuring applications run faster, more reliably, and more efficiently.

Headquartered in Miami, Florida, Cast AI has employees in more than 32 countries worldwide and supports some of the world’s most innovative teams running their applications on all major cloud, hybrid, and on-premises environments. Over 2,100 companies already rely on Cast - from BMW and Akamai to Hugging Face and NielsenIQ.

What’s next? Backed by our $108M Series C, we’re doubling down on making APA the new standard for DevOps and MLOps, and everything in between.

Core values that hold us all together :

PRACTICE CUSTOMER OBSESSION. Focus on the customer journey and work backwards. Strive to deliver customer value and continuously solve customer problems. Listen to customer feedback, act, and iterate to improve customer experience.

LEAD. Take ownership and lead through action. Think and act on behalf of the entire company to build long-term value across team boundaries.

DEVELOP AND HIRE THE BEST. Strive to raise the performance bar by continuously investing in yourself, the team and by hiring the best possible candidates for every position. Drive towards personal development and professional growth, and mentor others to raise the collective bar.

EXPECT AND ADVOCATE CHANGE. Strive to innovate and accept the inevitable change that comes with innovation. Constantly welcome new ideas and opinions. Share insights responsibly with unwavering openness, honesty, and respect. Once a path is chosen, be ready to disagree and commit to a direction.

What does AI Enabler Team do?

In the AI Enabler team, our day is usually full of R&D challenges. Have you ever encountered a situation where you need to expand your AI infrastructure so that the applications can automatically pick the right large language models (LLMs) that are both more cost-efficient and better performing? Most of us probably do nowadays, or at least understand the complexity of making such decisions while keeping track of our cloud budget.

One of the team's responsibilities is ensuring that whenever a customer makes AI-related decisions regarding their K8s infrastructure, they are implemented automatically without unnecessary costs or hassle. This is just one small piece of a bigger puzzle. To get into a more detailed perspective, ask yourself the following questions :

  • How often do you use LLMs?
  • What is the least expensive LLM you can pick for a given prompt without degrading the quality of the response?
  • How much do your applications cost per 1 million tokens and how can you improve it?
  • Which API keys have the biggest waste?
  • How can you improve your frequently running prompt to use fewer tokens?
  • What is fine-tuning and how to do it efficiently?
  • What is a transformer?

These are just several of the many questions that are part of the daily work of this team.

Being a part of this team would involve design and decision-making end-to-end while collaborating with colleagues from other teams. Cast AI, being a technical product, encourages not only coding something as written in the JIRA ticket but also coming up with new features and potential solutions to customers' problems. Given that the team is working on a technical greenfield project, you will have the opportunity to impact it in many ways positively.

Responsibilities for the role :

  • Evaluate and Analyze LLM performance
  • Fine-Tune LLMs
  • Optimize AI Models for Cost Efficiency
  • Develop and implement data science solutions
  • Architect and build inference and training pipelines, directly contributing through hands-on design, model training pipeline, and deployment strategies
  • Stay up to date with industry trends.
  • Here are some of the tools we use daily :

  • Python
  • ClickHouse and PostgreSQL for persistence
  • GCP Pub / Sub for messaging
  • gRPC for internal communication
  • REST for public APIs
  • Kubernetes, which our product is evolving around
  • AWS , GCP , and Azure cloud providers, which are currently supported in our platform
  • We use GitLab CI with ArgoCD as our GitOps CD engine
  • Prometheus , Grafana , Loki , and Tempo for observability.
  • Requirements

  • Experience with designing a production-grade machine learning system
  • Strong software engineering skills in Python
  • Ability to move fast in an environment where things are sometimes loosely defined and may have competing priorities or deadlines
  • Experience with the design and implementation of efficient model training and inference pipelines end to end
  • 5+ years of hands-on experience in Data Science and Machine Learning, with a proven track record, demonstrated through a robust portfolio of projects
  • You have to be physically in any of the European countries GMT 0 to GMT +3
  • Strong English skills
  • Strong verbal and written communication skills
  • Ability to work independently and collaborate in a group.
  • What's in it for you?

  • Competitive salary (€6,500 - €9,000 gross, depending on the level of experience) with equity options
  • Direct impact on the product in a cutting-edge company that’s reshaping cloud automation and optimization
  • Collaborate with a global team of top cloud experts and innovators passionate about pushing the boundaries of Kubernetes technology
  • Fast development cycles with a short feedback loop and direct customer impact
  • Transparent work environment
  • Focused work with minimal meetings and bureaucracy
  • 10% of your time dedicated to self-improvement and personal projects.
  • Create a job alert for this search

    Ai Engineer • Bucharest, Romania

    Related jobs
    Senior Data Engineer

    Senior Data Engineer

    Shape Your Future with UsBucharest, Romania
    Position : Senior Data Engineer (B2B / Freelancer Contract).Location : Remote open to candidates based in the European Union. We are looking for an experienced and motivated.The role focuses on develo...Show moreLast updated: 30+ days ago
    AI Data Engineer

    AI Data Engineer

    Shape Your Future with UsBucharest, Romania
    Position : AI Data Engineer (B2B / Freelancer Contract).Location : Remote open to candidates based in the European Union, Serbia, or Moldova. We are recruiting on behalf of our client for an experienc...Show moreLast updated: 30+ days ago
    UI / UX Designer (Figma Focused)

    UI / UX Designer (Figma Focused)

    RM Staffing B.V.Ruse, 18, BG
    We are seeking a creative and detail-oriented.You will collaborate closely with developers, product managers, and other stakeholders to translate user insights and business requirements into functi...Show moreLast updated: 20 days ago
    SAP Director

    SAP Director

    TotalSoftVoluntari, RO
    Quick Apply
    Company description : Architected Business Solutions (ABS) is a leading provider of technology consulting services, focusing on digital transformation of our customers’ businesses, implementin...Show moreLast updated: 30+ days ago
    AI / ML Engineering Lead

    AI / ML Engineering Lead

    Shape Your Future with UsBucharest, Romania
    Position : AI / ML Engineering Lead (B2B / Freelancer Contract).Location : Remote open to candidates based in the European Union, Serbia, or Moldova. We are recruiting on behalf of our client for an exp...Show moreLast updated: 30+ days ago
    System Administrator L2

    System Administrator L2

    E-INFRAChitila, RO
    Quick Apply
    With an experience of over 28 years, E-INFRA Group comprises 5 Romanian companies, active in the field of energy, constructions, and telecommunications infrastructure. E-INFRA holding includes : Elec...Show moreLast updated: 30+ days ago
    Business Consultant with German

    Business Consultant with German

    TotalSoftIlfov, Ilfov, RO
    Quick Apply
    We are looking for an experienced Business Consultant / Analyst with expertise in operations underwriting or risk management within the leasing or banking sectors (specifically for legal entities).Th...Show moreLast updated: 30+ days ago
    Technical Sales Specialist Data Center

    Technical Sales Specialist Data Center

    RM Staffing B.V.Ruse, 18, BG
    You’ll be the face of our technical expertise, conducting compelling.By effectively communicating the value of our services, you'll help prospective customers understand how our solutions can solve...Show moreLast updated: 30+ days ago
    Senior Digital Transformation Consultant - Software for Healthcare Area

    Senior Digital Transformation Consultant - Software for Healthcare Area

    TotalSoftVoluntari, Bucuresti, RO
    Quick Apply
    About Us In Total Soft we empower our clients with cutting-edge driving innovation through deep Healthcare industry expertise and advanced technology. We are at the forefront of digital transformati...Show moreLast updated: 30+ days ago
    AI Solutions Engineer

    AI Solutions Engineer

    Shape Your Future with UsBucharest, Romania
    Position : AI Solutions Engineer (B2B / Freelancer Contract).Location : Remote open to candidates based in the European Union, Serbia, Moldova, Bosnia, or Spain. We are recruiting on behalf of our cli...Show moreLast updated: 30+ days ago
    Senior AI Research Engineer, Model Inference (100% Remote)

    Senior AI Research Engineer, Model Inference (100% Remote)

    Tether Operations LimitedBucharest, B, RO
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show moreLast updated: 30+ days ago
    AI Solutions Architect

    AI Solutions Architect

    Shape Your Future with UsBucharest, Romania
    Position : AI Solutions Architect (B2B / Freelancer Contract).Location : Remote open to candidates based in the European Union, Serbia, or Moldova. We are recruiting on behalf of our client for an exp...Show moreLast updated: 30+ days ago
    Senior Product Engineer, Remote

    Senior Product Engineer, Remote

    ModashBucharest, Bucharest, RO
    Quick Apply
    The world doesn’t need giant media organizations to tell every story.The world needs millions of creators.Independent voices who bring weird, wonderful stories to life online.We’re working to help ...Show moreLast updated: 30+ days ago
    Data Center Procurement Killer!

    Data Center Procurement Killer!

    RM Staffing B.V.Ruse, 18, BG
    Reboot Monkey is a leading provider of comprehensive data center management solutions, offering services such as managed colocation, smart hands, and rack and stack solutions.We ensure fast deploym...Show moreLast updated: 30+ days ago
    AI Research Engineer Reinforcement Learning (100% Remote)

    AI Research Engineer Reinforcement Learning (100% Remote)

    Tether Operations LimitedBucharest, B, RO
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show moreLast updated: 30+ days ago
    Data AI Engineer

    Data AI Engineer

    E-INFRABucuresti, RO
    Quick Apply
    With an experience of over 28 years, E-INFRA Group comprises 5 Romanian companies, active in the field of energy, constructions, and telecommunications infrastructure. E-INFRA holding includes : Elec...Show moreLast updated: 20 days ago
    WMS BUSINESS CONSULTANT

    WMS BUSINESS CONSULTANT

    TotalSoftVoluntari, Voluntari, RO
    Quick Apply
    About the job Charisma ERP & WMS manages a company’s resources, regardless of their size, and it is an integrated software system optimizing and simplifying the internal business processe...Show moreLast updated: 30+ days ago
    • New!
    Senior Fullstack Engineer

    Senior Fullstack Engineer

    Urban ConnectBucharest, Romania
    Initial Screening; Behavioral Interview; Technical Assessment; CEO meeting; Offer.Opportunity to work with different programming languages and frameworks. Flexibility to work from home, with a casua...Show moreLast updated: 9 hours ago