< Back to About

Full time

Hybrid

Raanana, Israel

LLM Engineer

Apply

About the job

Opmed.ai is an innovative startup dedicated to improving the efficiency and resilience of healthcare operations. Our mission is to help healthcare providers deliver better patient care by optimizing planning, scheduling, and operations using AI, Network Science, and advanced optimization algorithms. We seek a highly skilled and motivated LLM Engineer with a passion for startups and genuine interest in ML and Data to join our team at Opmed.ai (Raanana, Israel).

Primary Responsibilities:

  • End-to-end model development – design, train, fine-tune, and evaluate large-language-model (LLM) pipelines (GPT-class, Mistral, Llama 2/3, etc.) for scheduling, resource-allocation, and workflow-automation use-cases in hospitals.
  • RAG & agents – build retrieval-augmented-generation and autonomous agent workflows that combine EHR, HR, IoT and historical case-length data to surface real-time recommendations.
  • Prompt & instruction engineering – craft, test, and version prompts/instructions for accuracy, latency, and bias-mitigation under strict clinical-safety guidelines.
  • Privacy-centric deployment – implement HIPAA / SOC 2-compliant pipelines (on-prem, VPC or secure PaaS) with guardrails, PHI redaction, audit logging and continuous monitoring.
  • Performance tuning – optimize inference cost, token throughput, and latency across GPU clusters (NVIDIA A100/H100) and serverless endpoints.
  • Collaboration – partner with data scientists, optimization researchers (OR-Tools, Hexaly), and product teams to ship LLM-powered features into our SaaS platform.
  • Research-to-production – track new papers, evaluate SOTA methods (Mixture-of-Experts, LoRA, DPO, speculative decoding, tool-former agents) and run rapid POCs.

Requirements

  • Proven experience in designing, developing, and deploying large language models, with a portfolio of past projects or contributions to LLM development.
  • Advanced proficiency in Python and familiarity with libraries like TensorFlow, PyTorch, or Hugging Face Transformers
  • Knowledge of tool automation, APIs, and vector databases
  • Experience with modern LLM serving and inference frameworks,
  • Hands-on experience with LangChain and LlamaIndex, enabling RAG applications and LLM orchestration.
  • Strong software development skills with proficiency in Python. Experienced user of ML and data science libraries such as PyTorch, TensorFlow, Hugging Face Transformers, and scikit-learn.
  • Familiarity with distributed computing, cloud infrastructure, and orchestration tools, such as Kubernetes, Apache Airflow (DAG), Docker, Conductor, Ray for LLM training and inference at scale is a plus.
  • Ability to meaningfully present results of analyses in a clear and impactful manner, breaking down complex ML/LLM concepts for non-technical audiences.

Why Join Us?

At Opmed.ai, you’ll be part of a company that’s at the forefront of AI-driven healthcare solutions. We offer:

  • A collaborative and intellectually stimulating environment.
  • Opportunities to work on cutting-edge technology that directly impacts the healthcare industry.
  • Competitive compensation, growth opportunities, and a fun, supportive team.

LLM Engineer

By clicking “Apply Now,” you allow Opmed to contact you via phone or email.
See our Privacy Policy for more info.

Thank you!

You your application has been submitted

Oops! Something went wrong while submitting the form.

Embrace the new era in OR management

See the unseen and anticipate the unexpected to create resilient OR schedules,
maximize resource allocation, and improve quality of care with Opmed.
Book a demo