Principal Machine Learning Researcher Job at Alldus, San Jose, CA

  • Alldus
  • San Jose, CA

Job Description

Principal / Director, AI Research – Reinforcement Learning for LLMs

We're hiring a Principal or Director-level AI Researcher with deep expertise in Reinforcement Learning and LLM post-training to join our growing AI research group. This is a research-first role, with a mandate to push the frontier of model alignment, safety, and performance — working with foundation models in real-world, high-stakes environments.

You won’t be handed toy problems or legacy systems. Instead, you’ll lead applied research efforts focused on tuning, aligning, and optimizing large models for privacy, security, and interpretability, in one of the few spaces where LLMs operate at massive scale and have measurable consequences.

What You’ll Work On:

This role centers on building and refining intelligent agents that interact with sensitive data and complex access controls, using modern reinforcement learning and post-training techniques:

  • Post-training of LLMs using RL: Design and run experiments with methods like PPO, DPO, RLAIF, and other fine-tuning strategies to align model behavior with security and privacy goals
  • RL for Self-Correction & Redaction: Enable models to iteratively improve their predictions on document classification, redaction, and identity resolution through self-rewarded feedback loops
  • Model Alignment & Safety: Contribute to the development of our “LLM Firewall” — filtering prompts/responses to prevent jailbreaking, data leakage, and adversarial exploits
  • Inference Stack & Optimization: Collaborate with engineers optimizing our in-house inference stack to make LLaMA-class models performant at scale

What We’re Looking For:

  • Demonstrated expertise in Reinforcement Learning applied to language models or decision-making agents
  • Strong understanding of post-training methodologies (e.g., RLHF, DPO, preference modeling, rejection sampling, offline RL)
  • Solid background in LLMs, token-level reasoning, and language modeling internals
  • Publication record or research contributions in top-tier venues (NeurIPS, ICLR, ICML, ACL, etc.) preferred
  • Ability to work independently and iterate quickly — experience in scrappy, high-output research environments a plus
  • Industry experience is not required — we care more about the depth of your research thinking and experimentation rigor

Why This Role:

  • Join a company with massive real-world data, impactful use cases, and a mature infrastructure
  • Avoid the grind of infra-focused roles — we’ve already solved those problems
  • Shape the next phase of LLM alignment, self-correcting models, and AI safety at inference time
  • Work on problems with technical depth and direct product impact
