Job Description
About Us
We are a stealth-mode startup building next-generation infrastructure for the AI industry. Our mission is to make advanced language models portable, efficient, and customizable for real-world deployments. We’re building tools that allow vendors to fine-tune models easily and deploy them securely on diverse hardware.
Role
We are seeking an AI/ML Engineer (Python) to help design and implement our AI pipelines. This is not an academic research role: you will be productizing and automating existing fine-tuning techniques (LoRA/QLoRA) so vendors can train and manage their own adapters with minimal effort.
You’ll work closely with backend engineers (Node.js) who orchestrate jobs and dashboards, while you focus on the training pipelines and adapter export logic.
Responsibilities
Implement and maintain LoRA/QLoRA fine-tuning pipelines using PyTorch + Hugging Face Transformers + PEFT.
Develop logic for incremental training and adapter stacking, producing clean, versioned “delta packs.”
Automate data preprocessing (tokenization, formatting, filtering) for user-supplied datasets.
Build training scripts/workflows that integrate with orchestration backends (Node.js, REST/gRPC, or job queues).
Implement monitoring hooks (loss curves, checkpoints, eval metrics) to feed into dashboards.
Collaborate with DevOps to ensure reproducible, portable training environments.
Write tests to guarantee reproducibility and correctness of adapter outputs.
Willingness to occasionally be present in the office for discussions and team collaboration.
Requirements
Strong programming skills in Python.
Hands-on experience with PyTorch and the Hugging Face ecosystem (Transformers, Datasets, PEFT).
Familiarity with LoRA/QLoRA or parameter-efficient fine-tuning methods.
Understanding of mixed precision training (FP16/BF16) and memory optimization techniques.
Experience building training scripts that are production-ready (reproducibility, logging, error handling).
Comfortable working in Linux GPU environments (CUDA, ROCm).
Ability to collaborate with backend/frontend engineers who are not ML specialists.
Nice to Have
Experience with bitsandbytes, xformers, or flash-attention.
Familiarity with distributed training (multi-GPU, NCCL, DeepSpeed, or Accelerate).
Prior work in MLOps or packaging ML pipelines for deployment.
Contributions to open-source ML libraries.
Why Join Us
Build the core training product that lets vendors adapt models safely and efficiently.
Focus on product engineering, not open-ended research.
Collaborate with a lean, highly technical team at the intersection of AI and systems.
Competitive compensation, equity potential, and flexible remote work.