About Phantm
Phantm is a technology company focused on optimizing large language model (LLM) usage for organizations that rely on AI-powered applications. The platform acts as a drop-in replacement for existing LLM calls, reducing token costs by approximately 50% while maintaining response quality. By improving efficiency and lowering operational expenses, Phantm enables teams to scale AI solutions more sustainably. The company is building tools for developers and businesses that want smarter, more cost-effective AI integrations. Team members have the opportunity to work on cutting-edge problems at the intersection of machine learning and systems optimization.
The role
This is a full-time, remote ML Intern role. The ML Intern will support the design, implementation, and evaluation of models and algorithms that reduce token usage without degrading output quality. Day-to-day work may include experimenting with different LLM configurations, building data pipelines, analyzing performance metrics, and contributing to internal tools for monitoring and optimization. The role also involves staying current with recent ML and LLM research and suggesting improvements that can be translated into practical product features.
What we're looking for
- Strong foundation in machine learning and statistics, including familiarity with model training, evaluation, and optimization.
- Experience with Python and common ML libraries or frameworks (e.g., PyTorch, TensorFlow, scikit-learn).
- Understanding of large language models, NLP concepts, and prompt design or tuning.
- Currently pursuing or recently completed a degree in Computer Science, Data Science, Engineering, Mathematics, or a related field, or equivalent practical experience.