Power Factors is seeking a Senior Machine Learning Engineer to join their Innovation team. This role will focus on ambitious technical initiatives, specifically building and fine-tuning Large Language Models (LLMs) using Power Factors' unique dataset. The engineer will be involved in all stages of the process, from data preparation and architecture design to aligning with business value, developing curriculum and tokenization strategies, and scaling models from proof-of-concept to production. The role involves significant architectural decision-making for foundational time-series models, including choices related to model heads, attention mechanisms, context/horizon sizing, and handling multi-frequency data. The engineer will also design the tokenization strategy for a multi-modal training corpus, estimate scaling laws for fleet-scale capacity, and stay abreast of the latest time-series foundation model literature. A key responsibility is designing an experimental framework to validate the business value of these models for target use cases. On the training infrastructure side, the engineer will build and maintain a reproducible training environment with experiment tracking. They will optimize distributed training pipelines for efficiency and implement a model registry for artifact management. The pre-training recipe, including learning rate schedules and curriculum strategies, will also be owned by this role. For fleet-scale execution, the engineer will scale training from pilot to a larger dataset, handle real-world data quality issues, and report baseline metrics. Collaboration with Backend/Data Engineers and the Tech Lead/Product team is crucial for ensuring data quality, pipeline integration, and meeting customer requirements. Documentation of training procedures and architecture decisions is also expected.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed