Sr. Machine Learning Engineer, Network Engineering
Job Description
Senior ML Engineer, Network Engineering role at Tesla in Fremont, CA onsite, focusing on LLM training, network data processing, and intelligent diagnostics.
Responsibilities
- Design, implement, and maintain end-to-end pipelines for LLM pretraining, supervised fine-tuning, reward modeling, PPO/RLHF, and model evaluation
- Build scalable backend services and distributed training infrastructure to support large-scale ML workloads
- Create data ingestion and transformation pipelines for networking datasets including device inventory, topology data, ARP/MAC tables, routing state, metrics, telemetry, and logs
- Normalize and model heterogeneous networking data to support ML workflows and agent inference
- Develop intelligent agent capabilities, including intent classification, context tracking, troubleshooting logic, and action routing workflows
- Collaborate with networking domain experts to translate operational needs into model and platform capabilities
- Implement CI/CD, model versioning, orchestration, and production-grade deployment pipelines
- Drive architectural decisions to ensure scalability, modularity, reliability, and performance
Requirements
- 5+ years of experience in applied ML, ML systems, or backend platform engineering
- Strong experience with LLM training, including SFT, LoRA/PEFT, RLHF, reward modeling, and PPO
- Proficiency with PyTorch and distributed training technologies
- Solid backend engineering skills in Python, including APIs, microservices, data modeling, and containerization
- Strong understanding of networking fundamentals such as IP addressing, Ethernet, VLANs, routing protocols (BGP/OSPF/ISIS), and topology concepts
- Experience working with networking telemetry or operational data (SNMP, flow data, metrics, logs, routing/forwarding tables)
- Ability to interpret and model complex network states, device relationships, and diagnostic patterns
Technologies
- PyTorch
- Python
Benefits
- Medical plans with zero payroll deduction
- Family-building, fertility, adoption and surrogacy benefits
- Dental and vision plans with zero employee contributions
- Company-paid Health Savings Account (HSA) contribution
- Healthcare Flexible Spending Account (FSA)
- Dependent Care Flexible Spending Account (FSA)
- 401(k) with employer match
- Employee Stock Purchase Plan (ESPP)
- Company-paid Basic Life and AD&D insurance
- Short-term disability insurance
- Long-term disability insurance
- Employee Assistance Program
- Paid time off including sick and vacation
- Paid holidays
- Back-up childcare resources
- Parenting support resources
- Critical illness insurance
- Hospital indemnity insurance
- Accident insurance
- Theft & legal services
- Pet insurance
- Weight loss program
- Tobacco cessation program
- Tesla Babies program
- Commuter benefits
- Employee discounts and perks program
What to expect
We seek a highly skilled Senior Software Engineer to lead the development of a platform that unifies large-scale LLM training with advanced network data processing and intelligent diagnostic capabilities. The role requires strong machine-learning engineering expertise, robust software engineering fundamentals, and a solid understanding of network systems.
Expected compensation
Salary: $140,000 - $300,000 per year, plus cash and stock awards and benefits. The final offer may vary based on location, expertise, skills, and experience. Additional elements may be included as part of the overall compensation package, with details provided if an employment offer is extended.