Sovit Nayak | RL Research & Engineering

Experience

/// RESEARCH_PROFILE

Softmax

2025 - Present

Reinforcement Learning Researcher

›Conducted multi-agent RL experiments focusing on reward shaping, curriculum learning, and training stability.
›Built reproducible training pipelines using W&B, structured configs, and automated scorecards.
›Automated distributed training infrastructure, improving reliability and experiment throughput.
›Analyzed rollout data to detect reward hacking, mode collapse, and agent identity drift.

TwoCube AI

2023

ML Engineering Intern

›Engineered multi-agent LLM workflows deployed on GCP Kubernetes clusters.
›Fine-tuned LLMs with supervised objectives, optimizing learning rate schedules and evaluation metrics.
›Optimized infrastructure efficiency by 3x, resulting in a 50% reduction in cloud costs.
›Built ETL pipelines and analytics dashboards for benchmarking model performance.
›Developed evaluation metrics for retrieval-augmented generation (RAG) systems.

UC Rocket Labs

06/2022 - 05/2024

Software Engineer

›Designed custom PCB boards + embedded systems for rocket avionics (Teensy 4.1, Arduino, C++).
›Integrated UART, SPI, and I2C communication buses with RF modules, achieving reliable real-time telemetry with >95% packet delivery rate during test flights.
›Delivered algorithm optimizations, improving real-time system performance by 15%.

Flow AI

2023

Software Engineering Intern

›Developed serverless ML pipelines using AWS Lambda for scalable inference.
›Deployed TensorFlow prediction models for real-time data processing.
›Implemented CI/CD automation to ensure reliability of machine learning deployments.