Vish Sangale

All Notes.

N° 18
Beyond the Click: Slate-Q for Sequential Recommendation Most recommendation systems are designed to maximize immediate engagement—the “next click.” However, true user value is built over entire sessions. In this project, RL-RECSYS, I...
recsys
N° 17
Animating Intelligence: Visualizing AI with Manim-AI Neural networks are often treated as “black boxes,” full of abstract matrices and hidden weights. To bridge the gap between theory and intuition, I developed...
manim
N° 16
Bonsai-LLM: The 'Small LLMs Lab' Philosophy In an era of trillion-parameter models and massive compute clusters, it’s easy to forget that capacity matters, constraints are the point, and craft beats brute...
small-llms
N° 15
Modernizing GPT-2: A 3.1x Throughput Leap with 2025 Optimizations The original GPT-2 architecture, released in 2019, remains the bedrock of modern NLP. However, the “standard” recipe for training Transformers has shifted dramatically. In this...
gpt-2
N° 14
Causal-Informed Hybrid Online Adaptive Optimization for Ad Load Personalization in Large-Scale Social Networks This paper presents CTRCBO (Cohort-Based Trust Region Contextual Bayesian Optimization), a hybrid framework designed for personalizing ad load in large-scale social networks like Meta.
causal-inference
N° 13
Generative Retrieval: Scaling PLUM with Hierarchical Semantic IDs The landscape of recommendation systems (RecSys) is undergoing a paradigm shift. We are moving away from traditional “score-and-rank” models toward Generative Retrieval, where an LLM...
recsys
N° 12
Treatment Effect Learning Under Sequential Randomization This research addresses the challenges of causal inference in online experiments (A/B testing) where treatment assignments are sequential.
treatment-effect
N° 11
NxtPost: User to Post Recommendations in Facebook Groups NxtPost is a graph-based recommendation framework designed for Facebook Groups to help users discover relevant content within their communities.
recsys
N° 10
Behavioral Questions In senior technical roles, behavioral interviews are often more about impact, ownership, and cross-functional leadership than technical skill. They test how you handle conflict, complexity,...
interview
N° 09
ML System Design Designing a machine learning system is vastly different from designing a traditional software system. It requires balancing the needs of high-availability software architecture with the...
system-design
N° 08
Activation Functions Activation functions are the mathematical “gates” that decide whether a neuron should fire or stay dormant. They introduce non-linearity into a neural network, allowing it...
deep-learning
N° 07
Python Clean Code Python’s philosophy emphasizes readability and simplicity. “Clean code” in Python (often called “Pythonic” code) isn’t just about making it work—it’s about making it expressive and...
python
N° 06
Regularization Regularization is a fundamental technique in machine learning used to prevent overfitting by penalizing the complexity of a model. By adding a regularization term to...
regularization
N° 05
Loss functions in machine learning Loss functions are the heartbeat of machine learning models. They quantify the discrepancy between a model’s predictions and the actual ground truth, guiding the optimization...
loss-functions
N° 04
Failure is not an option For as long as I can remember, I have been captivated by the "Final Frontier." Whether through the lens of a telescope peering into the...
exploration
N° 03
TeTrIS: Template Transformer Networks for Image Segmentation With Shape Priors This research, published in IEEE Transactions on Medical Imaging (TMI), introduces TeTrIS, a network architecture that integrates shape priors into the segmentation process.
segmentation
N° 02
Uncertainty in Motion: Probabilistic Robotics from Textbook to Code Probabilistic methods are the foundation of modern robotics. Because sensors are noisy and environments are unpredictable, a robot must be able to represent and reason...
probabilistic-robotics
N° 01
Localization of a Mobile Autonomous Robot Using Extended Kalman Filter This early research focuses on the localization of mobile robots in dynamic environments using Extended Kalman Filters (EKF).
robotics