Beyond the Click: Slate-Q for Sequential Recommendation
Most recommendation systems are designed to maximize immediate engagement—the “next click.” However, true user value is built over entire sessions. In this project, RL-RECSYS, I explored how Reinfo...
