•
Notes on variational inference perspectives for policy optimization
6 min read · January 23, 2026
2026 · RL · notes