Jan 2, Recap for December

😶‍🌫️Dec 16, Notes on DP, Monte Carlo, TD in Reinforcement Learning

Exploration of three key reinforcement learning algorithms: Dynamic Programming (DP) for optimal policies in MDPs, Monte Carlo methods for learning from complete episodes without a model, and Temporal Difference (TD) learning for efficient updates from incomplete episodes using bootstrapping. Each method has unique characteristics and trade-offs essential for understanding advanced concepts in reinforcement learning.
Dec 16, Notes on DP, Monte Carlo, TD in Reinforcement Learning
Dec 13, Notes on Basic Concepts about Reinforcement Learning
Dec 12, Notes on Gemini-Flash 2.0
Dec 6, Some Tests on o1
Dec 4, Recap for November
Nov 29, Notes on “Deep Thinking Model”
Nov 26, Explore DSPy on BootstrapFinetune

🤩Nov 20, Notes on DSPy

This isn't my first blog post on DSPy—I've written several before. However, I've noticed some recent updates to DSPy, and I'd rather not consult the documentation every time I want to build programs. So, I plan to jot down some basic DSPy concepts in this post. Additionally, I intend to use this document as external knowledge for GPT or Claude.
Nov 20, Notes on DSPy
Nov 17, Recap for October
Nov 15,Notes on OPENCODER
Nov 6, Notes on Contextual Retrieval