logo
ML through Reinforcement Learning
[CLOSED] 🔒Q-Learning
Initializing search
    • Home
    • Materials
    • Premium Content
    • About
    • Newslatter
    • Home
      • The Latest
          • 0. (First step)-(Last step)
          • 1. Prerequisites
          • 2. Set up new project, git info
          • [CLOSED] 🔒3. numpy
          • [CLOSED] 🔒4. matplotlib
          • [CLOSED] 🔒5. Linear Regression
          • [CLOSED] 🔒6. Logistic Regression
          • [CLOSED] 🔒7. Perceptron, MLP, XOR
          • [CLOSED] 🔒8. Neurol Networks
          • [CLOSED] 🔒9. NN weights init
          • Why RL
          • When RL
          • [CLOSED] 🔒Idea, Concepts and Terms
          • [CLOSED] 🔒course page
          • RL vocabulary
          • [CLOSED] 🔒MDP
          • [CLOSED] 🔒Gymnasium (gym) api
          • [CLOSED] 🔒MC vc TD updates
          • Sarsa, SarsaMax
          • [CLOSED] 🔒Q-Learning
          • REINFORCE
          • Reward rules
          • [CLOSED] 🔒Exploration/Exploitation
          • [CLOSED] 🔒Cusom gym env with different reward rules (parabola example)
          • Real RL project
          • [CLOSED] 🔒Must read articles
          • Top-k RL-engineer interview questions
          • [CLOSED] 🔒How to hire good rl engineer? (For HR and job seekers)
        • Authors
      • [CLOSED] 🔒Advanced Topics
      • [CLOSED] 🔒Advanced Topics
    • About
    • Newslatter
    NewZc3pK69fuKTztgoxnGw==;QtRkcqX2IZYmSdufy+LthQ==

    This premium content requires a password

    Contact your administrator ___Vitalii___ for access to this page.

    © Copyright © 2023-2025 Vitalii Grebnev