DEV Community

# reinforcementlearning

Posts

Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)

1
Comments
4 min read
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!

Comments
11 min read
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there

5
Comments
4 min read
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide

1
Comments
52 min read
How I Built a Readable AlphaZero From Scratch — A Deep Dive Into the Code

1
Comments
10 min read
From Pixels to Physicality ☃️: Engineering Olaf with Reinforcement ✨ Learning, Control Systems, and Illusion Design 🤖

2
Comments
8 min read
I Built an AI Arena and Trained AlphaZero to Play Gomoku: Here’s How

1
Comments
4 min read
[Meta-RL] We told an AI agent 'you can fail 3 times.' Accuracy went up 19%.

4
Comments
4 min read
Fixing an Off-By-One Bug in PufferLib's PPO Implementation

Comments
2 min read
Multi armed bandit exercise 2.5 with C#

Comments
4 min read
Sutton & Barto Gridworld example in C#

Comments
5 min read
HRPO-X v1.0.1: from HRPO paper production-hardened runnable code

Comments
2 min read