hlfshell
Articles
GRPO in DeepSeek-R1
Mar 14, 2025
Diffusion Models Are Real-Time Game Engines
Sep 17, 2024
Google DeepMind's Grandmaster-Level Chess Without Search
Aug 12, 2024
Representation Engineering and Control Vectors - Neuroscience for LLMs
Mar 21, 2024
Nerd Sniped - Solving for Jumbles and Letter Boxed
Feb 15, 2024
Utilizing LLMs as a Task Planning Agent for Robotics
Jan 8, 2024
A Corollary to Conway's Law - Build for The Team You Have
Jan 7, 2024
Repeatable Dev Environments for ROS2
Oct 14, 2023
State of the art in LLMs + Robotics - 2023
Oct 5, 2023
Reinforcement Learning with a Pick and Place Robotic Arm
Aug 2, 2023
Ultralearning
Jun 21, 2023
Evolving a Neural Network Traffic Controller
Jun 16, 2023
Golang Docker Harness
Jun 13, 2023
Evolutionary Neural Networks
Jun 11, 2023