Joe Clinton | Robotics and AI Researcher

Publication

Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens

Joseph Clinton, Robert Lieck

arXiv 2024

Supervised learning approaches to offline reinforcement learning, particularly those utilizing the Decision Transformer, have shown effectiveness in continuous environments and for sparse rewards. However, they often struggle with long-horizon tasks due to the high compounding error of auto-regressive models. To overcome this limitation, we go beyond next-token prediction and introduce Planning Tokens, which contain high-level, long time-scale information about the agent's future. Predicting dual time-scale tokens at regular intervals enables our model to use these long-horizon Planning Tokens as a form of implicit planning to guide its low-level policy and reduce compounding error. This architectural modification significantly enhances performance on long-horizon tasks, establishing a new state-of-the-art in complex D4RL environments. Additionally, we demonstrate that Planning Tokens improve the interpretability of the model's policy through the interpretable plan visualisations and attention map.

Paper Code

Research Projects

DexBench

Next-gen benchmark for dexterous bimanual manipulation. 11 long-horizon tasks on a compact table-top with mobile camera, distractor objects, and low-cost compliant arms. Designed to expose the gap between saturating sim benchmarks and real-world humanoid performance.

Benchmarking Bimanual VLA

Shh... still in stealth mode

Planning Transformer

Novel enhancement to the Transformer architecture for offline RL that significantly improves long-horizon decision making. Master's thesis achieving state-of-the-art on D4RL benchmarks.

PyTorch Offline RL Transformers

Paper GitHub

hand-teleop

Turn your webcam into real-time robot joint positions. Designed for LeRobot with Wilor GPU backend, Kalman smoothing, and plug-and-play integration. 48 GitHub stars.

Python Computer Vision Robotics

LinkedIn Post GitHub

PromptMonkey

Multi-agent many-shot code generation achieving 4th place out of 800+ participants in the NeurIPS 2024 Meta Hackercup AI Track. Extended MapCoder with careful prompt engineering, Codestral-22B, Maj@128 voting, and VLLM parallel inference (2000 tk/s).

LLM Agents VLLM Prompt Engineering

GitHub

Hint Distill

Self-supervised hint distillation for improving LLMs on code generation. Finetunes Qwen3-4B using KL-divergence distillation where the model with hint access teaches itself.

LLM Post-Training Distillation PyTorch

GitHub

vpct-text

Prime verifiers environment for VPCT scenes. Won #3 in the Iterate London RL Environment Hackathon (Dec 2025). Scores model outputs predicting final bucket positions.

RL Environments Python Hackathon Winner

GitHub

Open Source

GEM (Good Enough Manipulator)

A $450 low-cost 7-DOF robot arm bridging the gap between ultra-budget ($140 SO101) and premium ($3000+) options. 1.2kg peak payload, 60cm reach. Designed for accessible robot learning research.

CAD Robotics Hardware

Shh... still in stealth mode

LeRobot Contributions

Contributor to HuggingFace's open-source VLA framework for robot learning. Contributed BlockPush environment for benchmarking manipulation policies, assisted in porting HIL-SERL for online RL finetuning, and improved training speed through data loading optimizations.

Python Imitation Learning VLA

GitHub

robot-arm-viewer

Browser-based URDF viewer with IK-driven click-and-drag controls. Compare low-cost robot arms, export DAE models, and visualize reachable workspaces.

Three.js URDF IK

Demo GitHub

Latent Diffusion Slim

State-of-the-art latent diffusion model trained on FFHQ for photo-realistic face generation up to 128x128px. Optimized for single GPU training. 100% coursework grade.

Diffusion Models PyTorch Generative AI

GitHub

Dirty Dish Detection

Hackathon project detecting when housemates leave dirty dishes using a hybrid of YOLO and traditional computer vision state tracking algorithms.

YOLO Computer Vision PyTorch

GitHub

SO100 Camera Mount

Open source snap-on camera mount for SO100 robot arm and U20CAM camera. Optimized 30-degree angle based on testing. Parametric Fusion 360 design for easy customization.

CAD Robotics Hardware

GitHub

Scratch Addons

Core contributor to browser extension with 593,000+ users. Developed 8 addons including a profiling tool for performance analysis of Scratch projects.

JavaScript Browser Extension Profiling

Website GitHub

Side Projects

Qualicoder

AI-first qualitative coding platform for transcript analysis. Helping consultants efficiently analyze interview and research data.

AI Startup React NLP

Demo

Cluque

Daily cryptic puzzle game with social features. Play the global challenge or compete with friends in groups. Built with React and Firebase subscriptions.

React Firebase Consumer App

Play Video

IBrecap.com

Non-profit IB revision platform I founded and maintain. 1.6 million page views, #4 Google result for "IB revision". Full-stack PHP/SQL with custom CMS.

PHP SQL Full-Stack

Visit

3D Game Engine

First complete 3D graphics and physics engine for Scratch. Built over 2 years with innovative binary space partitioning for efficiency. Demonstrates deep graphics understanding.

Scratch 3D Graphics Physics

Try it