Cody Rushing - Member of Technical Staff at Redwood Research

About

Hey, I'm Cody. I work at Redwood Research on AI security. In the past, I've done mechanistic interpretability research under Neel Nanda. I got my Bachelor's in Computer Science from UT Austin in fall 2024.

In my free time, I'm working on being a better writer, basketball player, thinker, and friend.

Selected Research

Ctrl-Z: Controlling AI Agents via Resampling

Aryan Bhatt, Cody Rushing, et al., Buck Shlegeris

arxiv | website

Explorations of Self-Repair in Language Models

Cody Rushing, Neel Nanda

Accepted to ICML 2024; Accepted to SeT LLM @ ICLR 2024 Workshop | Oral

arxiv

Copy Suppression: Comprehensively Understanding an Attention Head

Callum McDougall*, Arthur Conmy*, Cody Rushing*, Thomas McGrath, Neel Nanda

Accepted to NeurIPS ATTRIB 2023 Workshop

arxiv

Contact

Email: thisiscodyr@gmail.com

GitHub | Google Scholar | Twitter