Subliminal Learning in AIs
July 25, 2025
Today’s freaky LLM behavior:
We study subliminal learning, a surprising phenomenon where language models learn traits from model-generated data that is semantically unrelated to those traits. For example, a...