GPT-4o-mini Falls for Psychological Manipulation
September 5, 2025
Interesting experiment:
To design their experiment, the University of Pennsylvania researchers tested 2024’s GPT-4o-mini model on two requests that it should ideally refuse: calling the user a jerk and...
