#ai-behavior — AI News & Research

🧐 Safety LessWrong 1 min read

Current AIs seem pretty misaligned to me

Many people—especially AI company employees [1] —believe current AI systems are well-aligned in the sense of genuinely trying to do what they're supposed to do (e.g., following their spec or constitution, obeying a reasonable interpretation of instructions). [2] I disagree. Current AI systems seem pretty misaligned to me in a mundane behavioral sense: they oversell their work, downplay or fail…

#ai-alignment #ai-behavior #ai-safety

🕐 9 days ago

Read →

DeepTrendLab — Top 50 AI Sources, Research & News

Current AIs seem pretty misaligned to me