BullshitBench is a dedicated benchmark designed to test whether AI models can identify nonsensical questions, or if they will instead provide confident answers to queries that have no valid basis. The initial results
BullshitBench tests whether AI models can detect nonsensical questions—or if they'll confidently answer them anyway. The results are dire.
BullshitBench基准测试旨在检测AI模型能否识别无意义的问题,还是会不管问题是否合理都自信给出答案,而目前的测试结果十分糟糕。
The core concept of this benchmark is simple:
我们在隐私政策里藏了一次瑞士免费旅行,两周后有人发现了
We Hid a Free Trip to Switzerland in Our Privacy Policy. Someone Found It in 2 Weeks.
At Cape, we've always said that privacy shouldn't be buried
关于索引,那些你可能不知道的事
Things You Didn't Know About Indexes
读取变快,写入变慢 / Reads get faster, writes get slower
Let's start with things you probably did know.
让我们从你可能确实知道的事情开始。
A database index is
脑腐工业综合体:你不是分心,你正在被加工
You Are Not Distracted. You Are Being Processed.
你不是分心,你正在被加工 / You are not distracted, you are being processed
Peace be upon you, fellow digital wanderer.
或许没有比网络俚语"brainrot"(脑腐)