INDEX
Explanations
phrases where the speaker self-identifies as an AI (assistant saying it is an AI).
New Auto-Interp
Negative Logits
mixture
-0.07
jeho
-0.06
Hot
-0.06
Marshall
-0.06
TAX
-0.06
cabbage
-0.06
анные
-0.06
μέχρι
-0.06
.getSelectedItem
-0.06
никами
-0.06
POSITIVE LOGITS
990
0.07
plode
0.07
Ні
0.07
\Active
0.06
_OPT
0.06
expression
0.06
ेण
0.06
lookup
0.06
'aff
0.06
=explode
0.06
Activations Density 0.021%