Explanations
punctuation in combination with short phrases
Negative Logits
irie         -0.80
enhagen      -0.74
gang         -0.73
robe         -0.72
enaries      -0.71
ument        -0.67
escription   -0.65
ioxide       -0.64
everal       -0.64
ļéĨĴ         -0.64
Positive Logits
huh          1.23
especially   1.10
considering  1.07
albeit       1.03
eh           1.02
especially   0.99
but          0.99
though       0.97
Especially   0.93
although     0.92
Activations Density 0.225%