INDEX
Explanations
phrases related to errors or faults
Negative Logits
token       logit
rollers     -0.81
well        -0.73
amen        -0.73
electric    -0.73
azine       -0.70
bians       -0.69
zona        -0.69
weeney      -0.68
agonists    -0.67
ramid       -0.67
Positive Logits
token       logit
mistakes    0.92
mistake     0.82
perpetrated 0.76
fulness     0.74
careless    0.74
mistaken    0.73
mishand     0.73
Gamble      0.72
é»Ĵ         0.71
gamb        0.70
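
The entry `é»Ĵ` in the positive logits looks like a raw byte-level BPE vocabulary string rather than readable text. The sketch below assumes the model uses the GPT-2 style byte-to-unicode table (the page does not say which tokenizer is in use) and shows how such a string can be mapped back to UTF-8. The function names `gpt2_byte_decoder` and `decode_token` are hypothetical helpers written for this illustration.

```python
def gpt2_byte_decoder():
    """Build the inverse of the GPT-2 byte-to-unicode table (assumed tokenizer)."""
    # Bytes that GPT-2 keeps as their own printable character.
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    mapping = {}
    n = 0
    for b in range(256):
        if b in keep:
            mapping[chr(b)] = b
        else:
            # Remaining bytes are shifted to code points 256, 257, ...
            mapping[chr(256 + n)] = b
            n += 1
    return mapping


def decode_token(token):
    """Turn a byte-level vocabulary string back into ordinary text."""
    table = gpt2_byte_decoder()
    raw = bytes(table[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("é»Ĵ"))
```

If the assumption about the tokenizer holds, the same helper also turns a leading `Ġ` into a space, which is how word-initial tokens are usually stored in that vocabulary.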
Activations Density 0.030%
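
As a rough illustration of what a density figure like this can mean, the sketch below assumes density is the fraction of sampled tokens on which the feature has a non-zero activation. That definition is an assumption, and the activation values are synthetic toy data, not values taken from this feature.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold` (assumed definition)."""
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Synthetic example: 3 active tokens out of 10,000 sampled tokens.
acts = [0.0] * 10_000
for i in (12, 4_500, 9_001):
    acts[i] = 1.7

print(f"{activation_density(acts):.3%}")
```

Under this reading, a density of 0.030% corresponds to the feature firing on about 3 of every 10,000 tokens.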