INDEX
Explanations
words related to military operations and warfare
instances of the word "err" indicating errors or mistakes in context
Negative Logits
esville      -0.77
xual         -0.72
eph          -0.69
ĺħ           -0.69
waves        -0.68
¥            -0.66
creen        -0.66
clud         -0.64
Sing         -0.64
rooms        -0.62
Positive Logits
idge         0.98
rr           0.97
andom        0.95
ange         0.92
untled       0.91
antly        0.90
ands         0.87
utherford    0.87
anger        0.86
ific         0.84
Activations Density: 0.016%