INDEX
Explanations
phrases related to governmental and military subjects
New Auto-Interp
Negative Logits
ba
-0.15
Moff
-0.14
Ba
-0.14
Gover
-0.14
Bd
-0.14
opt
-0.14
MF
-0.14
_cached
-0.14
tron
-0.14
Spect
-0.14
POSITIVE LOGITS
åIJ¹
0.18
iere
0.17
iert
0.17
IDGE
0.16
ople
0.16
ibling
0.16
lom
0.16
Brothers
0.15
bre
0.15
arser
0.15
Activations Density 0.023%