INDEX
Explanations
phrases related to defense or support
expressions related to political loyalty and defense
New Auto-Interp
Negative Logits
swick
-0.61
0004
-0.59
NN
-0.56
Float
-0.55
Roz
-0.54
nesday
-0.49
emonium
-0.48
Zy
-0.48
Crusher
-0.47
contacted
-0.47
POSITIVE LOGITS
sqor
0.59
)?
0.59
nowadays
0.58
Downloadha
0.57
?).
0.57
interstitial
0.56
})
0.56
)</
0.54
idae
0.50
inas
0.50
Activations Density 1.448%