Explanations
references to military technology and nuclear weapons
Negative Logits
kindly      -0.17
kra         -0.15
istine      -0.14
lore        -0.14
Lug         -0.14
_boxes      -0.14
供          -0.14
Mason       -0.14
alien       -0.13
amon        -0.13
Positive Logits
洲          0.21
шах         0.18
ovit        0.17
strategic   0.16
-cap        0.16
deterrent   0.16
delivery    0.16
okoj        0.16
Strategic   0.15
Pose        0.15
Activations Density 0.027%