Explanations
references to U.S. military entities and operations, particularly relating to Special Forces and Navy SEALs
Negative Logits
Token      Logit
éĽ         -0.15
spoiler    -0.14
276        -0.14
odia       -0.14
ìĪľ        -0.14
Newman     -0.14
nik        -0.14
humane     -0.13
ibir       -0.13
èĴ         -0.13
Positive Logits
Token        Logit
kre          0.16
ipse         0.15
.gc          0.15
práv         0.14
adden        0.14
/operators   0.14
ÑĤаÑħ        0.14
Means        0.14
apsed        0.13
hod          0.13
Activations Density: 0.031%