INDEX
Explanations
connections to military applications and influence
New Auto-Interp
Negative Logits
undra
-0.17
apore
-0.15
alous
-0.14
ilos
-0.14
istrat
-0.14
ingt
-0.14
isle
-0.14
abis
-0.14
chaft
-0.14
.synthetic
-0.14
POSITIVE LOGITS
inic
0.16
/mod
0.15
demand
0.14
Záp
0.14
ICC
0.14
Cyr
0.14
vest
0.14
Fur
0.13
intact
0.13
caveat
0.13
Activations Density 0.448%