INDEX
Explanations
references to historical Communist governing bodies and structures
New Auto-Interp
Negative Logits
ÑĦÑĥнда
-0.19
uru
-0.17
krev
-0.16
$MESS
-0.16
ê½
-0.15
acher
-0.15
emmel
-0.15
коÑĢол
-0.14
иденÑĤ
-0.14
бÑĥÑĢг
-0.14
POSITIVE LOGITS
Stalin
0.25
Party
0.24
Lenin
0.24
GPU
0.23
NK
0.21
GPU
0.20
PARTY
0.20
party
0.19
Party
0.19
gpu
0.18
Activations Density 0.070%