INDEX
Explanations
titles and references to works of literature or film
New Auto-Interp
Negative Logits
alach
-0.18
Alman
-0.15
whore
-0.15
urum
-0.14
anale
-0.14
пÑĢом
-0.14
ä»ĭ
-0.14
Fuck
-0.14
autopsy
-0.14
sightings
-0.13
POSITIVE LOGITS
hypnot
0.17
Monte
0.16
fian
0.15
Redskins
0.15
uye
0.15
dual
0.15
kod
0.15
Bols
0.14
orient
0.14
Satan
0.14
Activations Density 0.039%