INDEX
Explanations
mentions of specific entertainment franchises and media titles
New Auto-Interp
Negative Logits
hers
-0.67
bat
-0.64
rn
-0.63
occ
-0.62
omorph
-0.61
ible
-0.61
het
-0.61
ele
-0.60
enrol
-0.60
hel
-0.59
POSITIVE LOGITS
etheless
0.96
Citiz
0.89
akeru
0.71
Paste
0.69
senal
0.66
Nanto
0.64
Nept
0.63
XVI
0.63
Fernandez
0.63
spons
0.63
Activations Density 9.110%