INDEX
Explanations
mentions of specific names or titles related to entertainment media
mentions of specific television shows and notable characters associated with them
New Auto-Interp
Negative Logits
ĺħ
-0.82
xual
-0.79
xtap
-0.74
antha
-0.73
ozo
-0.73
eering
-0.73
quez
-0.72
iasis
-0.71
uration
-0.69
anchester
-0.68
POSITIVE LOGITS
leneck
0.85
gets
0.83
iesel
0.75
EEK
0.71
lev
0.70
hiba
0.68
ynski
0.68
wana
0.67
JB
0.65
YL
0.65
Activations Density 0.093%