INDEX
Explanations
occurrences of specific movie titles
references to titles of films or literary works that begin with "The."
New Auto-Interp
Negative Logits
patiently
-0.71
contributed
-0.70
lished
-0.70
omever
-0.65
endeav
-0.64
posted
-0.64
elsen
-0.64
authored
-0.63
perse
-0.63
behalf
-0.63
POSITIVE LOGITS
atre
1.17
oret
1.16
orem
1.09
odor
1.04
Simpsons
1.03
resa
1.02
ories
1.00
Greatest
1.00
sis
0.98
Stranger
0.95
Activations Density 0.120%