INDEX
Explanations
references to or mentions of movies
occurrences of the word "movie" and its variations
New Auto-Interp
Negative Logits
ilities
-0.86
withstanding
-0.71
sembly
-0.71
ords
-0.70
Haitian
-0.69
otics
-0.68
odox
-0.67
illin
-0.67
²¾
-0.67
anting
-0.66
POSITIVE LOGITS
theater
1.08
goers
1.08
theaters
1.05
theatre
0.99
movies
0.92
eers
0.91
going
0.88
theat
0.88
movie
0.87
buffs
0.84
Activations Density 0.037%