INDEX
Explanations
references to cinemas or movies
references to cinemas and their various aspects
New Auto-Interp
Negative Logits
lies
-0.78
lynn
-0.69
essee
-0.67
lying
-0.66
ually
-0.65
ional
-0.65
wid
-0.64
hips
-0.64
iang
-0.64
bands
-0.64
POSITIVE LOGITS
ovies
0.94
cinem
0.94
goers
0.89
atography
0.88
cinema
0.88
marqu
0.83
theaters
0.82
premie
0.79
theat
0.79
isoft
0.77
Activations Density 0.026%