INDEX
Explanations
references to Hollywood
mentions of Hollywood and its related figures
New Auto-Interp
Negative Logits
İĭ
-0.85
arity
-0.84
tnc
-0.78
avez
-0.77
cific
-0.76
uter
-0.74
unin
-0.74
rack
-0.73
tf
-0.70
gs
-0.69
POSITIVE LOGITS
Reporter
1.20
movies
1.03
blockbuster
1.01
studios
1.01
Studios
1.01
Hollywood
0.99
Hills
0.98
celebrities
0.98
films
0.96
Pictures
0.92
Activations Density 0.037%