INDEX
Explanations
instances of the word "films"
references to films and movies
New Auto-Interp
Negative Logits
owder
-0.68
legislature
-0.63
FUL
-0.60
Fulton
-0.59
pse
-0.58
sie
-0.57
Islanders
-0.57
GY
-0.57
ļé
-0.57
assis
-0.56
POSITIVE LOGITS
movies
0.96
ensitive
0.95
uggest
0.95
earch
0.93
films
0.93
chool
0.92
ilver
0.89
ynthesis
0.88
ettings
0.88
ovies
0.87
Activations Density 0.048%