INDEX
Explanations
mentions of specific movie titles and characters from a superhero franchise
New Auto-Interp
Negative Logits
ickr
-0.85
jri
-0.84
mediate
-0.83
unicip
-0.82
elong
-0.82
tu
-0.81
administr
-0.80
benches
-0.79
housing
-0.79
prus
-0.77
POSITIVE LOGITS
trilogy
1.51
Chronicles
1.48
sequel
1.46
Trilogy
1.44
sequels
1.42
novels
1.28
starring
1.26
Adventures
1.26
Dracula
1.25
Episode
1.23
Activations Density 4.992%