INDEX
Explanations
movie titles
references to superhero movies and related franchises
New Auto-Interp
Negative Logits
heet
-0.76
iery
-0.70
kered
-0.66
paddle
-0.63
anguage
-0.63
GY
-0.62
ullivan
-0.61
hap
-0.61
imposed
-0.60
bed
-0.60
POSITIVE LOGITS
Of
1.04
Trilogy
1.03
Crusade
0.92
Wars
0.89
Heist
0.89
Awakens
0.88
Hunters
0.87
Origins
0.87
Resurrection
0.84
Returns
0.84
Activations Density 0.164%