INDEX
Explanations
references to popular movie titles, particularly superhero and action films
references to the Guardians of the Galaxy franchise and its associated films
New Auto-Interp
Negative Logits
hus
-0.81
obar
-0.81
minist
-0.76
opathic
-0.73
administ
-0.72
grain
-0.71
Blackwell
-0.69
istically
-0.69
ifling
-0.69
yre
-0.67
POSITIVE LOGITS
Awakens
0.86
Franchise
0.84
buster
0.83
sequels
0.82
Turtles
0.82
villains
0.80
Andromeda
0.78
movies
0.76
naire
0.74
Transformers
0.74
Activations Density 0.042%