Explanations
references to heroes and associated themes
Negative Logits
Token      Logit
dens       -0.58
ers        -0.57
pck        -0.57
informa    -0.57
Nü         -0.56
UNT        -0.55
Sanders    -0.54
kken       -0.53
Wander     -0.52
र          -0.51
Positive Logits
Token      Logit
hero       2.34
Hero       2.26
HERO       2.25
heroes     2.22
hero       2.18
Hero       2.07
HERO       2.03
Heroes     1.93
heroes     1.91
Heroes     1.79
Activations Density 0.074%
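
To make the density figure concrete, here is a minimal sketch that assumes "activations density" means the fraction of token positions on which the feature fires (activation above zero). The function name, the threshold, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: density is (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 74 of which are active.
acts = [0.0] * 99_926 + [1.5] * 74
density = activation_density(acts)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.074% corresponds to roughly 74 active positions per 100,000 tokens, which fits a feature tied to a narrow family of tokens.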