INDEX
Explanations
references to heroic figures or characters
references to heroes in various contexts
New Auto-Interp
Negative Logits
aeda
-0.90
orie
-0.85
ntil
-0.84
ateur
-0.82
ño
-0.78
rupt
-0.76
aton
-0.76
igree
-0.73
imentary
-0.73
vant
-0.72
POSITIVE LOGITS
heroes
0.93
heroine
0.83
ku
0.77
Reborn
0.75
hero
0.74
ically
0.74
acters
0.69
Saur
0.65
rities
0.65
Pengu
0.64
Activations Density 0.012%