INDEX
Explanations
references to the Pokémon franchise
mentions of Pokémon
Negative Logits
iffe      -0.88
heed      -0.67
ickson    -0.66
ubb       -0.66
EFF       -0.66
iller     -0.66
DCS       -0.65
perial    -0.64
tiss      -0.64
inges     -0.64
Positive Logits
Dex       0.96
Pokémon   0.92
Trainer   0.91
pokemon   0.88
Pokémon   0.87
Poké      0.86
athlon    0.84
Pokemon   0.84
Pikachu   0.80
Pokemon   0.80
Activations Density: 0.044%