Explanations
references to the term "Pokémon"
mentions of the Pokémon franchise and its related content
Negative Logits
iffe      -0.85
)=(       -0.73
rikes     -0.73
perial    -0.70
outer     -0.69
ijk       -0.68
¿½        -0.68
cens      -0.67
bush      -0.67
ickson    -0.67
Positive Logits
Dex       0.95
athlon    0.86
pokemon   0.86
Pokemon   0.84
Pokémon   0.83
Pokémon   0.82
Poké      0.80
Trainer   0.79
Pokemon   0.79
Pikachu   0.77
Activations Density: 0.026%