INDEX
Explanations
words related to the Pokémon franchise
references to Pokémon and related concepts
New Auto-Interp
Negative Logits
)=(
-0.75
ufact
-0.71
yer
-0.66
cens
-0.65
offic
-0.65
lishes
-0.63
McDonnell
-0.63
perial
-0.62
×Ļ
-0.62
lain
-0.62
POSITIVE LOGITS
okemon
0.96
Dex
0.94
Pokemon
0.92
pokemon
0.91
Pokemon
0.86
Gy
0.80
athlon
0.79
XY
0.79
phan
0.77
walker
0.75
Activations Density 0.016%