INDEX
Explanations
references to cute Pokémon and nostalgic gaming experiences
New Auto-Interp
Negative Logits
èľĺèĽĽ
-0.19
XB
-0.17
ocs
-0.16
gaard
-0.15
Heller
-0.14
von
-0.14
Bang
-0.14
Shepherd
-0.14
Nordic
-0.14
Kushner
-0.14
POSITIVE LOGITS
Trainer
0.28
trainer
0.28
trainer
0.27
trainers
0.26
Pokémon
0.24
Pokemon
0.24
pokemon
0.23
Pok
0.23
Gym
0.23
Pokemon
0.22
Activations Density 0.035%