Explanations
references to Pokémon and related terminology
Negative Logits
gerald            -0.17
.scalablytyped    -0.16
Gam               -0.16
559               -0.16
anship            -0.15
enguin            -0.14
ez                -0.14
omore             -0.14
uin               -0.14
081               -0.14
Positive Logits
émon     0.24
itto     0.19
emons    0.18
éd       0.18
EMON     0.17
ég       0.17
Pok      0.17
orny     0.17
mon      0.16
olen     0.15
Activations Density 0.010%