INDEX
Explanations
words related to a specific movie title or franchise, "Pokemon"
references to the character "Moke."
New Auto-Interp
Negative Logits
itton
-0.73
Lub
-0.70
ugal
-0.69
iltr
-0.66
ENDED
-0.64
ittal
-0.64
ittance
-0.63
inished
-0.62
arians
-0.61
orescent
-0.60
POSITIVE LOGITS
tto
0.97
ls
0.91
eper
0.89
otle
0.84
oking
0.84
rer
0.78
aways
0.78
lift
0.78
leigh
0.77
loo
0.77
Activations Density 0.011%