INDEX
Explanations
phrases related to actions or commands
action words that indicate gameplay and interaction
Negative Logits
cius        -0.71
thodox      -0.67
unlaw       -0.67
aith        -0.64
icum        -0.64
rake        -0.63
Applic      -0.63
minecraft   -0.61
LGBT        -0.60
iosyn       -0.60
Positive Logits
yourself     1.04
yourselves   0.97
your         0.83
Tube         0.65
animate      0.61
butterflies  0.61
YOUR         0.60
lda          0.60
pez          0.58
realise      0.58
Activations Density: 0.190%