INDEX
Explanations
instances of the word "Game" along with numbers that signify different gaming-related contexts
Negative Logits
ors      -0.19
est      -0.17
ter      -0.17
licer    -0.15
rible    -0.15
ri       -0.15
ney      -0.15
Games    -0.14
witch    -0.14
ebi      -0.14
Positive Logits
Cube      0.20
Boy       0.20
cube      0.19
stop      0.19
cock      0.18
changer   0.18
pad       0.18
changing  0.17
Stop      0.17
changer   0.17
Activations Density 0.021%