Explanations
references to video games
mentions of video games
Negative Logits
thening    -0.79
wered      -0.73
thens      -0.71
inois      -0.68
inx        -0.68
ricane     -0.66
hips       -0.65
ioxide     -0.65
politic    -0.64
ought      -0.64
Positive Logits
pad            1.17
PLAY           1.07
wright         1.06
cube           1.04
Borderlands    1.03
consoles       0.97
play           0.91
Revolution     0.89
runner         0.86
console        0.84
Activations Density 0.063%