INDEX
Explanations
mentions of gaming consoles and platforms
New Auto-Interp
Negative Logits
TURE
-0.18
igham
-0.16
Kore
-0.15
prung
-0.14
Bij
-0.14
INET
-0.14
kre
-0.14
Copyright
-0.14
IGH
-0.14
Spread
-0.13
POSITIVE LOGITS
ìŰ
0.18
rm
0.15
os
0.14
violent
0.14
yen
0.14
Viol
0.14
rom
0.14
Err
0.14
Os
0.14
violent
0.14
Activations Density 0.008%