INDEX
Explanations
mentions of the benefits and attractiveness of playing video games
Neuron Alignment
Index   Value   % of L₁
198     +0.13   0.4%
1265    +0.10   0.3%
581     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
198     +0.13      0.06
908     +0.10      0.03
1243    +0.10      0.05
Negative Logits
Token       Logit
rispond     -0.62
catég       -0.51
anyone      -0.50
setti       -0.50
registrer   -0.50
Népesség    -0.50
Alat        -0.49
Passe       -0.49
Anything    -0.49
Anyone      -0.49
Positive Logits
Token          Logit
SneakyThrows   0.79
unwarran       0.77
shewn          0.76
BIBSYS         0.71
liberality     0.69
whofe          0.68
jopa           0.65
ftre           0.65
pymongo        0.63
laft           0.62
Activations Density: 0.464%