INDEX
Explanations
positive reviews of video games
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1036
+0.10
0.3%
453
+0.09
0.3%
1343
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1036
+0.10
0.02
1921
+0.09
0.04
1224
+0.09
0.04
Negative Logits
FlatStyle
-0.60
Viited
-0.59
<bos>
-0.56
Horizonte
-0.54
Activités
-0.53
AutoScaleMode
-0.53
gradualmente
-0.52
ventud
-0.52
Whence
-0.51
Lajos
-0.51
POSITIVE LOGITS
uncin
0.88
glab
0.70
xenia
0.70
pancre
0.69
anhyd
0.69
tubercle
0.66
glau
0.66
coar
0.65
macrop
0.65
lepid
0.63
Activations Density 0.227%