INDEX
Explanations
descriptions related to locations or events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
964
+0.15
0.5%
906
+0.14
0.4%
939
+0.14
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
382
+0.15
0.10
22
+0.14
0.07
964
+0.14
0.06
Negative Logits
schal
-0.85
Lma
-0.79
Væ
-0.78
År
-0.75
Noice
-0.75
Lmfao
-0.75
Vå
-0.72
Gå
-0.70
sako
-0.69
Ikr
-0.69
POSITIVE LOGITS
Sklici
0.65
***!
0.61
Zunanje
0.60
whither
0.59
AndEndTag
0.57
bitField
0.57
Located
0.57
ConstraintMaker
0.56
There
0.55
Where
0.55
Activations Density 0.732%