INDEX
Explanations
personal experiences and feelings expressed by the speaker
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1978
+0.16
0.5%
1895
+0.12
0.4%
674
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.16
0.08
1634
+0.12
0.05
862
+0.12
0.04
Negative Logits
updateTime
-0.61
guma
-0.58
Keny
-0.54
الاطلاع
-0.52
vų
-0.49
panahon
-0.49
Manbalar
-0.48
getVersion
-0.48
getTime
-0.47
Punj
-0.47
POSITIVE LOGITS
//*/
0.74
alre
0.66
stihl
0.62
overla
0.60
formules
0.60
illustre
0.59
//...
0.58
cahier
0.58
fputs
0.58
vort
0.56
Activations Density 0.199%