INDEX
Explanations
information related to rankings and positions within different contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
752
+0.12
0.3%
1042
+0.11
0.3%
517
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
177
+0.12
0.05
416
+0.11
0.05
81
+0.08
0.03
Negative Logits
awaiter
-0.59
!="")
-0.57
ItemBackground
-0.54
=""/>
-0.50
=="")
-0.50
FetchType
-0.49
müm
-0.49
NSCoder
-0.49
外部連結
-0.49
smtplib
-0.48
POSITIVE LOGITS
indestru
0.84
uninten
0.75
impra
0.75
cristina
0.74
reluct
0.74
toledo
0.74
santiago
0.73
ricardo
0.72
roberto
0.71
hcm
0.70
Activations Density 0.318%