INDEX
Explanations
references to the concept of fine things or quality
Neuron Alignment
Index   Value   % of L₁
629     +0.13   0.5%
168     +0.12   0.5%
1604    +0.12   0.5%
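The "% of L₁" column is not defined in the source. One plausible reading, offered here only as an assumption, is that it gives each neuron's absolute weight as a share of the L₁ norm of the whole feature direction. The sketch below illustrates that reading on a hypothetical four-element vector; the function name `l1_share` and the example values are invented for illustration and do not come from the dashboard.

```python
def l1_share(weights):
    """Return each weight's absolute value as a fraction of the vector's L1 norm.

    Assumption: this is what a "% of L1" column reports; the source does not say.
    """
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        # An all-zero vector has no meaningful shares.
        return [0.0 for _ in weights]
    return [abs(w) / l1 for w in weights]


# Hypothetical feature direction (not taken from the dashboard).
direction = [0.5, -0.25, 0.125, 0.125]
shares = l1_share(direction)
```

Under this reading, a share of 0.5% for a value near 0.13 would mean the weight is spread thinly across many neurons rather than concentrated in a few.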
Correlated Neurons
Index   P. Corr.   Cos Sim.
629     +0.13      0.02
1351    +0.12      0.02
1634    +0.12      0.02
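"P. Corr." and "Cos Sim." presumably stand for Pearson correlation and cosine similarity. The source does not say which vectors are being compared, so the following is only a sketch of how the two measures differ: Pearson correlation is the cosine similarity of the two vectors after each has been mean-centered. The function names and inputs are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Undefined for a zero vector; return 0.0 by convention in this sketch.
        return 0.0
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as cosine similarity of mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine_similarity([x - mean_a for x in a], [y - mean_b for y in b])
```

Because of the centering step, the two measures can disagree sharply: two vectors with a large shared offset have a cosine similarity near 1 even when their correlation is negative.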
Negative Logits
toolStripButton   -0.56
XmlAccessorType   -0.50
hashlib           -0.50
ciasc             -0.48
setOpaque         -0.47
mettez            -0.47
couverture        -0.47
rabat             -0.46
serai             -0.46
Infatti           -0.45
Positive Logits
Fine    1.22
fine    1.22
FINE    1.19
fine    1.18
Fine    1.16
FINE    1.00
fines   0.99
Fines   0.90
fined   0.87
fines   0.80
Activations Density 0.075%