INDEX
Explanations
numeric values and their associations in the context of data
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.24
0.8%
1614
+0.09
0.3%
1741
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1614
+0.24
0.08
776
+0.09
0.08
499
+0.08
0.07
Negative Logits
<bos>
-1.96
ArgumentParser
-0.71
blurRadius
-0.71
HasIndex
-0.67
BuildContext
-0.66
DataMember
-0.65
HasAnnotation
-0.64
displayquote
-0.64
addWidget
-0.63
TagHelper
-0.62
POSITIVE LOGITS
maneu
1.87
Juf
1.65
impra
1.61
increa
1.59
affor
1.57
milf
1.56
bandung
1.56
napoli
1.50
🤣🤣
1.50
shenan
1.48
Activations Density 0.398%