Explanations
adjectives ending in 'ing' or 'ed'
Neuron Alignment
Index   Value   % of L₁
1127    +0.17   0.7%
281     +0.16   0.7%
2004    +0.16   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1127    +0.17      0.03
281     +0.16      0.03
2004    +0.16      0.03
Negative Logits
XmlAccessType    -0.49
ENODEV           -0.49
disambiguation   -0.44
Cdb              -0.43
AutoScaleMode    -0.43
Dan              -0.41
Senator          -0.41
Ehrungen         -0.39
Volkes           -0.39
بيوتر            -0.39
Positive Logits
FRI      1.08
FRE      1.04
Fri      1.03
Fri      0.98
frie     0.95
tremb    0.95
fri      0.94
FR       0.93
frond    0.93
Fre      0.92
Activations Density: 0.122%