INDEX
Explanations
acronyms and short forms in uppercase characters that contain the substring "UR."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
506
+0.14
0.5%
577
+0.12
0.5%
1806
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
981
+0.14
0.04
577
+0.12
0.03
1174
+0.12
0.02
Negative Logits
DockStyle
-0.53
toHaveBeenCalled
-0.48
Walkover
-0.48
half
-0.47
provider
-0.47
initComponents
-0.46
Sam
-0.45
ชาย
-0.44
LINE
-0.44
PROVIDER
-0.44
POSITIVE LOGITS
shur
1.09
desir
1.04
scrat
1.03
embra
1.01
inev
0.97
excru
0.96
increa
0.96
fuj
0.96
purcha
0.95
?...
0.94
Activations Density 0.165%