Explanations
terms related to medical conditions or treatments that involve inhibition or suppression
Neuron Alignment
Index   Value   % of L₁
376     +0.15   0.8%
320     +0.13   0.7%
253     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
320     +0.15      0.01
408     +0.13      0.01
200     +0.11      0.00
Negative Logits
ĨĴ    -2.56
ı     -2.43
ĸ´    -2.29
Ķ     -2.29
½     -2.27
¿     -2.18
¸     -2.17
ĵ     -2.17
Į     -2.17
©     -2.15
Positive Logits
ulate      1.86
ulator     1.77
ulated     1.76
ulled      1.67
erals      1.51
ulation    1.47
ulators    1.45
ulating    1.45
element    1.43
"}](#      1.42
Activations Density 0.004%