INDEX
Explanations
terms related to health and medical topics
Neuron Alignment
Index    Value    % of L₁
184      +0.14    0.8%
451      +0.12    0.7%
87       +0.11    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
184      +0.14       0.01
386      +0.12       0.01
378      +0.11       0.01
Negative Logits
ĭ        -1.90
ĨĴ       -1.87
Ļª       -1.81
¯        -1.79
ı        -1.72
´        -1.72
ľ        -1.70
µ        -1.66
deal     -1.60
Į        -1.59
Positive Logits
care          1.78
grass         1.66
iterranean    1.58
lichen        1.53
ocs           1.53
birds         1.53
table         1.52
ubot          1.52
idelines      1.45
assic         1.40
Activations Density 0.006%