INDEX
Explanations
medicine or health-related terms, especially related to mental health and psychiatry
Neuron Alignment
Index    Value    % of L₁
50       +0.19    0.7%
1562     +0.08    0.3%
2036     +0.08    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
792      +0.19       0.05
1753     +0.08       0.04
2045     +0.08       0.04
Negative Logits
<bos>              -2.74
CreateTagHelper    -0.77
CPtr               -0.69
MessageOf          -0.67
JspWriter          -0.66
TagHelpers         -0.65
ുറ                 -0.63
Descripció         -0.61
ViewFeatures       -0.60
}{@                -0.59
Positive Logits
ecru         1.66
tolerably    1.55
swarovski    1.55
unwarran     1.51
increa       1.48
disagre      1.47
matel        1.44
hairc        1.44
Manufact     1.44
maneu        1.44
Activations Density 0.517%