Explanations
Medical conditions and health-related terms, with a focus on chronic diseases and physical or mental health issues.
Negative Logits
*/(         -0.91
ramid       -0.89
ertodd      -0.83
ansk        -0.77
animous     -0.77
imates      -0.75
govtrack    -0.75
imov        -0.73
hammad      -0.71
Ĭ±          -0.70
Positive Logits
traumatic      0.88
led            0.84
inflammation   0.81
diseases       0.79
pain           0.79
ling           0.76
disease        0.74
pain           0.73
obstruct       0.73
ity            0.72
Activations Density 6.818%