INDEX
Explanations
terms related to medical conditions and treatments
New Auto-Interp
Negative Logits
o
-0.31
es
-0.31
al
-0.28
ion
-0.22
aliz
-0.22
ed
-0.21
oj
-0.20
t
-0.20
edir
-0.20
oÄį
-0.20
POSITIVE LOGITS
rans
0.24
ting
0.24
ter
0.23
rop
0.21
yped
0.21
tingham
0.20
swana
0.20
tery
0.20
rophic
0.19
assium
0.19
Activations Density 0.028%