INDEX
Explanations
nouns related to diseases or health issues
terms related to diseases or medical conditions
New Auto-Interp
Negative Logits
etheless
-0.81
dies
-0.80
icals
-0.79
nesday
-0.78
istry
-0.75
asca
-0.72
anyl
-0.71
ding
-0.70
ishing
-0.70
ishop
-0.68
POSITIVE LOGITS
ppe
0.90
ño
0.84
versa
0.81
mble
0.81
lda
0.77
Amend
0.76
eq
0.76
FontSize
0.74
olate
0.73
ller
0.71
Activations Density 0.089%