INDEX
Explanations
words and phrases related to medical terminology and symptoms
New Auto-Interp
Negative Logits
sed
-0.18
-fw
-0.16
male
-0.16
mus
-0.14
bru
-0.14
клÑĥ
-0.14
pus
-0.14
докÑĥм
-0.14
ogenerated
-0.13
oproject
-0.13
POSITIVE LOGITS
заболева
0.26
гем
0.23
воÑģпал
0.23
заболеваний
0.22
инÑĦек
0.21
имÑĥ
0.20
заболеваниÑı
0.20
гоÑĢм
0.20
ÑģоÑģÑĥд
0.19
заÑħвоÑĢÑİ
0.19
Activations Density 0.008%