INDEX
Explanations
words related to medical or health contexts, specifically indicating conditions or treatments
Negative Logits
waard     -0.49
lossenen  -0.47
apellido  -0.47
playable  -0.46
griffen   -0.45
dila      -0.45
Konink    -0.44
entes     -0.44
venes     -0.44
venge     -0.43
Positive Logits
ie    2.69
IE    2.25
ies   1.96
IES   1.60
ie    1.58
IE    1.39
Ie    1.29
ieg   1.27
iee   1.20
Ie    1.13
Activations Density: 1.992%