INDEX
Explanations
terms related to medical conditions and diseases, especially those involving physical ailments and disorders
words related to physical conditions and specific individuals
New Auto-Interp
Negative Logits
behalf
-0.65
Pathfinder
-0.61
judgment
-0.61
unfl
-0.61
prosecut
-0.61
judgement
-0.59
女
-0.59
borders
-0.59
appro
-0.58
DEM
-0.58
POSITIVE LOGITS
phy
1.53
ptoms
1.03
asus
1.03
xit
0.97
lla
0.96
onies
0.93
xia
0.93
lly
0.90
bia
0.89
nce
0.88
Activations Density 0.008%