INDEX
Explanations
words related to physical health and condition
descriptors of physical well-being and health status
New Auto-Interp
Negative Logits
atars
-0.70
utters
-0.70
avorite
-0.70
arching
-0.69
eeper
-0.63
rench
-0.59
Badge
-0.59
Ging
-0.59
rap
-0.59
Ban
-0.59
POSITIVE LOGITS
Dragonbound
0.90
resembling
0.71
nered
0.70
infancy
0.68
handwriting
0.68
financially
0.67
arial
0.66
igue
0.66
mia
0.66
manship
0.65
Activations Density 0.075%