INDEX
Explanations
terms related to medical conditions and body parts, particularly focusing on heart-related issues
phrases related to health issues and risks involving body parts
New Auto-Interp
Negative Logits
ãĤ¤ãĥĪ
-0.93
Reward
-0.84
ebted
-0.83
quished
-0.78
Savings
-0.77
inational
-0.76
Intervention
-0.75
isoft
-0.74
Tradable
-0.74
aution
-0.74
POSITIVE LOGITS
thighs
1.19
necks
1.18
torso
1.14
shoulders
1.14
legs
1.13
limbs
1.11
neck
1.10
lungs
1.09
throat
1.09
palate
1.06
Activations Density 0.279%