INDEX
Explanations
references to health, including healthy lifestyle choices and conditions
references to health and well-being
New Auto-Interp
Negative Logits
ariat
-0.72
raped
-0.68
ammy
-0.66
angrily
-0.66
ère
-0.65
acqu
-0.64
Huss
-0.63
skill
-0.63
illegal
-0.63
Marriott
-0.61
POSITIVE LOGITS
isot
1.00
skepticism
0.87
dose
0.83
fats
0.83
lifestyles
0.81
iterranean
0.78
ceans
0.74
habits
0.73
scratches
0.73
eating
0.73
Activations Density 0.037%