INDEX
Explanations
mentions of overweight and obese individuals
terms related to weight and obesity
New Auto-Interp
Negative Logits
onto
-0.76
ource
-0.74
Skydragon
-0.68
Learned
-0.65
Mé
-0.64
externalActionCode
-0.64
ources
-0.64
uncture
-0.63
arbon
-0.63
Pharmaceutical
-0.63
POSITIVE LOGITS
overweight
1.19
obese
0.98
eleph
0.92
carbohyd
0.84
esity
0.83
gorilla
0.81
iless
0.77
nodd
0.76
practition
0.75
tremend
0.74
Activations Density 0.011%