Explanations
references to body weight and condition, particularly relating to fat and obesity
Negative Logits
Token      Logit
eer        -0.19
oce        -0.16
itzer      -0.16
illard     -0.16
esine      -0.16
ingroup    -0.15
orf        -0.15
ingen      -0.15
auc        -0.15
hart       -0.15
Positive Logits
Token      Logit
igue       0.37
ality      0.32
ima        0.30
ig         0.28
uous       0.26
igu        0.25
ness       0.24
IMA        0.22
igure      0.22
ih         0.22
Activations Density: 0.016%