Explanations
references to weight loss-related topics and their effects
Negative Logits
luv        -0.17
omic       -0.17
qus        -0.16
ptal       -0.15
OMIC       -0.15
uvw        -0.14
ehler      -0.14
thritis    -0.14
subrange   -0.14
_RW        -0.13
Positive Logits
fat        0.30
fat        0.29
Fat        0.27
Fat        0.26
weight     0.26
adip       0.26
burn       0.25
appetite   0.25
Burn       0.24
burns      0.24
Activations Density 0.055%