INDEX
Explanations
words related to weight, weight loss, and physical attributes
New Auto-Interp
Negative Logits
Tul
-0.78
WP
-0.77
sky
-0.73
PBS
-0.72
Philips
-0.70
Solitaire
-0.69
Noir
-0.69
Bhar
-0.68
Prairie
-0.68
Ascend
-0.67
POSITIVE LOGITS
weight
1.25
weights
1.14
physique
1.10
weight
1.07
Weight
1.06
weights
0.99
fat
0.96
heaviest
0.93
Weight
0.92
stiffness
0.86
Activations Density 2.254%