INDEX
Explanations
phrases that indicate personal struggles with weight and health-related issues
New Auto-Interp
Negative Logits
Gardens
-0.19
Golf
-0.18
Gesture
-0.17
Goods
-0.16
Grinder
-0.16
Genome
-0.16
Glam
-0.16
Goods
-0.16
Gap
-0.15
Graphics
-0.15
POSITIVE LOGITS
gain
0.81
gains
0.67
gain
0.64
Gain
0.63
gained
0.60
Gain
0.58
gaining
0.56
_gain
0.56
-g
0.55
_GAIN
0.48
Activations Density 0.124%