INDEX
    Explanations

    phrases that indicate personal struggles with weight and health-related issues

    New Auto-Interp
    Negative Logits
     Gardens
    -0.19
     Golf
    -0.18
     Gesture
    -0.17
    Goods
    -0.16
     Grinder
    -0.16
     Genome
    -0.16
     Glam
    -0.16
     Goods
    -0.16
     Gap
    -0.15
    Graphics
    -0.15
    POSITIVE LOGITS
     gain
    0.81
     gains
    0.67
    gain
    0.64
     Gain
    0.63
     gained
    0.60
    Gain
    0.58
     gaining
    0.56
    _gain
    0.56
    -g
    0.55
    _GAIN
    0.48
    Act Density 0.124%

    No Known Activations