INDEX
    Explanations

    messages related to personal development and empowerment

    motivational language focused on self-improvement and personal agency

    NEGATIVE LOGITS
    —"
    -1.02
    ドラ
    -0.86
    Enlarge
    -0.84
    "—
    -0.73
    )—
    -0.68
    "]
    -0.68
    ];
    -0.67
    ū
    -0.66
    ō
    -0.66
     ―
    -0.65
    POSITIVE LOGITS
     thats
    1.25
     alot
    1.24
     ie
    1.21
     but
    1.18
     haha
    1.18
     lol
    1.15
     whereas
    1.15
     tho
    1.12
     BUT
    1.12
     doesnt
    1.10
    Activation Density: 1.496%

    No Known Activations