INDEX
    Explanations

    texts related to academic writing and formal language, potentially including math symbols

    references to personal experiences and emotional reflections

    Negative Logits
    ',"
    -0.71
    imentary
    -0.70
    "],
    -0.69
    kefeller
    -0.68
    ];
    -0.66
    igent
    -0.65
     Newark
    -0.63
     ];
    -0.62
    odge
    -0.62
    ogether
    -0.61
    Positive Logits
    âĢ
    1.23
     anime
    1.15
     Elsa
    1.14
     Overwatch
    1.10
     fandom
    1.10
     Blizz
    1.08
     GamerGate
    1.05
     reddit
    1.02
     Gamergate
    1.01
    ãĢİ
    1.01
    Activation Density: 0.905%

    No Known Activations