INDEX
    Explanations

    Twitter handles

    New Auto-Interp
    Negative Logits
     Bauer
    -0.75
    ogie
    -0.74
     Hawkins
    -0.71
     Julie
    -0.71
     Kuro
    -0.69
    Ñı
    -0.69
     disapp
    -0.69
    767
    -0.69
     Garry
    -0.68
    187
    -0.68
    POSITIVE LOGITS
    T
    1.35
    t
    1.34
     Tet
    1.26
    Ts
    1.23
    TD
    1.22
    td
    1.21
    TT
    1.21
    ts
    1.21
    TC
    1.20
    TS
    1.17
    Act Density 0.874%

    No Known Activations