INDEX
    Explanations

    requests for help and advice

    New Auto-Interp
    Negative Logits
    neutral
    -0.17
     neutral
    -0.16
    hra
    -0.16
     Neutral
    -0.15
    OfWork
    -0.15
    -neutral
    -0.14
     Claw
    -0.14
     Petr
    -0.14
    iker
    -0.14
    Neutral
    -0.14
    POSITIVE LOGITS
    etur
    0.17
    IBE
    0.17
     scopes
    0.15
    ÅĻeb
    0.15
    ABA
    0.14
    ibo
    0.14
    ιÏĥÏĦο
    0.14
    Scope
    0.14
    SCO
    0.14
     Earn
    0.14
    Act Density 0.051%

    No Known Activations