INDEX
    Explanations

    text indicating instructions or suggestions

    Negative Logits
    ELD             -0.76
    Cre             -0.58
    MpServer        -0.58
    Fra             -0.57
     Fundamental    -0.57
    Pos             -0.53
     wheelchair     -0.52
     hurts          -0.52
     harmed         -0.51
     Geh            -0.51
    Positive Logits
     consider       0.81
     beware         0.80
     consult        0.79
     subscribe      0.76
     avoid          0.76
     caution        0.75
     heed           0.75
     hesitate       0.75
     check          0.74
     omit           0.72
    Activation Density: 15.248%

    No Known Activations