INDEX
    Explanations

    nervousness

    NEGATIVE LOGITS
    inson         -0.07
    Teachers      -0.07
     minWidth     -0.06
    .argument     -0.06
                  -0.06
     northern     -0.06
    (done         -0.06
                  -0.06
    .back         -0.06
    [thread       -0.06
    POSITIVE LOGITS
     queer        0.07
    ouple         0.07
     hlas         0.07
    oji           0.07
    bservable     0.07
     Sinh         0.07
     frontend     0.06
    voie          0.06
     elevated     0.06
    _PRIVATE      0.06
    Act Density 0.152%

    No Known Activations