INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     congress
    -0.07
     Sanchez
    -0.07
     ICE
    -0.07
     Baker
    -0.07
    unicip
    -0.07
     commence
    -0.07
    buch
    -0.07
    _indx
    -0.07
    isch
    -0.07
     bench
    -0.07
    POSITIVE LOGITS
     feel
    0.17
     felt
    0.15
     feels
    0.13
    felt
    0.13
    Feel
    0.13
     Feel
    0.12
     feeling
    0.11
    feel
    0.10
     Feeling
    0.10
    Feels
    0.10
    Act Density 0.047%

    No Known Activations