INDEX
    Explanations
    Negative Logits
    osta          -0.78
    aeda          -0.74
    venants       -0.73
    eger          -0.72
    CAST          -0.71
    ETF           -0.67
    ozy           -0.66
    riks          -0.66
    quished       -0.65
    ologically    -0.64
    Positive Logits
     Gaga          1.40
    bug            1.27
    bird           1.12
    bugs           1.11
    maid           0.95
    birds          0.91
    woman          0.85
    cup            0.83
    fing           0.82
    parts          0.82
    Act Density 0.026%

    No Known Activations