INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    onut
    -0.67
    antha
    -0.62
    UNCH
    -0.60
    zar
    -0.59
    zan
    -0.58
    anche
    -0.58
    ilde
    -0.58
    ADRA
    -0.58
    elist
    -0.58
     largeDownload
    -0.57
    POSITIVE LOGITS
     enough
    0.96
    izable
    0.90
     anymore
    0.88
    enough
    0.82
     Enough
    0.81
    .''.
    0.77
     unless
    0.77
    isable
    0.76
    ?,
    0.73
    ;
    0.72
    Act Density 0.494%

    No Known Activations