INDEX
    Explanations
    Negative Logits
    Token           Logit
    "leigh"         -0.88
    "pper"          -0.83
    "classified"    -0.76
    "hire"          -0.75
    "gar"           -0.75
    "assault"       -0.73
    "pmwiki"        -0.73
    "asus"          -0.72
    "pes"           -0.71
    "avis"          -0.71
    Positive Logits
    Token           Logit
    " hopeful"      1.21
    " optimism"     0.89
    " optimistic"   0.87
    " prospects"    0.81
    "eful"          0.72
    " Neh"          0.72
    " dialogue"     0.71
    " upbeat"       0.71
    "ial"           0.70
    " Prosper"      0.69
    Activation Density: 0.008%

    No Known Activations