INDEX
    Explanations

    phrases indicating various levels of desirability or importance

    phrases expressing opinions or judgments about various situations or actions

    New Auto-Interp
    Negative Logits
    acca
    -0.66
    onut
    -0.63
     dramas
    -0.60
    ership
    -0.60
    plates
    -0.59
    illa
    -0.57
    internal
    -0.57
    pread
    -0.56
    olulu
    -0.55
    opus
    -0.54
    POSITIVE LOGITS
     TAMADRA
    0.70
    natureconservancy
    0.68
     sooner
    0.66
    acted
    0.65
     feas
    0.65
    role
    0.64
     someday
    0.62
    SPONSORED
    0.62
    unic
    0.61
    keyes
    0.61
    Act Density 0.328%

    No Known Activations