INDEX
    Explanations

    expressions indicating suggestion or permission

    New Auto-Interp
    Negative Logits
    Zen
    -0.66
     Languages
    -0.64
    natureconservancy
    -0.60
    availability
    -0.58
    liest
    -0.56
    cill
    -0.56
    cled
    -0.55
    atana
    -0.55
    ombat
    -0.55
    band
    -0.55
    POSITIVE LOGITS
    tered
    1.17
    icia
    1.02
    tering
    0.98
    itia
    0.95
    ting
    0.81
     us
    0.79
     loose
    0.76
     me
    0.71
    ugal
    0.69
     go
    0.69
    Act Density 0.952%

    No Known Activations