INDEX
    Explanations

    references to hierarchical status or ranking

    New Auto-Interp
    Negative Logits
    TERN
    -0.79
    ãģ¦
    -0.73
    SEE
    -0.69
    DAY
    -0.68
    look
    -0.67
    natureconservancy
    -0.65
     Pwr
    -0.64
    keeper
    -0.64
    manship
    -0.63
    giving
    -0.60
    POSITIVE LOGITS
    isions
    1.21
    ski
    1.08
    ille
    1.07
    ania
    1.07
    ision
    1.05
    ideos
    1.05
    itating
    0.99
    iew
    0.99
    oice
    0.96
    idge
    0.95
    Act Density 0.005%

    No Known Activations