INDEX
    Explanations

    specific instances or actions mentioned in a document

    Negative Logits
    Token                  Logit
    "natureconservancy"    -0.84
    "inventoryQuantity"    -0.82
    " Alive"               -0.74
    "iddler"               -0.68
    "ylon"                 -0.67
    " alive"               -0.65
    "urity"                -0.64
    "ingham"               -0.63
    " Plain"               -0.62
    " Trance"              -0.61
    Positive Logits
    Token           Logit
    " toward"       1.31
    " towards"      1.21
    " downwards"    0.94
    "irection"      0.93
    " Towards"      0.93
    "rils"          0.92
    "ggle"          0.87
    " squarely"     0.86
    " downward"     0.84
    "ges"           0.83
    Activation Density: 1.092%

    No Known Activations