INDEX
    Explanations

    pronouns indicating unidentified entities or situations

    New Auto-Interp
    Negative Logits
    DAQ
    -0.76
    ROR
    -0.64
    ZA
    -0.62
    IVE
    -0.58
    IONS
    -0.56
    pmwiki
    -0.55
    natureconservancy
    -0.54
    EMBER
    -0.54
    UG
    -0.54
    ALE
    -0.53
    POSITIVE LOGITS
     it
    2.01
    it
    1.04
     It
    1.04
    It
    1.00
     its
    0.95
     there
    0.86
     they
    0.85
     this
    0.84
     everything
    0.80
     you
    0.78
    Act Density 0.319%

    No Known Activations