INDEX
    Explanations

    pronouns referring to people or entities previously mentioned in the text

    references to people or groups in relation to their actions or characteristics

    New Auto-Interp
    Negative Logits
    AND
    -0.71
    onsequ
    -0.64
     Meaning
    -0.62
    ising
    -0.62
    PLIC
    -0.60
     Trop
    -0.59
    ont
    -0.58
    ounding
    -0.57
    istance
    -0.56
    Instruct
    -0.56
    POSITIVE LOGITS
     owns
    1.10
     specializes
    1.00
     specialize
    0.95
     participated
    0.92
     loves
    0.88
     understands
    0.87
     possesses
    0.86
     enjoys
    0.85
     cares
    0.84
     participates
    0.84
    Act Density 0.118%

    No Known Activations