INDEX
    Explanations

    references to the pronoun "they"

    New Auto-Interp
    Negative Logits
     themselves
    -0.85
    Their
    -0.82
    GraphicsUnit
    -0.81
     their
    -0.81
     Their
    -0.75
    their
    -0.74
    WireFormatLite
    -0.68
     <<<<<<<<<<<<<<
    -0.66
    FileChooser
    -0.66
     THEIR
    -0.64
    POSITIVE LOGITS
    bnf
    0.62
    ISMS
    0.56
    :].
    0.56
    exels
    0.56
    elry
    0.54
    urable
    0.53
    ABASES
    0.53
     وتسجيلات
    0.52
    thouses
    0.52
     aikana
    0.51
    Act Density 0.031%

    No Known Activations