INDEX
    Explanations

    possessive pronouns and expressions of personal ownership or identity

    Negative Logits
    "તા"              -0.54
    "WriteLiteral"    -0.49
    "ptid"            -0.47
    "JoinColumn"      -0.47
    "ędz"             -0.44
    "persky"          -0.44
    "Rhestr"          -0.43
    "ตน"              -0.43
    "ądź"             -0.43
    "antara"          -0.43
    Positive Logits
    " fault"             0.74
    "BagLayout"          0.73
    " EconPapers"        0.70
    " responsibility"    0.68
    " greatest"          0.64
    " undoing"           0.64
    " snippetHide"       0.63
    " proudest"          0.62
    " weakness"          0.61
    " biggest"           0.61
    Activation Density: 0.131%

    No Known Activations