INDEX
    Explanations

    references to personal pronouns and their possessive forms

    New Auto-Interp
    Negative Logits
     utafitiHapana
    -0.53
    InjectAttribute
    -0.51
    SequentialGroup
    -0.49
    fjspx
    -0.49
    (!__
    -0.47
    цездатний
    -0.47
    BufferException
    -0.46
     referrerpolicy
    -0.45
     fråga
    -0.44
    styleType
    -0.44
    POSITIVE LOGITS
     Jego
    0.53
     deren
    0.52
     cujo
    0.49
     whose
    0.48
    oceros
    0.47
     그의
    0.47
     deres
    0.47
     ihres
    0.46
    deren
    0.46
     cuyo
    0.46
    Act Density 0.451%

    No Known Activations