INDEX
    Explanations

    instances where entities make their first notable appearance or debut

    repeated mentions of possessive pronouns

    Negative Logits
    Token           Logit
    "hov"           -0.82
    "earchers"      -0.64
    " [];"          -0.61
    "DN"            -0.60
    "ibaba"         -0.59
    "Guy"           -0.58
    "Lago"          -0.57
    "wr"            -0.56
    " horizont"     -0.56
    "Gi"            -0.56
    Positive Logits
    Token           Logit
    " own"          1.18
    " debut"        0.88
    " stride"       0.77
    " impression"   0.77
    "selves"        0.76
    "self"          0.74
    " footing"      0.73
    " customary"    0.72
    "çİĭ"           0.72
    " displeasure"  0.70
    Activation Density: 0.043%

    No Known Activations