INDEX
    Explanations

    pronouns and possessive forms indicating personal experiences or ownership

    New Auto-Interp
    Negative Logits
    xl
    -0.07
       
    -0.07
    as
    -0.06
     Arb
    -0.06
    rq
    -0.06
    ektor
    -0.06
    ovat
    -0.05
     apt
    -0.05
    utschein
    -0.05
     Lager
    -0.05
    POSITIVE LOGITS
    äºĮ人
    0.08
    ãĥ³ãĥĶ
    0.07
    fcn
    0.07
    kara
    0.07
    .hd
    0.07
    ÑĢÑĥÑĩ
    0.07
     åı·
    0.07
    åĿ¦
    0.07
    styleType
    0.07
    isposable
    0.07
    Act Density 0.013%

    No Known Activations