INDEX
    Explanations

    key terms and phrases that indicate personal perspective or engagement in a narrative

    New Auto-Interp
    Negative Logits
    ä¹ħ
    -0.16
     leng
    -0.15
    acman
    -0.14
    ONO
    -0.14
    ł
    -0.14
    ides
    -0.14
    _Description
    -0.14
    渡
    -0.14
    acias
    -0.14
    elter
    -0.14
    POSITIVE LOGITS
    APE
    0.15
    öz
    0.14
    Destroy
    0.14
    yal
    0.14
     STREAM
    0.14
    ouro
    0.14
    fdc
    0.14
    offline
    0.14
     Beaver
    0.14
    jom
    0.13
    Act Density 0.001%

    No Known Activations