INDEX
    Explanations

    mentions of personal attributes and characteristics in a narrative context

    New Auto-Interp
    Negative Logits
    pong
    -0.15
    úa
    -0.15
    rung
    -0.15
    ieu
    -0.14
    itm
    -0.14
    ueva
    -0.14
    lauf
    -0.14
    helm
    -0.14
     hei
    -0.14
    ugo
    -0.13
    POSITIVE LOGITS
    ANJI
    0.15
    å¤Ħ
    0.14
    alion
    0.14
     colleg
    0.14
     заÑĢаз
    0.13
    -animate
    0.13
     CWE
    0.13
    intel
    0.13
     genuine
    0.13
     projector
    0.13
    Act Density 0.042%

    No Known Activations