INDEX
    Explanations

    expressions of positive or negative emotional reactions

    New Auto-Interp
    Negative Logits
    aver
    -0.16
    ordes
    -0.15
    u
    -0.14
    /from
    -0.14
    acades
    -0.14
    iry
    -0.14
    NR
    -0.14
    adge
    -0.13
    edException
    -0.13
    afort
    -0.13
    POSITIVE LOGITS
    ness
    0.20
    NESS
    0.19
    HeaderCode
    0.16
    GuidId
    0.16
    .nih
    0.16
     Král
    0.15
     ENTRY
    0.14
    -looking
    0.14
    iye
    0.14
     capt
    0.14
    Act Density 0.156%

    No Known Activations