INDEX
    Explanations

    references to the field of social sciences

    New Auto-Interp
    Negative Logits
    )(((
    -0.16
     Elias
    -0.15
    ecast
    -0.15
    pur
    -0.15
    asics
    -0.15
    chsel
    -0.15
    zero
    -0.14
    egal
    -0.14
    atters
    -0.14
    iguous
    -0.14
    POSITIVE LOGITS
    deniz
    0.18
    ednou
    0.17
    OrNil
    0.15
    rej
    0.14
    arih
    0.14
    BAT
    0.14
    fe
    0.14
    ftware
    0.14
    olis
    0.14
    jal
    0.14
    Act Density 0.010%

    No Known Activations