INDEX
    Explanations

    discussions concerning personal responsibility and societal expectations

    New Auto-Interp
    Negative Logits
    hazi
    -0.15
    alu
    -0.15
    rán
    -0.14
     advanced
    -0.14
    вав
    -0.14
     Fallon
    -0.14
    ึà¸ĩ
    -0.14
    nda
    -0.14
     league
    -0.14
    aze
    -0.14
    POSITIVE LOGITS
    kers
    0.14
    TERS
    0.14
    "group
    0.13
    Ł
    0.13
    avers
    0.13
    ÂŃs
    0.13
     Ler
    0.13
     Linden
    0.13
    ingroup
    0.13
     же
    0.12
    Act Density 0.913%

    No Known Activations