INDEX
    Explanations

    references to group dynamics and cooperation

    Negative Logits
    "emarks"        -0.17
    "zik"           -0.17
    "serter"        -0.16
    "obraz"         -0.16
    "ovice"         -0.16
    " ABC"          -0.15
    " mrb"          -0.14
    "vertis"        -0.14
    " Discovery"    -0.14
    "icolor"        -0.14

    Positive Logits
    "geber"         0.17
    " Carr"         0.16
    " Weiner"       0.16
    " Juda"         0.15
    "ieri"          0.15
    "anst"          0.14
    "inger"         0.14
    "/Internal"     0.14
    " ëĵ"           0.14
    "NonNull"       0.14

    Act Density 0.009%

    No Known Activations