INDEX
    Explanations

    references to teamwork and collaboration in competitive contexts

    New Auto-Interp
    Negative Logits
    etta
    -0.15
     immune
    -0.14
    èįIJ
    -0.14
    禮
    -0.14
    rlen
    -0.13
    antz
    -0.13
    Kit
    -0.13
    stretch
    -0.13
    immune
    -0.13
    aget
    -0.13
    POSITIVE LOGITS
    zzo
    0.19
     train
    0.17
    Capital
    0.16
     trains
    0.16
     Capital
    0.16
     played
    0.15
    Callbacks
    0.15
     tÃŃn
    0.15
    isci
    0.15
     Qualified
    0.15
    Act Density 0.076%

    No Known Activations