INDEX
    Explanations

    phrases indicating collaboration and teamwork

    Negative Logits
     revis      -0.18
    nemonic     -0.17
     brib       -0.15
     ampl       -0.15
    ç¤          -0.15
     amplify    -0.15
     starring   -0.14
     travers    -0.14
     annot      -0.14
     dealloc    -0.14
    Positive Logits
     be         0.26
     get        0.25
     conduct    0.23
     become     0.23
     make       0.22
     start      0.21
     take       0.20
     add        0.20
     perform    0.19
     give       0.19
    Activation Density: 1.079%

    No Known Activations