INDEX
    Explanations

    phrases beginning with 'Each'

    references to individual contributions within a collective context

    New Auto-Interp
    Negative Logits
     quite
    -0.68
     anyway
    -0.65
     embarrass
    -0.61
     negative
    -0.60
     freak
    -0.59
     offensive
    -0.58
     misinterpret
    -0.58
     sentiment
    -0.58
     very
    -0.58
     probably
    -0.58
    POSITIVE LOGITS
     Each
    3.22
    Each
    2.36
    each
    2.27
     each
    1.90
     Every
    1.85
     Whenever
    1.44
    Every
    1.42
     Both
    1.40
     apiece
    1.36
     These
    1.32
    Act Density 0.014%

    No Known Activations