INDEX
    Explanations

    conjunctions and the use of phrases that emphasize continuity or inclusion

    New Auto-Interp
    Negative Logits
     THEN
    -0.16
     denen
    -0.14
     then
    -0.14
    IFE
    -0.14
     implication
    -0.14
    msp
    -0.14
    aso
    -0.14
     Then
    -0.13
    THEN
    -0.13
    agt
    -0.13
    POSITIVE LOGITS
     is
    0.36
     has
    0.33
     can
    0.25
     was
    0.25
     will
    0.24
     should
    0.23
     may
    0.21
     ÑıвлÑıеÑĤÑģÑı
    0.21
     could
    0.20
     although
    0.20
    Act Density 0.492%

    No Known Activations