INDEX
    Explanations

    conjunctions, particularly the word "and"

    Negative Logits
    Token           Logit
    "®,"            -0.15
    "landers"       -0.15
    "ollider"       -0.14
    "edBy"          -0.14
    "orks"          -0.13
    "ities"         -0.13
    "/of"           -0.13
    "amp"           -0.13
    "agger"         -0.13
    "egt"           -0.13

    Positive Logits
    Token           Logit
    "istrovství"    0.18
    " 이는"          0.18
    " though"       0.18
    " zwar"         0.17
    "бо"            0.15
    " since"        0.15
    " although"     0.15
    " albeit"       0.15
    " while"        0.15
    " yet"          0.14

    Act Density 0.353%

    No Known Activations