INDEX
    Explanations

    instances of strong contrasts or transitions in text

    New Auto-Interp
    Negative Logits
    chen
    -0.18
    ни
    -0.16
     therefore
    -0.15
    amo
    -0.14
    ining
    -0.14
    dd
    -0.14
    :,
    -0.13
    mage
    -0.13
    o
    -0.13
    chef
    -0.13
    POSITIVE LOGITS
    że
    0.21
    tery
    0.16
    leyen
    0.14
    Ñľ
    0.14
    -syntax
    0.14
    abol
    0.14
    arith
    0.13
     wenn
    0.13
    XMLElement
    0.13
    artz
    0.13
    Act Density 0.036%

    No Known Activations