INDEX
    Explanations

    phrases indicating cause and effect

    the word "as" used in various contexts conveying comparison or effect

    New Auto-Interp
    Negative Logits
     whatsoever
    -0.69
    origin
    -0.61
    âϦ
    -0.61
    atis
    -0.60
    gger
    -0.57
    leans
    -0.56
    opian
    -0.56
    abouts
    -0.56
    regon
    -0.55
    ALLY
    -0.55
    POSITIVE LOGITS
    pects
    1.16
    ymm
    1.16
    semb
    1.13
    piring
    1.08
    ynchronous
    1.06
    bestos
    1.05
    phalt
    1.01
    piration
    0.99
     such
    0.95
    semble
    0.95
    Act Density 0.077%

    No Known Activations