INDEX
    Explanations

    statements involving the concept of assumptions or presuppositions

    New Auto-Interp
    Negative Logits
    esian
    -0.17
    essler
    -0.17
    ened
    -0.17
    Ø©
    -0.16
    rome
    -0.16
    د
    -0.15
    IRST
    -0.15
    lier
    -0.14
    icens
    -0.14
    bird
    -0.14
    POSITIVE LOGITS
    ively
    0.20
    ably
    0.19
     Assumes
    0.16
    essor
    0.16
    /assert
    0.16
    ptions
    0.15
    632
    0.15
    /request
    0.15
    696
    0.15
    oeff
    0.15
    Act Density 0.040%

    No Known Activations