INDEX
    Explanations

    phrases that indicate relationships or connections between ideas

    "and" followed by a negative word

    conjunctions and consequences

    New Auto-Interp
    Negative Logits
     ſtand
    -0.63
     ſou
    -0.57
     ſeveral
    -0.56
     disambiguazione
    -0.55
     Houſe
    -0.54
     pleaſure
    -0.52
     Inscrivez
    -0.52
     ſtre
    -0.52
    ſelf
    -0.51
    ſelves
    -0.50
    POSITIVE LOGITS
    verifyException
    0.42
    utilisons
    0.41
    thâu
    0.40
    pyx
    0.38
    قایناقلار
    0.36
     wanting
    0.35
    setViewportView
    0.35
     enx
    0.35
     jeito
    0.35
     autorytatywna
    0.34
    Act Density 0.487%

    No Known Activations