INDEX
    Explanations

    attends to the word "to" from the word "contradiction" in logical contexts

    New Auto-Interp
    Head Attr Weights
    0:0.11
    1:0.13
    2:0.11
    3:0.06
    4:0.04
    5:0.02
    6:0.05
    7:0.44
    Negative Logits
    AndEndTag
    -0.38
     &___
    -0.31
    WebVitals
    -0.28
    Према
    -0.28
     beginnetje
    -0.26
     AssemblyCulture
    -0.26
    NOPQRST
    -0.26
    icose
    -0.25
    ьаж
    -0.24
     Dostupné
    -0.24
    POSITIVE LOGITS
    gdx
    0.34
     disambiguazione
    0.28
     Houſe
    0.28
     esternos
    0.28
     Selig
    0.28
    ">>
    0.28
    ddots
    0.27
    WLAN
    0.26
     enceinte
    0.26
     arşivlendi
    0.26
    Act Density 0.455%

    No Known Activations