INDEX
    Explanations

    negations or words indicating contradiction

    New Auto-Interp
    Negative Logits
    ADOR
    -0.16
    DESC
    -0.15
    inkel
    -0.15
    ady
    -0.15
    hart
    -0.14
    NotSupportedException
    -0.14
    宿
    -0.14
    struct
    -0.13
    164
    -0.13
    Guy
    -0.13
    POSITIVE LOGITS
    ori
    0.19
    ched
    0.18
    tingham
    0.17
    epad
    0.17
    ches
    0.17
    abyrin
    0.16
    aira
    0.16
    ¤
    0.15
    zsche
    0.15
    CHK
    0.15
    Act Density 0.186%

    No Known Activations