INDEX
    Explanations

    phrases related to interactions and actions taken by different entities

    negations or instances of unfair treatment in various contexts

    New Auto-Interp
    Negative Logits
     Nare
    -0.63
    ocument
    -0.58
    nesday
    -0.58
    renheit
    -0.56
    FTWARE
    -0.56
    Queue
    -0.54
    ayson
    -0.53
    egu
    -0.53
    aea
    -0.53
    erenn
    -0.52
    POSITIVE LOGITS
    \":
    0.50
     awaits
    0.49
     thereof
    0.48
    ':
    0.48
     contagious
    0.48
    zbollah
    0.47
     awaited
    0.47
     everlasting
    0.46
     âĢº
    0.46
     ensuing
    0.45
    Act Density 0.387%

    No Known Activations