INDEX
    Explanations

    inconsistencies or contradictions in written text

    New Auto-Interp
    Negative Logits
     Rumble
    -0.77
    ocene
    -0.66
     Stockholm
    -0.61
     bang
    -0.61
     Dota
    -0.59
     Ventura
    -0.58
    stay
    -0.57
     genuinely
    -0.56
    ELY
    -0.54
    GET
    -0.54
    POSITIVE LOGITS
    forward
    0.81
     situated
    0.80
     inclined
    0.79
    quartered
    0.77
     minded
    0.76
    apy
    0.74
    etheless
    0.73
    tainment
    0.72
    leep
    0.72
    nown
    0.70
    Act Density 0.024%

    No Known Activations