INDEX
    Explanations

    statements indicating clarification or acknowledgment of facts

    New Auto-Interp
    Negative Logits
    illow
    -0.15
    NIL
    -0.15
    ç¡
    -0.15
    itas
    -0.14
    factor
    -0.14
    LOUR
    -0.14
    oples
    -0.14
    alyzer
    -0.14
    loom
    -0.13
    (coder
    -0.13
    POSITIVE LOGITS
     noted
    0.17
     clear
    0.17
     Note
    0.17
    anus
    0.16
     arg
    0.16
     note
    0.16
     consensus
    0.15
     established
    0.15
    rit
    0.14
     understood
    0.14
    Act Density 0.117%

    No Known Activations