INDEX
    Explanations

    instances of reported speech or statements in the text

    New Auto-Interp
    Negative Logits
    avier
    -0.17
     Alv
    -0.15
    oes
    -0.14
     personally
    -0.14
    оÑĤÑĮ
    -0.14
    thur
    -0.14
    asz
    -0.14
     opposite
    -0.14
    avy
    -0.14
    existent
    -0.14
    POSITIVE LOGITS
    rana
    0.18
    unta
    0.17
    ctxt
    0.15
    wart
    0.15
    ycastle
    0.15
    edList
    0.15
    vä
    0.14
    ืà¸Ńà¸Ķ
    0.14
    980
    0.14
    __,__
    0.14
    Act Density 0.075%

    No Known Activations