INDEX
    Explanations

    phrases expressing uncertainty or doubt

    Negative Logits
    Token              Logit
    " curiosidad"      -0.31
    "ându"             -0.30
    " özg"             -0.27
    "-"                -0.27
    " dovol"           -0.27
    " com"             -0.27
    " is"              -0.27
    " transición"      -0.26
    " ("               -0.26
    " conformidad"     -0.26
    Positive Logits
    Token              Logit
    "parsedMessage"     1.08
    "+#+#"              0.98
    " Signalez"         0.94
    "OGND"              0.91
    " EconPapers"       0.87
    "хьтан"             0.85
    "rungsseite"        0.83
    "IntoConstraints"   0.79
    "Autoritní"         0.79
    " kasarigan"        0.79
    Activation Density: 0.654%

    No Known Activations