INDEX
    Explanations

    time-related expressions and indications of communication

    New Auto-Interp
    Negative Logits
    ustin
    -0.14
    isce
    -0.14
    azi
    -0.14
     controversies
    -0.14
    owan
    -0.14
    etta
    -0.14
     Jak
    -0.14
    ÑĦиÑĨи
    -0.13
    amburger
    -0.13
    anke
    -0.13
    POSITIVE LOGITS
     mant
    0.17
     Mant
    0.16
    onse
    0.15
    itar
    0.14
    pis
    0.14
    rouw
    0.14
    tar
    0.14
    cce
    0.14
    .BackgroundColor
    0.14
    akis
    0.14
    Act Density 0.002%

    No Known Activations