    Negative Logits
     onCancelled  -0.58
    URLException  -0.56
     Даль         -0.52
     insegn       -0.51
     pamamagitan  -0.51
     solares      -0.49
     courses      -0.49
     Krakowie     -0.48
     oreilles     -0.48
     Moscú        -0.48

    Positive Logits
     #             1.10
     hashtag       0.92
    Datuak         0.89
    )#             0.84
    .#             0.83
     \#            0.83
     Hashtag       0.82
     "#            0.82
                   0.80
     hashtags      0.79

    Activation Density: 0.200%

    No Known Activations