INDEX
    Explanations

    negative sentiments or opposing viewpoints

    words ending in "ation" or "rated"

    Negative Logits
     outer
    -0.51
    日閲覧
    -0.50
    êque
    -0.50
     gut
    -0.49
     Bhatt
    -0.48
    nste
    -0.48
     delivery
    -0.48
     جریان
    -0.47
     courant
    -0.46
    ook
    -0.46
    Positive Logits
     المعيارى
    0.78
    ########.
    0.73
    TagMode
    0.68
    EndInit
    0.67
    PerformLayout
    0.66
    Personensuche
    0.65
     <=",
    0.64
     للاسماء
    0.60
     surla
    0.59
     censiti
    0.56
    Activation Density: 0.053%

    No Known Activations