INDEX
    Explanations

    protests, rallies, demonstrations

    New Auto-Interp
    Negative Logits
     Marc
    -0.07
    kraine
    -0.06
     thicker
    -0.06
    ernational
    -0.06
     učitel
    -0.06
    には
    -0.06
    itant
    -0.06
     Technology
    -0.06
     mating
    -0.06
     ж
    -0.06
    POSITIVE LOGITS
     diminish
    0.07
     místo
    0.07
     činnosti
    0.07
    imeType
    0.06
    	className
    0.06
    .layoutControl
    0.06
     цю
    0.06
     Increase
    0.06
     influences
    0.06
     Jetzt
    0.06
    Act Density 0.099%

    No Known Activations