INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Had
    -0.09
     anne
    -0.08
     настоя
    -0.08
     Glock
    -0.08
    prote
    -0.08
     આજ
    -0.07
     previs
    -0.07
    ANN
    -0.07
     إخ
    -0.07
     ό
    -0.07
    POSITIVE LOGITS
    🏼
    0.09
     tactics
    0.08
     aggressively
    0.08
     جدًا
    0.07
    0.07
    主动
    0.07
    active
    0.07
     abandonment
    0.07
     abandon
    0.07
     agress
    0.07
    Act Density 0.006%

    No Known Activations