    Negative Logits
    Token               Logit
    " khử"              0.43
    " Tencent"          0.42
    " hydrogenation"    0.41
    " defences"         0.41
    "ashop"             0.41
    "irchen"            0.39
    "arka"              0.39
    " delimiter"        0.39
    "ixi"               0.39
    " WeChat"           0.38
    Positive Logits
    Token               Logit
    " Iraq"             1.16
    "Iraq"              1.04
    " Irak"             0.97
    " عراق"             0.97
    " العراق"           0.95
    " wars"             0.90
    " Iraqi"            0.86
    " ইরা"              0.82
    " Afghanistan"      0.81
    "戦争"              0.76
    Activation Density: 0.007%

    No Known Activations