INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    akh
    -0.07
     района
    -0.07
    OH
    -0.06
     Jah
    -0.06
    价值
    -0.06
     nga
    -0.06
    sth
    -0.06
     모든
    -0.06
    adox
    -0.06
     Giang
    -0.06
    POSITIVE LOGITS
     sure
    0.10
     Unsure
    0.09
     unsure
    0.08
    isure
    0.07
     suspect
    0.07
    clear
    0.07
     Sure
    0.07
    وند
    0.07
    .ArrayAdapter
    0.07
    清楚
    0.07
    Act Density 0.012%

    No Known Activations