INDEX
    Explanations

    Foreign language

    Negative Logits
        Logit    Token
        -0.07
        -0.07
        -0.07
        -0.07    conta
        -0.07    zion
        -0.07    Expect
        -0.07
        -0.07    canada
        -0.07    忿
        -0.07    نسخ
    Positive Logits
        Logit    Token
        0.08     中国人民
        0.07     вн
        0.07     person
        0.07     дальн
        0.07
        0.06     M
        0.06     од
        0.06     (_
        0.06     记者表示
        0.06     Increased
    Act Density: 0.027%

    No Known Activations