INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     кислот
    -0.07
    PREFIX
    -0.06
     Gingrich
    -0.06
    (extension
    -0.06
     ISIL
    -0.06
    퓨터
    -0.06
     unbearable
    -0.06
    ственное
    -0.06
     khác
    -0.06
    px
    -0.06
    POSITIVE LOGITS
     ژانویه
    0.06
    (笑
    0.06
    0.06
    иров
    0.06
     castle
    0.06
    AREN
    0.06
     Keystone
    0.06
    ESTAMP
    0.06
    Tweet
    0.06
    	level
    0.06
    Act Density 0.002%

    No Known Activations