INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    )이
    -0.07
     فراهم
    -0.07
    عي
    -0.06
     або
    -0.06
    	flags
    -0.06
     sons
    -0.06
     diesen
    -0.06
    ulary
    -0.06
    -0.06
    elfast
    -0.06
    POSITIVE LOGITS
     one
    0.10
     welcome
    0.07
     المملكة
    0.07
     One
    0.07
     ~
    0.07
     incident
    0.07
     ifstream
    0.06
     guy
    0.06
     delegated
    0.06
     TIME
    0.06
    Act Density 0.011%

    No Known Activations