INDEX
    Explanations

    TV episodes

    New Auto-Interp
    Negative Logits
    		
    -0.07
     کوه
    -0.07
     Adrian
    -0.06
     Forum
    -0.06
     coast
    -0.06
    owan
    -0.06
     Conspiracy
    -0.06
     wrought
    -0.06
     fingert
    -0.06
    ує
    -0.06
    POSITIVE LOGITS
     Numerous
    0.07
     طلب
    0.07
    alerts
    0.07
    .convert
    0.06
    _la
    0.06
    :@""
    0.06
     souhlas
    0.06
    ाउ
    0.06
    _che
    0.06
    .status
    0.06
    Act Density 0.022%

    No Known Activations