INDEX
    Explanations
    Negative Logits
     خواهد: -0.07
     Data: -0.07
     한번: -0.07
    olut: -0.07
     احتم: -0.07
    -0.07
     migraine: -0.06
     rightly: -0.06
     Kok: -0.06
    .Func: -0.06
    Positive Logits
    ço: 0.06
    strom: 0.06
    	cat: 0.06
     servicio: 0.06
     clientele: 0.06
     i: 0.06
    oxic: 0.06
    beb: 0.06
    -input: 0.06
    ọi: 0.06
    Activation Density: 0.004%

    No Known Activations