INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     At
    -0.06
     Sheikh
    -0.06
    Kh
    -0.06
                                                                                               
    -0.06
    At
    -0.06
    ії
    -0.06
    ней
    -0.06
                                                                               
    -0.06
    _archive
    -0.06
    .At
    -0.05
    POSITIVE LOGITS
    malı
    0.07
     diyor
    0.07
     Aydın
    0.07
     abyss
    0.07
     하지
    0.07
     akşam
    0.07
    lıyor
    0.07
     çevir
    0.06
    üyor
    0.06
    ุคคล
    0.06
    Act Density 0.130%

    No Known Activations