INDEX
    Explanations

    describing qualities or states

    New Auto-Interp
    Negative Logits
    cretsiz
    0.46
     Whenever
    0.42
     Khi
    0.42
     Ihrem
    0.42
    when
    0.41
     Zero
    0.40
     ناقابل
    0.40
    គ្មាន
    0.39
     مطم
    0.39
     облег
    0.39
    POSITIVE LOGITS
    ERS
    0.51
     instabilities
    0.48
    IFIC
    0.47
    EDED
    0.47
    ാനും
    0.46
    อะ
    0.44
     planification
    0.44
    ЕНИ
    0.44
    ERRE
    0.44
     opos
    0.43
    Act Density 0.003%

    No Known Activations