INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ordinate
    2.13
    elere
    2.11
     vẫn
    2.10
     lateinit
    2.01
     sheen
    1.98
     carbonyl
    1.98
     Maarten
    1.97
    ğraf
    1.95
     pituitary
    1.93
     Colchester
    1.92
    POSITIVE LOGITS
    й
    1.95
    ی
    1.93
    ना
    1.92
    க்கும்
    1.90
    ى
    1.81
    1.76
    ीन
    1.72
    тельно
    1.69
    ри
    1.65
    1.63
    Act Density 0.001%

    No Known Activations