INDEX
    Explanations

    competitive with other models

    New Auto-Interp
    Negative Logits
     indign
    0.40
     извър
    0.40
    asang
    0.39
     откри
    0.37
     विद्यार्थ्यांनी
    0.37
     exclaimed
    0.36
    mounted
    0.36
    ழ்த்த
    0.36
     पीड़िता
    0.35
    רת
    0.35
    POSITIVE LOGITS
     большинства
    0.50
     tradicion
    0.45
     indie
    0.45
     اکثر
    0.44
     traditionally
    0.44
     generell
    0.44
     khi
    0.44
     SOME
    0.41
     longstanding
    0.41
     future
    0.41
    Act Density 0.006%

    No Known Activations