INDEX
    Explanations

    terms related to reduction or decline

    New Auto-Interp
    Negative Logits
    .”
    -1.26
    ?”
    -1.12
    !”
    -1.09
    ,”
    -1.09
    -1.02
    …”
    -0.97
    ).”
    -0.94
     “
    -0.94
    -0.94
    :”
    -0.91
    POSITIVE LOGITS
    verwijspagina
    1.40
     للاسماء
    1.06
     sèche
    1.00
    ^(@)
    0.97
    
    0.96
     NDL
    0.95
    adays
    0.93
     beginnetje
    0.93
    lehem
    0.91
    ModelSerializer
    0.91
    Act Density 0.204%

    No Known Activations