INDEX
    Explanations

    mathematical notation and symbols, particularly in the context of equations and parameters

    New Auto-Interp
    Negative Logits
    RegressionTest
    -0.94
    клопе
    -0.87
     становника
    -0.86
     Himo
    -0.78
     Paglinawan
    -0.77
     actionMode
    -0.75
     Мексичка
    -0.75
     nahilalakip
    -0.74
     виправивши
    -0.74
    \{\\
    -0.73
    POSITIVE LOGITS
     sauvages
    0.52
     internationaux
    0.48
    0.48
     dieux
    0.48
     bonnes
    0.47
     médicaux
    0.47
     religieuses
    0.45
     blessés
    0.44
     numériques
    0.43
     chargés
    0.43
    Act Density 0.896%

    No Known Activations