INDEX
    Explanations

    syntactic structures and mathematical notation

    New Auto-Interp
    Negative Logits
     InputDecoration
    -0.89
     Мексичка
    -0.83
    Portail
    -0.82
    InjectAttribute
    -0.81
    下载附件
    -0.73
     nahilalakip
    -0.70
     للمعارف
    -0.70
    Portale
    -0.70
     חיצוניים
    -0.70
    ItemBackground
    -0.67
    POSITIVE LOGITS
    1
    1.15
    0.67
    0.61
    2
    0.51
     judiciales
    0.51
     anteriore
    0.50
     sanitaires
    0.48
     one
    0.48
     koning
    0.47
     delantera
    0.47
    Act Density 1.048%

    No Known Activations