INDEX
    Explanations

    references to mathematical equations or treatments within texts

    Mathematical or scientific expressions

    mathematical or academic terms

    New Auto-Interp
    Negative Logits
     Mal
    -0.35
     mal
    -0.34
     cou
    -0.32
    -0.31
     confusion
    -0.31
     Dars
    -0.30
     fundamental
    -0.29
     فت
    -0.29
     Elect
    -0.29
     cargos
    -0.29
    POSITIVE LOGITS
    [toxicity=0]
    1.16
    httphttps
    0.77
    Diweddarwch
    0.76
     ujednoznacz
    0.73
    0.73
    webElementXpaths
    0.72
    TypedDataSet
    0.70
     informée
    0.70
    期刊论文
    0.68
    expandindo
    0.67
    Act Density 2.182%

    No Known Activations