    Explanations

    numerical data and statistical representations

    Negative Logits
    (missing)         -0.68
    isissez           -0.68
     referrerpolicy   -0.66
    setDo             -0.65
    npmjs             -0.65
     religieuses      -0.63
     verticales       -0.63
     normaux          -0.62
     humaines         -0.62
     tarko            -0.62
    Positive Logits
    tagHelperRunner   0.70
     للمعارف          0.65
    tagHelper         0.57
    الإنجليزية        0.56
    \{\\              0.55
    CITATION          0.53
    7                 0.50
     Lena             0.50
     sta              0.50
     liga             0.49
    Activation Density 0.676%

    No Known Activations