INDEX
    Explanations
    Negative Logits
    sterious           -0.76
    談社               -0.73
    AndEndTag          -0.72
    transQ             -0.71
    منابع              -0.70
     doInBackground    -0.70
     Mémoires          -0.70
    RectangleBorder    -0.67
    ToScroll           -0.67
     Monfieur          -0.67
    Positive Logits
    prnewswire     0.51
    Revenir        0.49
     Vikipedi      0.42
    спеди          0.41
    KeepAlive      0.40
     <             0.40
    selaer         0.40
     Inc           0.40
     slalom        0.39
     loudest       0.38
    Activation Density: 0.557%

    No Known Activations