INDEX
    Explanations

    references to mathematical theorems and definitions

    New Auto-Interp
    Negative Logits
    UserScript
    -1.03
     utafitiHapana
    -1.03
     ویکی‌آمباردا
    -0.95
     Superhost
    -0.91
     مرئيه
    -0.83
    tvguidetime
    -0.81
    UnsafeEnabled
    -0.80
    fjspx
    -0.80
     propOrder
    -0.79
    contentLoaded
    -0.79
    POSITIVE LOGITS
     or
    0.46
    dfrac
    0.44
     source
    0.44
    multicolumn
    0.43
     sources
    0.43
     Source
    0.43
    Source
    0.42
    textbf
    0.41
    0.41
    src
    0.40
    Act Density 0.023%

    No Known Activations