    Explanations

    references to user management and moderation of online content

    Negative Logits
    Token                   Logit
    " MainAxisSize"         -0.60
    " виправивши"           -0.59
    "Hentet"                -0.59
    "ographiques"           -0.58
    "errHandler"            -0.57
    " المعيارى"             -0.57
    "lech"                  -0.56
    "KommentareTeilen"      -0.55
    " juſ"                  -0.54
    "postIndex"             -0.54
    Positive Logits
    Token                   Logit
    " mod"                  1.27
    "mod"                   1.17
    " moder"                1.13
    "moder"                 1.11
    " Mod"                  1.11
    " Moder"                1.08
    " mods"                 1.05
    " moderation"           1.05
    "Mod"                   1.05
    " MODER"                1.03
    Activation Density 2.564%

    No Known Activations
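
The density figure above is presumably the share of sampled token positions on which this feature fires; the page itself does not say how it is computed. Under that assumption, a minimal sketch of the calculation might look like the following. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for one feature over 8 token positions.
example = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.7, 0.0]
print(f"{activation_density(example):.3%}")  # prints 25.000%
```

Read this way, a density of 2.564% would mean the feature is active on roughly one sampled token in 39.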