INDEX
    Explanations

    terms related to offensive language or content

    Negative Logits
    Token            Logit
    "SIGINT"         -0.53
    " Schar"         -0.44
    " center"        -0.43
    " magazine"      -0.42
    " controlled"    -0.42
    "MediaType"      -0.41
    " Controlled"    -0.41
    " Koch"          -0.41
    " Group"         -0.41
    " Center"        -0.41
    Positive Logits
    Token                Logit
    ":✨"                 1.05
    "DockStyle"          0.70
    "AsUp"               0.66
    "UnknownFieldSet"    0.66
    "Jeografia"          0.65
    " للاسماء"           0.65
    " näytte"            0.60
    "SharedCtor"         0.60
    "帖最后由"            0.60
    "offensive"          0.58
    Act Density: 0.322%

    No Known Activations