INDEX
    Explanations

    instances of the word "express" and its variants, as well as words related to extremism and exploitation

    New Auto-Interp
    Negative Logits
    fjspx
    -0.58
     executive
    -0.56
    TypedDataSet
    -0.56
    executive
    -0.55
     Мексичка
    -0.55
     expériment
    -0.54
     CreateTagHelper
    -0.54
    findpost
    -0.54
    Expert
    -0.53
     Audiodateien
    -0.53
    POSITIVE LOGITS
     viewDidLoad
    0.63
    IONS
    0.60
    perience
    0.58
    ventude
    0.55
    tagHelperRunner
    0.54
     Ră
    0.51
    ation
    0.51
    Orsay
    0.51
    treme
    0.49
     ciś
    0.49
    Act Density 0.378%

    No Known Activations