INDEX
    Explanations

    phrases related to accusations of dishonesty or impropriety

    New Auto-Interp
    Negative Logits
    featureID
    -0.55
    rens
    -0.53
    empre
    -0.53
    -0.52
     vieles
    -0.51
    inghouse
    -0.50
     veng
    -0.50
    queryInterface
    -0.49
     clearfix
    -0.49
    ScrollPane
    -0.47
    POSITIVE LOGITS
     people
    0.86
     fellow
    0.76
     whoever
    0.76
    onAttach
    0.75
     whome
    0.73
     me
    0.69
    ผู้
    0.68
     us
    0.68
     anyone
    0.67
    raszam
    0.66
    Act Density 4.072%

    No Known Activations