INDEX
    Explanations

    accusations of racism and hypocrisy in discussions or arguments

    New Auto-Interp
    Negative Logits
    ADOR
    -0.14
    ijd
    -0.14
    Vtbl
    -0.14
    URT
    -0.14
    ÑĸÑĢ
    -0.14
    annis
    -0.14
    aternity
    -0.13
    lore
    -0.13
    onden
    -0.13
    raž
    -0.13
    POSITIVE LOGITS
     trait
    0.24
     dangerous
    0.18
     baby
    0.17
     Baby
    0.17
    trait
    0.16
     too
    0.16
     dens
    0.16
     interrupt
    0.16
     li
    0.15
     shift
    0.15
    Act Density 0.211%

    No Known Activations