INDEX
    Explanations

    words associated with conflict, judgment, and power dynamics

    New Auto-Interp
    Negative Logits
    Архівовано
    -0.58
    UserScript
    -0.51
    iburg
    -0.46
    basicConfig
    -0.46
    -0.46
    providedIn
    -0.46
     désolés
    -0.45
     estimés
    -0.43
    awtextra
    -0.42
    [--
    -0.42
    POSITIVE LOGITS
    󠁮
    0.40
     pandemia
    0.38
     []:
    0.38
    ISupport
    0.38
     tooth
    0.37
     saites
    0.36
    abestanden
    0.35
     pola
    0.35
    rrggbb
    0.35
    <tfoot>
    0.34
    Act Density 0.030%

    No Known Activations