INDEX
    Explanations

    instances involving specific names or numeric identifiers with additional context provided

    negative connotations associated with the United Nations or its officials

    New Auto-Interp
    Negative Logits
     ingred
    -0.74
     Dickinson
    -0.71
    DragonMagazine
    -0.70
    tenance
    -0.70
     Extras
    -0.65
    ãĤ¤ãĥĪ
    -0.65
    imore
    -0.65
     ILCS
    -0.63
    ktop
    -0.62
     Flavoring
    -0.62
    POSITIVE LOGITS
    direction
    1.00
    nom
    0.94
    gee
    0.92
    Ma
    0.90
    chan
    0.90
    Fi
    0.88
    shaped
    0.88
    talk
    0.85
    fi
    0.84
    La
    0.84
    Act Density 0.075%

    No Known Activations