INDEX
    Explanations

    proper nouns or names

    references to news agencies and photo credits

    New Auto-Interp
    Negative Logits
    FML
    -0.71
     Warcraft
    -0.65
     Transformers
    -0.64
     Haram
    -0.63
    pard
    -0.60
    keley
    -0.59
    lbs
    -0.57
     LSD
    -0.55
    addons
    -0.54
     roses
    -0.53
    POSITIVE LOGITS
    Rap
    0.70
    odox
    0.70
    senal
    0.65
    icio
    0.64
    Balt
    0.64
    ãĤ¯
    0.62
    onde
    0.61
    Í
    0.61
    imil
    0.61
     ]
    0.60
    Act Density 0.180%

    No Known Activations