INDEX
    Explanations

    words related to specific terms or concepts, such as 'God', 'marriage', 'fake news', 'civil', 'assassinate', and 'socialist'

    terms that have significant cultural, social, or political implications

    New Auto-Interp
    Negative Logits
     livest
    -0.62
     originals
    -0.61
     DEC
    -0.60
     traject
    -0.59
     incentives
    -0.59
     nightly
    -0.59
     archives
    -0.58
     DVDs
    -0.58
     Paige
    -0.58
     Exhibit
    -0.58
    POSITIVE LOGITS
    âĢİ
    1.05
     refers
    0.97
     denotes
    0.94
    É
    0.92
    ت
    0.89
    د
    0.88
    ر
    0.86
     implies
    0.85
    ب
    0.84
    notation
    0.83
    Act Density 0.200%

    No Known Activations