INDEX
    Explanations

    references to specific individuals, experiences, and products, particularly related to media and societal issues

    New Auto-Interp
    Negative Logits
     Infórmanos
    -0.79
    httphttps
    -0.77
     Administrativna
    -0.76
    msgTypes
    -0.71
     Numerade
    -0.69
    AndEndTag
    -0.68
    تقاوى
    -0.66
    ſammen
    -0.65
    InputTagHelper
    -0.64
    ########.
    -0.63
    POSITIVE LOGITS
     now
    0.91
     désormais
    0.81
     newly
    0.78
     ahora
    0.73
     теперь
    0.71
     nyní
    0.69
     teraz
    0.66
     recently
    0.66
     sekarang
    0.65
     maintenant
    0.63
    Act Density 0.697%

    No Known Activations