INDEX
    Explanations

    words related to misinformation and deceit

    New Auto-Interp
    Negative Logits
    principalTable
    -0.71
    LabelTagHelper
    -0.58
    AxisAlignment
    -0.53
    ActionCreators
    -0.52
    󠁢
    -0.50
    الإنجليزية
    -0.49
     Winaray
    -0.47
     océ
    -0.47
     grammi
    -0.46
    bootstrapcdn
    -0.46
    POSITIVE LOGITS
     viewers
    1.10
     audiences
    1.04
     readers
    1.04
     listeners
    0.98
    readers
    0.87
     fans
    0.86
     audience
    0.85
     visitors
    0.81
    audience
    0.81
     viewer
    0.80
    Act Density 0.346%

    No Known Activations