INDEX
    Explanations

    misinformation rumors

    New Auto-Interp
    Negative Logits
     సాధ
    -0.08
     Naast
    -0.08
     preferably
    -0.08
     proib
    -0.08
     disables
    -0.08
     FY
    -0.08
     ↵↵  ↵↵
    -0.08
     Disable
    -0.07
    无遮挡
    -0.07
     Dro
    -0.07
    POSITIVE LOGITS
     miscon
    0.13
     perceived
    0.11
     mistakenly
    0.11
     misunderstanding
    0.11
     anecd
    0.10
     errone
    0.10
     mistaken
    0.10
     inaccur
    0.10
     sensational
    0.10
     incorrectly
    0.10
    Act Density 0.039%

    No Known Activations