INDEX
    Explanations

    mentions of and discussions around the concept of 'fake news'

    tokens indicating the end of a text or section

    New Auto-Interp
    Negative Logits
    Dialogue
    -0.63
    alties
    -0.62
     ãĢĮ
    -0.60
    rises
    -0.59
     KR
    -0.59
    AAF
    -0.59
     Leilan
    -0.58
     Keller
    -0.57
    essage
    -0.56
    immer
    -0.56
    POSITIVE LOGITS
    usterity
    0.97
    tenance
    0.94
    "
    0.87
    `,
    0.86
    »
    0.83
    terday
    0.83
    \)
    0.82
    "!
    0.82
    '?
    0.81
    /,
    0.80
    Act Density 0.308%

    No Known Activations