INDEX
    Explanations

    sentences containing assertions or opinions about media credibility and value

    Negative Logits
    "ertino"    -0.15
    " inst"     -0.15
    "inst"      -0.15
    "bol"       -0.15
    "Äĥm"       -0.15
    "iez"       -0.14
    "rud"       -0.14
    "mons"      -0.14
    " Bol"      -0.13
    " beep"     -0.13
    Positive Logits
    " Ñĥв"          0.15
    "ãģ¡ãĤĩ"        0.15
    "WER"           0.14
    "ldr"           0.14
    "WithTitle"     0.14
    "zym"           0.14
    "uther"         0.14
    "ADB"           0.14
    "raries"        0.14
    ".postMessage"  0.14
    Act Density 0.120%

    No Known Activations