INDEX
    Explanations

    expressions of criticism or concern regarding media portrayals and public perception

    New Auto-Interp
    Negative Logits
     Seinfeld
    -0.27
     Règlement
    -0.24
     оп
    -0.24
    Viited
    -0.23
     frequently
    -0.22
    fohl
    -0.21
     explained
    -0.21
     pinched
    -0.21
     parada
    -0.21
     collections
    -0.21
    POSITIVE LOGITS
     EconPapers
    0.83
    httphttps
    0.77
    <unused55>
    0.75
    <unused8>
    0.75
    ſſung
    0.75
    <unused74>
    0.75
    <pad>
    0.75
    <unused14>
    0.74
    <unused3>
    0.74
    [@BOS@]
    0.74
    Act Density 0.450%

    No Known Activations