INDEX
    Explanations

    mentions of specific names, especially related to political scandals

    New Auto-Interp
    Negative Logits
    -0.53
    lift
    -0.52
    -0.49
    uatu
    -0.49
    Ralph
    -0.47
    Cos
    -0.47
     Ralph
    -0.47
    Sheffield
    -0.43
    Marvel
    -0.43
     Cos
    -0.42
    POSITIVE LOGITS
     steven
    0.96
     STEVEN
    0.88
     Nixon
    0.88
    steven
    0.81
    Nixon
    0.78
     Steven
    0.74
    Steven
    0.73
     Whence
    0.70
     Stevenson
    0.66
     pavillon
    0.66
    Act Density 0.275%

    No Known Activations