INDEX
    Explanations

    mentions of political figures and events

    New Auto-Interp
    Negative Logits
     GOODMAN
    -0.81
    OUP
    -0.72
     Drawn
    -0.71
    ãĤ¤ãĥĪ
    -0.69
     Fever
    -0.65
     Sabha
    -0.65
     Sawyer
    -0.64
     Flavoring
    -0.59
    MSN
    -0.59
    arity
    -0.59
    POSITIVE LOGITS
    clamation
    1.41
    orbit
    1.26
    uber
    1.26
    ogenous
    1.22
    terior
    1.14
    tern
    1.10
    clus
    1.08
    portation
    1.07
    clud
    1.07
    oplan
    1.07
    Act Density 0.012%

    No Known Activations