INDEX
    Explanations

    names of politicians or public figures

    names of public figures and events in a political context

    New Auto-Interp
    Negative Logits
    Contents
    -0.76
    strument
    -0.73
    material
    -0.70
    ÙĴ
    -0.69
     ().
    -0.69
    $.
    -0.68
    common
    -0.68
    Ire
    -0.66
     Morty
    -0.65
    Donnell
    -0.65
    POSITIVE LOGITS
     reacts
    1.17
     slams
    1.15
     Says
    1.09
     defends
    1.07
     shuts
    1.06
     approves
    1.02
     warns
    1.01
     refuses
    0.99
     denies
    0.99
     responds
    0.97
    Act Density 0.502%

    No Known Activations