INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.09
    3:0.07
    4:0.09
    5:0.08
    6:0.06
    7:0.07
    8:0.08
    9:0.08
    10:0.08
    11:0.08
    Negative Logits
    reth
    -2.80
    rates
    -2.51
     comply
    -2.50
    �士
    -2.45
     explosives
    -2.39
     ingred
    -2.38
     scen
    -2.37
    gypt
    -2.36
    cyl
    -2.35
    Weapons
    -2.31
    POSITIVE LOGITS
     Santorum
    3.19
    edin
    2.79
     AMA
    2.72
     Libertarian
    2.70
     Maher
    2.63
     Rubin
    2.61
     Kaepernick
    2.60
     Krugman
    2.60
     Akin
    2.55
     ["
    2.54
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.