INDEX
    Explanations

    statements from various public figures and their opinions on political matters

    Head Attr Weights
    0:0.03
    1:0.12
    2:0.06
    3:0.02
    4:0.03
    5:0.06
    6:0.06
    7:0.08
    8:0.14
    9:0.23
    10:0.05
    11:0.09
    Negative Logits
     backdrop
    -1.39
     gram
    -1.27
     vantage
    -1.21
     Gram
    -1.21
     Gry
    -1.19
    DAQ
    -1.15
     hinges
    -1.14
     competition
    -1.09
     bloodstream
    -1.08
    Gall
    -1.07
    Positive Logits
    Downloadha
    1.45
     "{
    1.43
     goodbye
    1.39
    psc
    1.37
    amera
    1.30
    omething
    1.30
    kef
    1.29
    anne
    1.27
    :]
    1.22
     "'
    1.21
    Act Density 0.002%

    No Known Activations