INDEX
    Explanations

    proper nouns, especially related to political figures and specific organizations

    references to pointing or directing attention towards specific subjects or individuals

    Head Attr Weights
    0:0.04
    1:0.02
    2:0.11
    3:0.06
    4:0.06
    5:0.06
    6:0.04
    7:0.03
    8:0.33
    9:0.13
    10:0.05
    11:0.02
    Negative Logits
    conservancy     -1.22
    soever          -1.21
    roy             -1.20
    interstitial    -1.18
     repaired       -1.15
    emouth          -1.08
    sembly          -1.08
    ashington       -1.08
     grapp          -1.06
    lymp            -1.06
    Positive Logits
    dial            1.31
     Genocide       1.22
    zinski          1.20
     causation      1.14
    TextColor       1.11
     veto           1.11
     flashing       1.10
    enance          1.09
    azar            1.09
    urai            1.07
    Activation Density: 0.046%

    No Known Activations