    Explanations

    words related to political ideologies

    references to right-wing political affiliations or ideologies

    Negative Logits
    Token            Logit
    " Remastered"    -0.70
    " cellul"        -0.68
    "DAQ"            -0.67
    "atable"         -0.67
    "arette"         -0.67
    "覚醒"           -0.66
    "arettes"        -0.65
    "ulative"        -0.64
    " Mehran"        -0.62
    "aples"          -0.61
    Positive Logits
    Token            Logit
    "eous"           1.35
    " wing"          1.07
    "wing"           1.06
    " winger"        0.99
    " flank"         0.94
    " fielder"       0.92
    "ward"           0.90
    "move"           0.83
    " handed"        0.83
    "most"           0.78
    Activation Density: 0.053%

    No Known Activations