INDEX
    Explanations

    terms related to politics and political commentary

    New Auto-Interp
    Negative Logits
    encil
    -0.16
    ITOR
    -0.16
    ιακ
    -0.16
    anova
    -0.14
    entials
    -0.14
     Vinci
    -0.14
    ego
    -0.14
    legg
    -0.14
    .sym
    -0.14
    ANCE
    -0.14
    POSITIVE LOGITS
    ische
    0.36
    ischen
    0.32
    isch
    0.32
    ischer
    0.32
    isches
    0.30
    ches
    0.21
    ishes
    0.19
    ické
    0.19
    ycz
    0.19
    iker
    0.18
    Act Density 0.043%

    No Known Activations