INDEX
    Explanations

    references to specific political figures and their affiliations

    Negative Logits
    agna              -0.17
    zas               -0.16
    .sax              -0.14
    orro              -0.14
    าà¸ĵ               -0.14
    obar              -0.13
     crowned          -0.13
     çIJ              -0.13
    ISIBLE            -0.13
    pike              -0.13
    Positive Logits
    .scalablytyped    0.16
    ockey             0.15
     distant          0.14
    /renderer         0.14
    代                0.14
    ariant            0.14
     cle              0.14
     mandates         0.13
    Parms             0.13
    763               0.13
    Activation Density: 0.034%

    No Known Activations