INDEX
    Explanations

    instances of text relating to political and social issues, including controversies, discrimination, scientific debates, and economic challenges

    Negative Logits
    Token                    Logit
    " Bris"                  -0.71
    " Sapphire"              -0.69
    " guiActiveUnfocused"    -0.68
    " indo"                  -0.65
    " Bengal"                -0.64
    "iewicz"                 -0.64
    "creen"                  -0.63
    " confines"              -0.63
    " detached"              -0.63
    " Opera"                 -0.63
    Positive Logits
    Token                    Logit
    "IJ"                     1.13
    "ª"                      1.12
    "¹"                      1.12
    "ł"                      1.09
    "Ĵ"                      1.08
    "ı"                      1.07
    "ij"                     1.03
    "£"                      0.94
    "³"                      0.93
    "certain"                0.91
    Activation Density: 0.144%

    No Known Activations