INDEX
    Explanations

    mentions of political controversy or conflict

    instances of negativity or criticism

    New Auto-Interp
    Negative Logits
     detached
    -0.76
     Sakuya
    -0.76
     guiActiveUnfocused
    -0.75
     Eisen
    -0.72
     Niet
    -0.70
     fragmentation
    -0.68
     Sapphire
    -0.68
     mosqu
    -0.68
     Elys
    -0.64
     bombard
    -0.64
    POSITIVE LOGITS
    ł
    1.12
    IJ
    1.10
    ¹
    1.08
    ª
    1.08
    ij
    1.03
    £
    1.00
    Ĵ
    0.99
    ı
    0.97
    âĸº
    0.97
    ¸
    0.93
    Act Density 0.200%

    No Known Activations