INDEX
    Explanations

    phrases related to specific entities or concepts, possibly involving conflicts or controversies

    mentions of a specific character or entity symbolized by the character "Ļ"

    Negative Logits
     disadvant
    -0.75
     misunder
    -0.72
     mathemat
    -0.66
     contrace
    -0.65
     seiz
    -0.65
     condem
    -0.64
     Palestin
    -0.64
    merce
    -0.63
     regulation
    -0.63
    ozy
    -0.63
    Positive Logits
    ï¸ı
    1.48
    ï¸
    0.96
    âĹ
    0.94
     
    0.92
    Balt
    0.86
    âĸº
    0.84
    gypt
    0.83
    ¯¯
    0.83
    âĪ
    0.83
    âĻ
    0.82
    Act Density 0.449%

    No Known Activations