INDEX
    Explanations

    phrases related to various current events and news topics

    phrases related to political events and their implications

    Negative Logits
    .]
    -0.75
    .).
    -0.74
    '.
    -0.71
    '."
    -0.70
     '.
    -0.68
    .'"
    -0.66
    ".
    -0.65
    ].
    -0.64
    !".
    -0.64
    !.
    -0.64
    Positive Logits
    âĢ
    1.79
     âĢ
    1.46
    âĢł
    1.34
    ãĢ
    1.29
    â
    1.13
    âľ
    1.08
    âĹ
    1.07
    *,
    1.06
    âĶ
    1.05
     âĶ
    1.05
    Act Density 1.272%

    No Known Activations
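
Several of the positive-logit tokens listed above (for example `âĢ`, `âĢł`, `ãĢ`) look like the printable stand-ins that a GPT-2-style byte-level BPE vocabulary uses for raw bytes. The dashboard does not say which tokenizer produced them, so that is an assumption. Under it, the sketch below rebuilds the standard byte-to-character table and maps a token string back to its bytes. The helper names `token_to_bytes` and `describe` are hypothetical and used only for illustration.

```python
def bytes_to_unicode():
    """Byte -> printable character table of GPT-2-style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


CHAR_TO_BYTE = {c: b for b, c in bytes_to_unicode().items()}
# Assumption: the dashboard shows a leading space literally rather than as "Ġ".
CHAR_TO_BYTE.setdefault(" ", 0x20)


def token_to_bytes(token: str) -> bytes:
    """Map a displayed token string back to the raw bytes it stands for."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


def describe(token: str) -> str:
    """Return the decoded text, or a hex dump if the bytes are incomplete UTF-8."""
    raw = token_to_bytes(token)
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return f"<partial UTF-8: {raw.hex(' ')}>"


if __name__ == "__main__":
    for tok in ["âĢ", " âĢ", "âĢł", "ãĢ", "â", "âľ", "âĹ", "*,", "âĶ", " âĶ"]:
        print(repr(tok), "->", describe(tok))
```

Under this assumption, `âĢł` decodes to the dagger character †, while `âĢ` is the two-byte prefix `e2 80` shared by the general punctuation block (curly quotes, dashes, ellipsis). On that reading, most of the top positive logits are incomplete multi-byte sequences, not standalone characters.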