INDEX
    Explanations

    phrases related to government actions and policy decisions

    Negative Logits
     jadx
    -0.15
     odense
    -0.15
     тоже
    -0.14
     similarly
    -0.14
     Bieber
    -0.14
     aalborg
    -0.13
     Kaepernick
    -0.13
    那种
    -0.13
    ذلك
    -0.13
    ＠
    -0.13
    Positive Logits
    *
    0.15
    [
    0.15
    0.14
    isce
    0.14
    agenta
    0.14
    !--
    0.14
    "
    0.14
    ansk
    0.13
    iming
    0.13
     recently
    0.13
    Act Density 0.622%

    No Known Activations