INDEX
    Explanations

    names and proper nouns related to legal, geographical, or political contexts

    New Auto-Interp
    Negative Logits
    lá
    -0.16
     Franti
    -0.15
    aby
    -0.14
    .localized
    -0.14
    (Have
    -0.13
    ovny
    -0.13
    ries
    -0.13
    ãĥĮ
    -0.13
    ót
    -0.13
    undry
    -0.13
    POSITIVE LOGITS
    ÌĨ
    0.15
    خت
    0.15
     von
    0.14
    usa
    0.13
    ÙĪگر
    0.13
    IC
    0.12
    ↵↵
    0.12
    .lib
    0.12
    USA
    0.12
     Dive
    0.12
    Act Density 0.206%

    No Known Activations