INDEX
    Explanations

    phrases indicating origin or location of individuals or groups

    New Auto-Interp
    Negative Logits
    uche
    -0.17
    ané
    -0.16
    hower
    -0.16
    vrier
    -0.15
    RICT
    -0.14
    ertura
    -0.14
    ouri
    -0.14
    ør
    -0.14
    ovenant
    -0.14
    Readable
    -0.14
    POSITIVE LOGITS
    çĭ
    0.16
    iesz
    0.15
    -To
    0.15
    agner
    0.14
     ward
    0.14
    .flink
    0.14
     æķ
    0.14
    ison
    0.14
    yat
    0.14
     pady
    0.13
    Act Density 0.001%

    No Known Activations