    Explanations

    names and proper nouns of individuals

    Negative Logits
    Token               Logit
    " Pays"             -0.19
    "ils"               -0.15
    "ularity"           -0.15
    "zens"              -0.15
    " pays"             -0.14
    "ining"             -0.14
    "gings"             -0.14
    ".MixedReality"     -0.14
    "ym"                -0.14
    "ede"               -0.14

    Positive Logits
    Token               Logit
    " routes"           0.15
    " natives"          0.15
    "تب"                0.15
    "Barrier"           0.14
    "-corner"           0.14
    " dreaming"         0.14
    "절"                0.14
    "گان"               0.14
    "istrovství"        0.14
    " native"           0.13
    Activation Density: 0.070%

    No Known Activations