INDEX
    Explanations

    phrases indicating movement or existence within specific spaces or environments

    New Auto-Interp
    Negative Logits
    wy
    -0.16
    اÙ쨏
    -0.15
    dle
    -0.14
    edd
    -0.14
    oufl
    -0.14
    eters
    -0.14
    ukes
    -0.14
     Hoover
    -0.14
    bourg
    -0.14
    iasi
    -0.13
    POSITIVE LOGITS
     town
    0.19
    ä¹±
    0.16
    æĪ
    0.16
    urope
    0.16
    otland
    0.15
    .testing
    0.15
    -town
    0.15
    /down
    0.15
    jd
    0.15
    town
    0.14
    Act Density 0.031%

    No Known Activations