INDEX
    Explanations

    proper nouns, particularly those associated with religious figures and locations

    Negative Logits
    izmet              -0.15
    senal              -0.15
    ãĤĥ                -0.14
    átis               -0.14
    ince               -0.14
    kke                -0.14
    pline              -0.14
     WaitForSeconds    -0.13
     رÙħ               -0.13
    -ST                -0.13
    Positive Logits
     Mir               0.16
     intens            0.15
     Sau               0.14
    ẩn                 0.14
     already           0.14
     numbered          0.13
    orden              0.13
     '                 0.13
     Gu                0.13
     bite              0.13
    Activation Density: 0.312%

    No Known Activations