INDEX
    Explanations

    Hebrew words or characters

    Hebrew letters or characters

    New Auto-Interp
    Negative Logits
    hell
    -0.78
    aston
    -0.77
    ophon
    -0.76
    alien
    -0.73
    arding
    -0.72
    kamp
    -0.70
    alore
    -0.70
    atos
    -0.69
    amia
    -0.68
    oons
    -0.68
    POSITIVE LOGITS
    ño
    0.75
    Ľ
    0.74
    BLIC
    0.74
    ãĥ¼ãĥĨ
    0.72
     partName
    0.70
    Äĩ
    0.70
    å§«
    0.66
    ׾
    0.66
    lda
    0.66
    odcast
    0.66
    Act Density 0.034%

    No Known Activations