INDEX
    Explanations

    specific codes or numbers

    New Auto-Interp
    Negative Logits
    Word
    -0.68
    angle
    -0.68
    Orth
    -0.68
    Faites
    -0.68
     bài
    -0.68
    replace
    -0.66
    Angle
    -0.65
    itors
    -0.64
     Camila
    -0.64
    werker
    -0.63
    POSITIVE LOGITS
    wasi
    0.73
    Relevance
    0.68
    prov
    0.68
    MIG
    0.68
    Gul
    0.67
    Santi
    0.66
     ГУ
    0.65
    зор
    0.65
    immunity
    0.65
    Arist
    0.65
    Act Density 0.068%

    No Known Activations