INDEX
    Explanations

    predeterminers and definite articles indicating specificity or importance

    New Auto-Interp
    Negative Logits
    olar
    -0.18
    ammen
    -0.15
    nock
    -0.14
    ined
    -0.14
     it
    -0.14
    vala
    -0.14
    InstanceState
    -0.14
    oken
    -0.14
    ré
    -0.14
    oup
    -0.14
    POSITIVE LOGITS
    arti
    0.15
    áÄį
    0.14
     wed
    0.14
    SAFE
    0.14
    aket
    0.14
    SFML
    0.13
     nightmares
    0.13
    vÄĽd
    0.13
    ETCH
    0.13
    CDF
    0.13
    Act Density 0.041%

    No Known Activations