INDEX
    Explanations

    phrases that indicate the existence or status of individuals

    New Auto-Interp
    Negative Logits
    adro
    -0.18
    uja
    -0.16
    idor
    -0.15
    uis
    -0.15
    ente
    -0.15
    ar
    -0.14
    engo
    -0.14
    SCALL
    -0.14
    Äĥm
    -0.14
    RW
    -0.14
    POSITIVE LOGITS
    oton
    0.15
     leve
    0.15
    516
    0.15
    opal
    0.14
    emes
    0.14
    ertz
    0.14
    ivid
    0.14
     Hel
    0.14
    rix
    0.14
    icus
    0.14
    Act Density 0.037%

    No Known Activations