INDEX
    Explanations

    proper nouns, particularly names of individuals and places

    New Auto-Interp
    Negative Logits
    ino
    -0.16
    моÑĤ
    -0.15
    oller
    -0.15
    วล
    -0.15
    INO
    -0.15
    NL
    -0.14
     JADX
    -0.14
     Angiospermae
    -0.14
     <!--[
    -0.14
    åĭ¤
    -0.14
    POSITIVE LOGITS
    à¹Ģà¸ķà¸Ńร
    0.14
     Pou
    0.14
    jde
    0.14
     è´
    0.13
    chos
    0.13
     hairy
    0.13
    zdy
    0.13
     bare
    0.13
     å¥
    0.13
     Gan
    0.13
    Act Density 0.001%

    No Known Activations