    Explanations

    references to academic or scientific publications and citations

    Negative Logits
    Token             Logit
    "IBUT"            -0.19
    "etsk"            -0.16
    "okable"          -0.16
    "<quote"          -0.16
    "StandardItem"    -0.15
    "ÑĤÑĶ"            -0.15
    " ZemÄĽ"          -0.15
    "libc"            -0.14
    "obao"            -0.14
    "ascus"           -0.14

    Positive Logits
    Token             Logit
    " hang"           0.17
    " Indies"         0.15
    "itto"            0.15
    " Hermes"         0.15
    " ay"             0.15
    " terminal"       0.14
    " Kem"            0.14
    " comparative"    0.14
    "oria"            0.14
    " cogn"           0.14
    Activation Density: 0.044%

    No Known Activations