INDEX
    Explanations

    elements related to authors and their works

    Negative Logits
        "ovky"     -0.19
        "angi"     -0.18
        ")(((("    -0.15
        "hower"    -0.15
        "adera"    -0.15
        "itol"     -0.15
        "ắng"      -0.15
        "ogne"     -0.15
        "éľŀ"      -0.14
        "bucks"    -0.14
    Positive Logits
        "otto"          0.16
        "importe"       0.15
        "042"           0.14
        "ANY"           0.14
        " distur"       0.14
        "zin"           0.14
        "ÑĤаб"          0.14
        " outnumber"    0.14
        "erland"        0.14
        "586"           0.13
    Activation Density: 0.440%

    No Known Activations