INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     incidental
    -0.06
     Mezi
    -0.06
     analý
    -0.06
     RADIO
    -0.06
     nad
    -0.06
     poměr
    -0.06
     hâl
    -0.06
     ifad
    -0.06
    βά
    -0.06
     nabíd
    -0.06
    POSITIVE LOGITS
     work
    0.17
     Work
    0.17
    work
    0.16
    Work
    0.16
     works
    0.15
    -work
    0.14
    works
    0.14
     worked
    0.13
    WORK
    0.12
    (work
    0.12
    Act Density 0.136%

    No Known Activations