INDEX
    Explanations

    terms related to surgical procedures and instruments

    New Auto-Interp
    Negative Logits
    ynos
    -0.17
    ctr
    -0.15
    elow
    -0.15
     ucwords
    -0.15
    ButtonModule
    -0.14
    asz
    -0.14
    itter
    -0.14
    ofire
    -0.14
     æł
    -0.14
    eward
    -0.14
    POSITIVE LOGITS
     con
    0.15
    dust
    0.15
    side
    0.15
    DTD
    0.15
    noop
    0.15
    bek
    0.14
    cess
    0.14
    athom
    0.14
     dame
    0.14
    ener
    0.13
    Act Density 0.003%

    No Known Activations