INDEX
    Explanations

    numerical data and publication references

    New Auto-Interp
    Negative Logits
    iri
    -0.15
    ault
    -0.15
    ivic
    -0.14
     éĻIJ
    -0.14
    éħį
    -0.14
    лÑĸв
    -0.14
    abaj
    -0.14
    emachine
    -0.14
    ache
    -0.13
    нÑİ
    -0.13
    POSITIVE LOGITS
     Dy
    0.16
    iÄį
    0.15
    каÑģ
    0.15
    ke
    0.15
    regar
    0.15
     Shaft
    0.14
    kee
    0.14
     Heck
    0.14
    eli
    0.14
    yla
    0.14
    Act Density 0.122%

    No Known Activations