INDEX
    Explanations

    references to quantitative measures and thresholds

    New Auto-Interp
    Negative Logits
    vÄĽt
    -0.14
    OTTOM
    -0.14
    اÛĮاÙĨ
    -0.14
    izzy
    -0.13
    iasm
    -0.13
     <<=
    -0.13
    okie
    -0.12
    à¹ĩà¸Ļว
    -0.12
    iltr
    -0.12
    okoj
    -0.12
    POSITIVE LOGITS
     open
    1.36
    open
    1.21
    -open
    1.20
     opened
    1.16
     OPEN
    1.14
     Open
    1.13
    Open
    1.10
    .open
    1.10
    _open
    1.09
     opens
    1.09
    Act Density 0.875%

    No Known Activations