INDEX
    Explanations

    mathematical expressions and notation

    New Auto-Interp
    Negative Logits
    lobal
    -0.18
    eltas
    -0.16
    TOCOL
    -0.15
    eras
    -0.15
    oft
    -0.14
    rupa
    -0.14
    abaj
    -0.14
    oÄį
    -0.14
    acman
    -0.14
    rvé
    -0.14
    POSITIVE LOGITS
    irim
    0.18
    алом
    0.15
     sob
    0.15
    807
    0.15
     Owens
    0.15
    Scope
    0.14
    143
    0.14
    ;;;;;;;;
    0.14
    _SHA
    0.14
    inel
    0.14
    Act Density 0.090%

    No Known Activations