INDEX
    Explanations

    mathematical expressions and notations

    New Auto-Interp
    Negative Logits
    ,
    -0.60
    -0.52
    []
    
    -0.52
     Bar
    -0.51
     void
    -0.50
     bar
    -0.50
    thâu
    -0.48
     w
    -0.48
     fan
    -0.47
     W
    -0.47
    POSITIVE LOGITS
     avoient
    0.97
     feroit
    0.96
     étoit
    0.93
     auroit
    0.90
     étoient
    0.88
    }}}}
    0.88
     pouvoit
    0.85
     ainfi
    0.81
    }}}
    0.81
    )))))
    0.81
    Act Density 1.056%

    No Known Activations