INDEX
    Explanations

    mathematical symbols and expressions, particularly fractions and coordinates

    New Auto-Interp
    Negative Logits
     ro
    -0.16
    icari
    -0.15
    >NN
    -0.14
     gy
    -0.14
    ighb
    -0.14
    -LAST
    -0.14
    :disable
    -0.14
     dy
    -0.14
     lean
    -0.14
    uida
    -0.14
    POSITIVE LOGITS
    ouch
    0.16
    ów
    0.15
    EventManager
    0.14
    .jms
    0.14
    âĦĸâĦĸ
    0.14
    öl
    0.13
     baiser
    0.13
    óż
    0.13
     sóc
    0.13
    arel
    0.13
    Act Density 0.120%

    No Known Activations