    Explanations

    numbers and mathematical expressions

    Negative Logits
    Token           Logit
    " Some"         -0.57
    "IVEREF"        -0.56
    "Some"          -0.56
    " some"         -0.49
    " qualche"      -0.48
    "some"          -0.46
    "ècie"          -0.46
    "tdessen"       -0.44
    "gdx"           -0.43
    "board"         -0.43

    Positive Logits
    Token           Logit
    " half"         0.97
    "half"          0.90
    " HALF"         0.82
    " Half"         0.78
    "HALF"          0.78
    "Half"          0.77
    " halves"       0.71
    " mitad"        0.70
    "一半"          0.70
    "ásra"          0.69

    Activation Density: 0.593%

    No Known Activations