INDEX
    Explanations

    various forms of numerical data and formatting

    New Auto-Interp
    Negative Logits
     Flor
    -0.16
     flor
    -0.16
     Ravens
    -0.16
    ismet
    -0.15
     tub
    -0.15
    lyn
    -0.15
     Cush
    -0.15
     Holly
    -0.14
     Russ
    -0.14
    Lens
    -0.14
    POSITIVE LOGITS
    166
    0.44
    66
    0.42
    466
    0.42
    Ķ
    0.38
     Yang
    0.35
    066
    0.34
     diamond
    0.33
     Diamond
    0.32
    666
    0.31
    566
    0.30
    Act Density 0.031%

    No Known Activations