INDEX
    Explanations

    numerical data or quantitative metrics

    New Auto-Interp
    Negative Logits
     Forty
    -0.26
     Fifty
    -0.26
     forty
    -0.23
     sixty
    -0.22
     seventy
    -0.20
    64
    -0.20
     fifty
    -0.20
    .ZERO
    -0.19
    <quote
    -0.18
     вÑĸз
    -0.18
    POSITIVE LOGITS
    02
    0.45
    03
    0.45
    04
    0.45
    06
    0.45
    09
    0.44
    07
    0.44
    05
    0.44
    08
    0.43
    01
    0.42
    2
    0.41
    Act Density 0.116%

    No Known Activations