INDEX
    Explanations

    numeric values followed by their units of measurement

    quantitative metrics and measurements

    Negative Logits
    Token          Logit
    "Synopsis"     -0.71
    "ormal"        -0.63
    "ady"          -0.62
    "gat"          -0.60
    "imen"         -0.60
    " Pulitzer"    -0.59
    "bred"         -0.58
    "values"       -0.58
    "harm"         -0.57
    "Wiki"         -0.56
    Positive Logits
    Token            Logit
    " increments"    1.04
    " apiece"        0.97
    "/$"             0.81
    "alion"          0.80
    "+."             0.78
    " thereafter"    0.73
    "TPS"            0.73
    " hers"          0.71
    "osuke"          0.70
    " rul"           0.70
    Activation Density: 0.259%

    No Known Activations