INDEX
    Explanations

    numerical values appearing in a structured format

    measurements and quantities in various contexts

    Negative Logits
    Token          Logit
    "phrine"       -0.94
    " Halls"       -0.82
    "theless"      -0.73
    "deen"         -0.66
    " GOODMAN"     -0.65
    "hower"        -0.64
    "thirds"       -0.63
    " Galile"      -0.62
    " kitchens"    -0.61
    " Rumble"      -0.60
    Positive Logits
    Token          Logit
    "icago"        0.97
    ".,"           0.95
    "./"           0.82
    " ........"    0.81
    "ickr"         0.80
    "Avg"          0.78
    "hered"        0.77
    "emp"          0.77
    "nyder"        0.76
    "iances"       0.75
    Activation Density: 0.023%

    No Known Activations