INDEX
    Explanations

    numerical figures, symbols, and formatting elements in text

    text segments with special characters or formatting symbols

    New Auto-Interp
    Negative Logits
    seless
    -0.77
    acles
    -0.75
     fireplace
    -0.74
    ahime
    -0.73
    inator
    -0.71
    unia
    -0.70
     Beir
    -0.69
    nesses
    -0.68
    anse
    -0.68
    liness
    -0.64
    POSITIVE LOGITS
    (*
    0.81
     (*
    0.79
    ERROR
    0.76
    STD
    0.74
    STDOUT
    0.73
    testing
    0.73
    TEXT
    0.71
    catentry
    0.70
    Thompson
    0.69
    FREE
    0.69
    Act Density 0.025%

    No Known Activations