INDEX
    Explanations

    punctuation marks

    punctuation or commas in the text

    New Auto-Interp
    Negative Logits
    ,
    -0.91
    ,...
    -0.87
     (>
    -0.73
    -
    -0.72
    SourceFile
    -0.72
    -,
    -0.72
    STON
    -0.71
    ,-
    -0.67
    vale
    -0.65
    Previous
    -0.65
    POSITIVE LOGITS
     somew
    0.74
    udes
    0.61
    prototype
    0.61
     disclaim
    0.60
     albeit
    0.58
     depending
    0.57
     inclined
    0.56
     beh
    0.56
     namely
    0.54
     ought
    0.52
    Act Density 0.239%

    No Known Activations