INDEX
    Explanations

    punctuation marks, particularly periods, question marks, and colons

    New Auto-Interp
    Negative Logits
     Dig
    -0.07
     Bend
    -0.06
    ads
    -0.06
     dig
    -0.06
    dig
    -0.06
    MING
    -0.06
    minster
    -0.06
    ming
    -0.06
    Dig
    -0.05
     pop
    -0.05
    POSITIVE LOGITS
    .si
    0.07
    /Area
    0.07
    dük
    0.07
    inas
    0.06
    )prepare
    0.06
    inky
    0.06
    UILTIN
    0.06
    λεκ
    0.06
    .Library
    0.06
    eyh
    0.06
    Act Density 0.024%

    No Known Activations