INDEX
    Explanations

    instances of punctuation or symbols, particularly those indicating a pause or list in writing

    comparisons and contrasts

    New Auto-Interp
    Negative Logits
     respectively
    -0.39
     Sef
    -0.39
    -0.38
     followed
    -0.37
     SAFE
    -0.36
    ների
    -0.36
    Safe
    -0.35
    BASELINE
    -0.35
     pena
    -0.35
     Safe
    -0.34
    POSITIVE LOGITS
     otomatig
    0.71
    aarrggbb
    0.66
     ujednoznacz
    0.62
    EndContext
    0.57
    bootstrapcdn
    0.54
    GEBURTSDATUM
    0.54
     utafitiHapana
    0.54
     mijne
    0.51
    Ours
    0.49
    AsUp
    0.48
    Act Density 0.024%

    No Known Activations