INDEX
    Explanations

    punctuation marks, particularly commas

    New Auto-Interp
    Negative Logits
    antz
    -0.76
    imize
    -0.72
     disadvantage
    -0.70
    inar
    -0.69
    ussy
    -0.68
    cius
    -0.67
    natureconservancy
    -0.67
     disadvantages
    -0.67
    ge
    -0.65
    amins
    -0.64
    POSITIVE LOGITS
     albeit
    1.09
     presumably
    0.88
     although
    0.88
     unsurprisingly
    0.86
     ostensibly
    0.85
     coinc
    0.82
     preceded
    0.81
     flanked
    0.80
     incidentally
    0.80
     huh
    0.79
    Act Density 0.191%

    No Known Activations