INDEX
    Explanations

    references to historical or demographic contexts

    Follows a single digit number

    modal verbs and pronouns

    New Auto-Interp
    Negative Logits
    ´
    -0.84
    -0.79
    -0.77
    `
    -0.73
    ’,
    -0.72
    ’.
    -0.72
    `,
    -0.70
     Dont
    -0.70
    ”,
    -0.70
     compan
    -0.69
    POSITIVE LOGITS
    ConstraintMaker
    0.74
    tagHelperRunner
    0.65
     للاسماء
    0.64
    PerformLayout
    0.61
     فريبيس
    0.60
    .)}
    0.59
    WriteTagHelper
    0.57
    :✨
    0.57
    abetes
    0.56
     \&
    0.55
    Act Density 0.140%

    No Known Activations