INDEX
    Explanations

    phrases indicating references or citations

    Immediately before "Here's" or "Here is"

    Negative Logits
     Meksiku            -0.95
     '\\;'              -0.85
    الدراسه             -0.85
    expandindo          -0.84
     صوتيه              -0.84
     nahilalakip        -0.80
    principalColumn     -0.78
     autorytatywna      -0.78
    تقاوى               -0.77
    makeConstraints     -0.77
    Positive Logits
     some               0.98
     how                0.96
     what               0.92
     a                  0.87
     another            0.86
     my                 0.80
     the                0.79
     our                0.72
     an                 0.71
     why                0.70
    Activation Density: 0.112%

    No Known Activations