INDEX
    Explanations

    expressions of agreement or strong opinions

    following pronouns or question words

    program name and description

    Negative Logits
     AssemblyCulture  -0.81
    AndEndTag         -0.78
    SourceChecksum    -0.78
    </caption>        -0.76
     الرياضيه         -0.74
    !")               -0.72
    momix             -0.70
    SOUNDBITE         -0.68
    ^(@)              -0.68
     ddelweddau       -0.67
    Positive Logits
     What             0.77
    Honestly          0.76
     Honestly         0.75
     Why              0.73
    What              0.72
     Wasn             0.70
     I                0.70
    I                 0.68
    Looks             0.68
     Maybe            0.67
    Act Density 0.155%

    No Known Activations