INDEX
    Explanations

    Proper nouns, specifically names of people in various contexts

    Word fragments, potentially the ends of words

    Negative Logits
    StoryboardSegue     -0.96
    OGND                -0.87
     виправивши         -0.83
     الرياضيه           -0.83
    uarts               -0.83
     snippetHide        -0.80
    InstrumentedTest    -0.79
    原始内容存档于      -0.78
    SOUNDBITE           -0.77
     estekak            -0.77
    Positive Logits
                         0.52
    ber                  0.51
    ha                   0.49
    ya                   0.49
    har                  0.45
    <em>                 0.45
    ra                   0.45
    As                   0.44
    ch                   0.44
                         0.43
    Act Density 0.141%

    No Known Activations