    Explanations

    terms related to identification and classification

    Negative Logits
    "AndView"           -0.15
    ".backend"          -0.15
    "ä¼į"               -0.15
    " Masc"             -0.14
    "anuts"             -0.14
    "å¯Ħ"               -0.14
    "amma"              -0.14
    "okes"              -0.14
    "CKER"              -0.13
    "emet"              -0.13

    Positive Logits
    " indicators"        0.23
    " signs"             0.21
    " indicator"         0.21
    " typically"         0.19
    "Indicator"          0.19
    "ingo"               0.18
    "Characteristic"     0.18
    " Indicator"         0.17
    "indicator"          0.17
    "marker"             0.17

    Activation Density: 0.152%

    No Known Activations