INDEX
    Explanations

    terms related to classification and categorization in various contexts

    New Auto-Interp
    Negative Logits
    bsite
    -0.15
    ENCIL
    -0.15
    zeÅĪ
    -0.15
    amura
    -0.14
    erah
    -0.14
    ppv
    -0.14
    marshall
    -0.14
    estate
    -0.14
    ksam
    -0.14
    vertising
    -0.14
    POSITIVE LOGITS
     
    0.16
    ÂŃ
    0.15
    ize
    0.15
    ous
    0.15
    âĢį
    0.14
    ians
    0.14
     graduate
    0.14
    ization
    0.14
    ism
    0.14
    алÑİ
    0.14
    Act Density 0.472%

    No Known Activations