INDEX
    Explanations

    nouns and their descriptors

    Negative Logits
    Token           Logit
    "ache"          -0.17
    "igan"          -0.16
    "Ñıв"           -0.15
    "емо"           -0.14
    "ıģı"           -0.14
    ".addTo"        -0.13
    "éļ"            -0.13
    "ÅĻiv"          -0.13
    "exion"         -0.13
    ".Raise"        -0.13
    Positive Logits
    Token           Logit
    " consisting"   0.23
    " consists"     0.22
    " consist"      0.21
    " gá»ĵm"        0.19
    " consisted"    0.18
    " include"      0.17
    "pear"          0.17
    "pheres"        0.16
    " includes"     0.16
    "cons"          0.16
    Activation Density: 0.185%

    No Known Activations