INDEX
    Explanations

    articles and descriptive phrases that signify qualities or characteristics of nouns

    New Auto-Interp
    Negative Logits
     dit
    -0.07
     rates
    -0.06
    acha
    -0.06
     dara
    -0.06
    ema
    -0.06
     sealing
    -0.06
     pregnant
    -0.05
     ÐľÐ¸Ðº
    -0.05
     coc
    -0.05
     Rig
    -0.05
    POSITIVE LOGITS
    ECTOR
    0.08
    ¼åIJĪ
    0.08
    uteur
    0.07
    uthor
    0.07
    atural
    0.07
    ôme
    0.07
    лага
    0.07
    leta
    0.07
    鼨
    0.07
    tring
    0.07
    Act Density 0.039%

    No Known Activations