INDEX
    Explanations

    adjectives and adverbial forms in various contexts

    Negative Logits
    itu       -0.16
    Ïħ        -0.16
    ifo       -0.16
    oller     -0.15
    ined      -0.15
    ÑĮ        -0.15
    à¯į       -0.14
    ÑĮÑİ      -0.14
    ìľ¼ë¡ľ    -0.14
    itore     -0.14
    Positive Logits
    tics      0.30
    lation    0.30
    e         0.29
    sis       0.28
    ellow     0.28
    tic       0.27
    yyyy      0.27
    ea        0.26
    yyy       0.26
    eah       0.26
    Activation Density: 0.067%

    No Known Activations