INDEX
    Explanations

    copyright and publication information

    New Auto-Interp
    Negative Logits
    oma
    -0.19
    Tween
    -0.17
    ella
    -0.16
     Trou
    -0.14
     reactive
    -0.14
    -Clause
    -0.14
     randomly
    -0.14
    aston
    -0.14
    azz
    -0.13
     Reactive
    -0.13
    POSITIVE LOGITS
    anzi
    0.16
     navr
    0.15
    eza
    0.15
    ivec
    0.15
    ÅĻes
    0.14
     aliqua
    0.14
    ãĤ¥
    0.14
     agre
    0.14
     vorhand
    0.14
    ầm
    0.13
    Act Density 0.028%

    No Known Activations