INDEX
    Explanations

    phrases that express advancement or progression

    New Auto-Interp
    Negative Logits
    du
    -0.15
    лиÑĤелÑĮ
    -0.14
     aktual
    -0.14
    TK
    -0.14
    lide
    -0.13
    Gesture
    -0.13
    kat
    -0.13
     Invent
    -0.13
    oba
    -0.13
    kor
    -0.13
    POSITIVE LOGITS
    aire
    0.18
    EMU
    0.17
    æĩĤ
    0.15
    ellig
    0.15
    yah
    0.15
    reesome
    0.15
    UTO
    0.14
    _rewrite
    0.14
     HEIGHT
    0.14
    imens
    0.14
    Act Density 0.064%

    No Known Activations