INDEX
    Explanations

    references to specific applications or proposals in a research context

    New Auto-Interp
    Negative Logits
     übrigens
    -0.66
     persino
    -0.52
    Хьажоргаш
    -0.52
    ailleurs
    -0.51
     zelfs
    -0.50
     MMO
    -0.49
     frattempo
    -0.48
    şört
    -0.47
     sowieso
    -0.47
    Кстати
    -0.47
    POSITIVE LOGITS
     purpose
    0.52
    整體
    0.39
    目的是
    0.39
     intent
    0.38
     Vereinigte
    0.37
    Overall
    0.37
     целью
    0.37
    UnusedPrivate
    0.36
     Overall
    0.36
    0.36
    Act Density 0.050%

    No Known Activations