INDEX
    Explanations

    references to usage and user involvement in various contexts

    New Auto-Interp
    Negative Logits
     for
    -0.38
     длÑı
    -0.28
     untuk
    -0.27
    	for
    -0.26
     für
    -0.26
    for
    -0.26
     για
    -0.24
     voor
    -0.23
    为
    -0.23
     pentru
    -0.23
    POSITIVE LOGITS
     sake
    0.60
     purposes
    0.58
    æĿ¥è¯´
    0.25
     purpose
    0.24
     reasons
    0.23
    èĢĮ
    0.22
    pur
    0.21
    ummies
    0.20
     PURPOSE
    0.20
    purpose
    0.19
    Act Density 1.047%

    No Known Activations