INDEX
    Explanations

    phrases related to communication and access to resources

    New Auto-Interp
    Negative Logits
     للمعارف
    -0.54
    ypress
    -0.49
    InternalFrame
    -0.47
     nombreux
    -0.46
    Shear
    -0.44
    estad
    -0.44
     Dominus
    -0.42
     nombreuses
    -0.42
     >=",
    -0.42
    -0.41
    POSITIVE LOGITS
     only
    1.42
    only
    1.31
     jedynie
    1.23
     seulement
    1.21
    Only
    1.17
     Only
    1.15
     lediglich
    1.12
    而已
    1.11
     лишь
    1.07
    ONLY
    1.04
    Act Density 0.603%

    No Known Activations