INDEX
    Explanations

    historical reasons and context

    New Auto-Interp
    Negative Logits
    więks
    1.26
     Rakyat
    1.23
    érations
    1.21
    shake
    1.21
    peripheral
    1.20
    ప్రదేశ
    1.20
    helmet
    1.19
     ż
    1.18
    ຈັດສົ່ງ
    1.18
    ainder
    1.17
    POSITIVE LOGITS
     merupakan
    1.33
    ang
    1.03
     ang
    1.00
    us
    0.98
     besondere
    0.98
     באמצעות
    0.95
    नु
    0.93
     strenuous
    0.93
     mustered
    0.92
     त्या
    0.91
    Act Density 0.004%

    No Known Activations