INDEX
    Explanations

    improving phrasing and word choice

    New Auto-Interp
    Negative Logits
    oare
    0.46
    0.42
     الري
    0.42
    0.42
     coals
    0.41
     השי
    0.41
    0.40
     autochtones
    0.40
    フォーマンス
    0.40
     المعادلات
    0.40
    POSITIVE LOGITS
     takich
    0.42
    át
    0.41
     dimin
    0.41
    ikiran
    0.41
     Такие
    0.39
     war
    0.39
    Medicare
    0.38
     grappling
    0.38
    0.38
     portent
    0.38
    Act Density 0.000%

    No Known Activations