INDEX
    Explanations

    ongoing or future actions

    tokens that indicate continuation or ongoing/continuous action (words signaling something continues).

    New Auto-Interp
    Negative Logits
    '
    1.79
    1.55
     
    1.38
    ،
    1.22
    \
    1.17
    1.17
    ).
    1.08
    ı
    1.08
    \"
    1.05
    ,
    1.04
    POSITIVE LOGITS
    the
    1.98
    تي
    1.60
    r
    1.48
    на
    1.48
    n
    1.41
    توان
    1.35
    ر
    1.34
    is
    1.33
    u
    1.25
    to
    1.24
    Act Density 0.087%

    No Known Activations