INDEX
Explanations
ongoing or future actions
tokens that indicate continuation or ongoing/continuous action (words signaling something continues).
New Auto-Interp
Negative Logits
'
1.79
’
1.55
1.38
،
1.22
\
1.17
、
1.17
).
1.08
ı
1.08
\"
1.05
,
1.04
POSITIVE LOGITS
the
1.98
تي
1.60
r
1.48
на
1.48
n
1.41
توان
1.35
ر
1.34
is
1.33
u
1.25
to
1.24
Activations Density 0.087%