INDEX
Explanations
mathematical fractions and equations
Negative Logits
il        0.84
x         0.72
k         0.70
↵         0.58
ש         0.53
el        0.52
ين        0.51
formar    0.48
j         0.48
desist    0.47
Positive Logits
{         0.89
a         0.77
be        0.74
0         0.64
হইয়৷      0.61
Хо        0.59
۔         0.58
тебя      0.55
\         0.55
as        0.55
Activations Density 0.120%