INDEX
Explanations
adaptive computation and learning
New Auto-Interp
Negative Logits
t
1.71
at
1.14
us
1.12
as
1.10
i
1.07
u
1.06
ut
1.05
ون
1.05
ا
0.98
tans
0.94
POSITIVE LOGITS
on
1.05
0.86
sonra
0.85
isn
0.82
अदालत
0.77
കഥ
0.76
не
0.73
ли
0.70
Adaptive
0.70
。
0.70
Activations Density 0.006%