INDEX
Explanations
sequences indicating change or movement over time
New Auto-Interp
Negative Logits
awi
-0.16
erken
-0.16
366
-0.16
ÏģÏħ
-0.15
amas
-0.14
-alist
-0.14
umbed
-0.14
äll
-0.14
Replacement
-0.14
orce
-0.14
POSITIVE LOGITS
again
0.26
again
0.25
Again
0.22
Again
0.22
lại
0.21
novamente
0.20
ëĭ¤ìĭľ
0.19
Ñģнова
0.19
wieder
0.19
weer
0.18
Activations Density 0.139%