INDEX
Explanations
processes unfold and continue
New Auto-Interp
Negative Logits
when
0.86
significant
0.72
عندما
0.72
вообще
0.71
effective
0.70
可以用
0.70
want
0.69
иметь
0.69
start
0.69
قرار
0.68
POSITIVE LOGITS
progresses
1.68
prepares
1.47
continues
1.43
continue
1.40
prepare
1.34
unfolds
1.29
continúa
1.27
nears
1.24
struggled
1.22
继续
1.16
Activations Density 0.104%