INDEX
Explanations
advancement and progression
New Auto-Interp
Negative Logits
actually
0.95
when
0.92
surprise
0.91
actually
0.86
exclamation
0.80
immediately
0.79
appearing
0.77
Actually
0.76
when
0.76
surprise
0.76
POSITIVE LOGITS
progresses
1.81
prepares
1.41
progressed
1.27
unfolds
1.27
struggled
1.24
progress
1.22
prepare
1.21
iler
1.19
matures
1.16
nears
1.14
Activations Density 0.216%