INDEX
Explanations
pause, letting the words hang in the air
New Auto-Interp
Negative Logits
solit
1.17
around
0.97
B
0.96
T
0.93
Decor
0.92
orn
0.89
Se
0.89
industri
0.87
from
0.86
BatchNorm
0.86
POSITIVE LOGITS
ropshire
1.57
보호
1.43
필수
1.40
<unused415>
1.36
확인
1.33
확대
1.33
Ketua
1.29
추가
1.28
계속
1.28
保護
1.28
Activations Density 0.245%