INDEX
Explanations
Here's a table summarizing the key differences:
Negative Logits
Token        Logit
완전         0.84
મિંગ         0.83
cepat        0.78
TRPV         0.76
ऱ्या          0.75
CDB          0.75
beware       0.74
PanelView    0.74
fär          0.73
xb           0.73
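Positive Logits
Token              Logit
[no token text]    0.84
[no token text]    0.83
starting           0.81
[no token text]    0.81
[no token text]    0.81
starting           0.81
[no token text]    0.80
[no token text]    0.80
[no token text]    0.79
[no token text]    0.79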
Activations Density 0.036%