INDEX
Explanations
trace amounts and small quantities
New Auto-Interp
Negative Logits
simplistic
0.44
}-[
0.38
லுக்கு
0.38
省
0.38
easy
0.37
shortest
0.37
下了
0.37
简
0.37
simplest
0.37
తగ్గ
0.37
POSITIVE LOGITS
trace
1.80
traces
1.70
trace
1.59
Trace
1.55
Trace
1.52
traces
1.43
少量
1.13
неболь
1.10
TRACE
1.09
pequeñas
1.09
Activations Density 0.061%