INDEX
Explanations
phrases indicating rates of change or growth
New Auto-Interp
Negative Logits
ãĥ¼ãĥ
-0.08
udging
-0.07
Mitar
-0.07
millennium
-0.07
ransition
-0.06
Targets
-0.06
aida
-0.06
å¨ľ
-0.06
InParameter
-0.06
anson
-0.06
POSITIVE LOGITS
rate
0.10
rates
0.09
olon
0.08
Rate
0.07
faster
0.07
rates
0.07
rate
0.07
Rates
0.07
Rates
0.07
ret
0.07
Activations Density 0.008%