INDEX
Explanations
comparisons in terms of improvement or decline over time
New Auto-Interp
Negative Logits
Became
-0.58
spin
-0.55
annels
-0.54
unique
-0.53
ointed
-0.53
urai
-0.52
Finally
-0.52
PLEASE
-0.52
finally
-0.52
-0.51
POSITIVE LOGITS
predecessors
1.30
previous
1.24
counterparts
1.05
predecessor
1.01
usual
0.99
previously
0.95
earlier
0.95
usual
0.93
preceding
0.90
elsewhere
0.89
Activations Density 5.346%