INDEX
Explanations
comparisons indicating improvement or optimal choices
phrases indicating the best or optimal way to do something
New Auto-Interp
Negative Logits
naires
-0.69
mostly
-0.67
stairs
-0.65
Occasionally
-0.63
itially
-0.62
wick
-0.60
insofar
-0.60
adra
-0.59
ryu
-0.59
Mostly
-0.57
POSITIVE LOGITS
than
1.04
Than
0.92
encaps
0.78
than
0.76
testament
0.75
illustration
0.69
exempl
0.68
nor
0.67
juxtap
0.65
deserving
0.65
Activations Density 0.097%