INDEX
Explanations
phrases related to improvements or better versions
references to the word "Better" and its variations, indicating a focus on improvement or enhancements
Negative Logits
Token         Logit
trl           -0.73
ette          -0.71
ettes         -0.70
FK            -0.69
Pione         -0.66
SEE           -0.65
heter         -0.65
NetMessage    -0.64
ARS           -0.64
Dresden       -0.63
Positive Logits
Token         Logit
suited        0.96
than          0.91
behaved       0.87
Than          0.82
than          0.77
lihood        0.75
idge          0.74
acquainted    0.73
chance        0.73
Faster        0.72
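
The logit lists above rank tokens by how strongly the feature pushes their output logits up or down. The page does not say how these values were computed. A common approach, assumed here, is to project the feature's decoder direction through the model's unembedding matrix and sort the result. The sketch below uses a toy two-dimensional model with a four-token vocabulary; the names `logit_effects` and `top_and_bottom` and all numbers are hypothetical and are not taken from the page.

```python
def logit_effects(decoder_direction, unembedding, vocab):
    """Dot the feature's decoder direction with each token's unembedding column.

    Assumption: unembedding is a list of rows, one per model dimension,
    each row holding one weight per vocabulary token.
    """
    effects = {}
    for token_index, token in enumerate(vocab):
        effects[token] = sum(
            d * row[token_index] for d, row in zip(decoder_direction, unembedding)
        )
    return effects


def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative (token, effect) pairs."""
    ranked = sorted(effects.items(), key=lambda item: item[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative


# Toy, hypothetical values: 2 model dimensions, 4 tokens.
vocab = ["a", "b", "c", "d"]
decoder_direction = [1.0, -0.5]
unembedding = [
    [0.5, 0.25, -0.5, 0.0],
    [0.0, 0.5, 0.5, 1.0],
]

positive, negative = top_and_bottom(
    logit_effects(decoder_direction, unembedding, vocab), k=2
)
print("Positive logits:", positive)
print("Negative logits:", negative)
```

In a real model the same projection would run over the full vocabulary, and the ten largest and ten smallest entries would fill the two lists.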
Activations Density: 0.035%
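
Activation density is usually read as the fraction of sampled tokens on which the feature fires. The page does not define it, so the sketch below assumes "fires" means an activation strictly above a threshold of zero. The function name `activation_density` and the toy sample are hypothetical; the sample is built only so that the ratio matches the 0.035% figure and is not real activation data.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of activations strictly above the threshold.

    Assumption: a token counts as active when its activation exceeds
    `threshold`; the source page does not state its own definition.
    """
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for value in activations if value > threshold)
    return active / len(activations)


# Hypothetical sample: 7 active tokens out of 20,000.
sample = [0.0] * 19993 + [1.5] * 7
density = activation_density(sample)
print(f"Activations Density: {density:.3%}")
```

A density this low means the feature is sparse: under the assumed definition it is active on roughly 3 or 4 of every 10,000 tokens.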