INDEX
Explanations
changes or modifications made to something
terms related to modifications and updates
New Auto-Interp
Negative Logits
Fit
-0.60
Fein
-0.57
Knot
-0.56
Cure
-0.56
rics
-0.55
dan
-0.53
eers
-0.53
Nile
-0.53
lore
-0.51
AIDS
-0.51
POSITIVE LOGITS
drastically
1.04
accordingly
1.02
dramatically
1.00
substantially
0.97
considerably
0.95
significantly
0.95
radically
0.89
slightly
0.85
greatly
0.84
markedly
0.83
Activations Density 0.171%