INDEX
Explanations
comparisons using the phrase "not as" to highlight differences in quality or extent
comparative expressions that convey limitations or shortcomings
New Auto-Interp
Negative Logits
COUR
-0.72
ACTIONS
-0.71
TAMADRA
-0.65
Macintosh
-0.63
YES
-0.61
NUM
-0.61
itudes
-0.60
YES
-0.60
*:
-0.59
onomy
-0.59
POSITIVE LOGITS
anymore
0.89
yet
0.86
imilar
0.83
irlf
0.83
far
0.81
much
0.80
vernment
0.79
pired
0.78
well
0.75
evidenced
0.74
Activations Density 0.084%