Explanations
criticisms and negative comments
negative evaluations and criticisms
Negative Logits
Token     Logit
`.        -0.83
.''.      -0.78
soType    -0.75
''.       -0.74
".        -0.72
'.        -0.72
)).       -0.71
"!        -0.70
.?        -0.69
"))       -0.68
Positive Logits
Token       Logit
rundown     0.63
goofy       0.60
slick       0.60
upfront     0.59
flashy      0.59
quirks      0.58
grunt       0.58
lackluster  0.56
bloated     0.56
handful     0.56
Activations Density: 1.802%