INDEX
Explanations
phrases related to product evaluations and performance assessments
Negative Logits
truly        -0.18
genius       -0.16
greatness    -0.16
perfection   -0.16
literally    -0.16
absolutely   -0.15
Truly        -0.15
まる         -0.15
orgeous      -0.14
brilliance   -0.14
Positive Logits
decent       0.56
reasonably   0.42
reasonable   0.38
fairly       0.33
reasonable   0.32
respectable  0.32
okay         0.28
fair         0.28
OK           0.28
moderately   0.28
Activations Density 0.074%