Explanations
positive descriptions and evaluations
expressions of subjective opinion or evaluation
Negative Logits
ļéĨĴ      -0.77
failed    -0.66
ãĤ´ãĥ³    -0.66
Alone     -0.65
ourning   -0.64
çͰ       -0.63
ifled     -0.63
den       -0.63
but       -0.62
ãĥĨ       -0.62
Positive Logits
nonetheless    1.55
nevertheless   1.24
worth          1.22
worthwhile     1.19
undeniable     1.16
undeniably     1.13
manageable     1.09
achievable     1.00
damn           1.00
definitely     0.97
Activations Density 0.398%