INDEX
Explanations
positive descriptions of experiences or relationships
Negative Logits
zbek             -0.15
Sadly            -0.15
Sadly            -0.15
يكي              -0.15
unfortunately    -0.14
092              -0.14
zego             -0.14
Beast            -0.14
sadly            -0.13
Unfortunately    -0.13
Positive Logits
remedy           0.24
worse            0.24
Worse            0.22
therefore        0.21
그래             0.18
solution         0.18
workaround       0.18
remedies         0.17
solutions        0.17
remed            0.17
Activations Density 0.592%