Explanations
expressions of regret
expressions of regret and remorse
Negative Logits
Token      Logit
ĪĴ         -0.72
uana       -0.69
rigs       -0.69
icles      -0.69
agnetic    -0.65
icle       -0.63
adj        -0.63
女         -0.63
emonic     -0.62
place      -0.62
Positive Logits
Token      Logit
fully      1.15
ful        1.00
fulness    0.96
FUL        0.84
regrets    0.83
imaru      0.82
faced      0.80
vier       0.79
regret     0.78
ting       0.78
Activations Density: 0.019%