Explanations
expressions of gratitude and appreciation
Negative Logits
transférez     -0.57
ſch            -0.53
virano         -0.53
ſeveral        -0.53
houſe          -0.52
rilev          -0.51
Houſe          -0.51
createState    -0.50
aarrggbb       -0.49
ſever          -0.49
Positive Logits
gratitude       0.82
Gratitude       0.72
grateful        0.53
appreciation    0.52
thanking        0.52
agradecimiento  0.50
thanked         0.50
thank           0.49
thankful        0.48
thanksgiving    0.48
Activations Density 0.010%