INDEX
Explanations
expressions of gratitude and appreciation
New Auto-Interp
Negative Logits
destul
-0.37
quieras
-0.37
schre
-0.35
ніципа
-0.35
vys
-0.35
Schle
-0.35
bună
-0.35
höhe
-0.34
leads
-0.34
annoncé
-0.34
POSITIVE LOGITS
honored
1.05
grateful
1.01
proud
0.97
humbled
0.91
privileged
0.90
thankful
0.88
honoured
0.88
pleased
0.78
thrilled
0.76
glad
0.76
Activations Density 0.218%