Explanations
greeting phrases and expressions of welcome
Negative Logits
dessutom    -0.55
прочем      -0.51
zudem       -0.49
inoltre     -0.49
außerdem    -0.48
Außerdem    -0.47
Zudem       -0.46
Außerdem    -0.46
Inoltre     -0.45
però        -0.44
Positive Logits
welcome          0.72
👋               0.71
sorry            0.66
glad             0.63
delighted        0.63
sorry            0.61
congratulations  0.61
Apologies        0.60
apologies        0.59
congratulations  0.59
Activations Density: 0.287%