INDEX
Explanations
greetings or well-wishes in text
expressions of happiness or well-wishes related to celebrations or special occasions
New Auto-Interp
Negative Logits
lege
-0.75
vae
-0.75
æ©Ł
-0.74
ciplinary
-0.72
vere
-0.70
arin
-0.69
$$$$
-0.67
afort
-0.66
urally
-0.65
xy
-0.64
POSITIVE LOGITS
endings
1.18
Birthday
0.96
birthday
0.94
Hour
0.89
holidays
0.87
ending
0.87
hour
0.83
Mondays
0.81
Ending
0.81
Gilmore
0.79
Activations Density 0.052%