INDEX
Explanations
introductions and greetings in written communications
mentions of a community or group engagement
New Auto-Interp
Negative Logits
ilitarian
-0.71
rities
-0.71
arcity
-0.68
ividual
-0.61
urat
-0.60
mob
-0.58
oreal
-0.58
govtrack
-0.57
VERTISEMENT
-0.56
emo
-0.55
POSITIVE LOGITS
Welcome
1.06
Welcome
0.97
welcome
0.95
Today
0.93
thank
0.93
Yesterday
0.92
Thank
0.91
congratulations
0.91
Firstly
0.90
thanks
0.88
Activations Density 0.136%