Explanations
phrases related to community events and organizational activities
Negative Logits
CRET          -0.14
hetto         -0.14
sed           -0.13
usher         -0.13
convincing    -0.13
claimed       -0.13
demanded      -0.13
convinc       -0.12
_totals       -0.12
aight         -0.12
Positive Logits
pleased       0.35
proud         0.28
excited       0.28
delighted     0.28
happy         0.27
thrilled      0.26
please        0.24
Accept        0.23
accepting     0.22
currently     0.21
Activations Density 0.142%