INDEX
Explanations
phrases or sentences starting with 'we'
repetitive pronouns and collective language indicating inclusion or teamwork
New Auto-Interp
Negative Logits
forms
-0.67
Organizations
-0.59
Rarity
-0.59
Authority
-0.57
Oprah
-0.56
Commissioner
-0.56
whel
-0.55
âĢ¢âĢ¢
-0.55
gratification
-0.55
Coalition
-0.54
POSITIVE LOGITS
ighed
1.09
arers
1.05
assume
1.00
eding
0.96
're
0.96
ourselves
0.96
aning
0.95
athered
0.94
've
0.92
igh
0.92
Activations Density 0.187%