INDEX
Explanations
statements and claims about political or corporate actions and decisions
Negative Logits
565          -0.16
ÅĤem         -0.15
ãĥ«ãĥĪ       -0.14
ulu          -0.14
WO           -0.14
ColorBrush   -0.14
ìŀ¥ìĿĦ       -0.14
ÑĪло         -0.14
HEST         -0.14
æľŃ          -0.14
Positive Logits
himself      0.33
company      0.31
his          0.29
Company      0.25
team         0.24
company      0.24
ãĥģãĥ¼ãĥł    0.24
åħ¬åı¸       0.23
his          0.23
compan       0.20
Activations Density 0.431%
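
Several tokens in the logit lists (for example ãĥģãĥ¼ãĥł and åħ¬åı¸) look like the display form of a GPT-2-style byte-level BPE vocabulary, in which each raw byte is shown as one printable character. The source does not name the tokenizer, so this is an assumption. Under that assumption, the following minimal Python sketch maps such strings back to readable text. The function names `bytes_to_unicode` and `decode_token` are illustrative, and the table is rebuilt from the publicly known GPT-2 byte mapping rather than taken from this page.

```python
def bytes_to_unicode():
    """Byte -> printable-character table of GPT-2-style byte-level BPE.

    Assumption: the tokens on this page use this mapping.
    """
    # Bytes that are already printable keep their own code point.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every other byte is shifted to an unused code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: display character -> raw byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level display string back into ordinary text.

    Characters that are not in the table (already-decoded text) are
    passed through by re-encoding them as UTF-8.
    """
    raw = bytearray()
    for ch in token:
        if ch in UNICODE_TO_BYTE:
            raw.append(UNICODE_TO_BYTE[ch])
        else:
            raw.extend(ch.encode("utf-8"))
    # Invalid byte sequences become U+FFFD instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥģãĥ¼ãĥł", "åħ¬åı¸", "company"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, ãĥģãĥ¼ãĥł decodes to チーム (Japanese for "team") and åħ¬åı¸ decodes to 公司 (Chinese for "company"), which matches the English tokens company and team in the positive-logit list.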