INDEX
Explanations
references to organizations or entities, particularly corporations
New Auto-Interp
Negative Logits
req
-0.88
syn
-0.77
rose
-0.74
joy
-0.69
Female
-0.67
ĺħ
-0.65
boarding
-0.64
eful
-0.64
dL
-0.63
uberty
-0.62
POSITIVE LOGITS
orpor
0.97
wide
0.92
headquartered
0.89
oreal
0.87
corporation
0.87
shareholder
0.81
owned
0.79
porate
0.79
corporations
0.75
conglomer
0.75
Activations Density 0.022%