INDEX
Explanations
phrases related to personal or political interests
references to self-serving behaviors and interests in decision-making
New Auto-Interp
Negative Logits
drop
-0.87
Trace
-0.76
DragonMagazine
-0.74
etheless
-0.74
xit
-0.71
wayne
-0.69
anmar
-0.69
dra
-0.69
ername
-0.68
ori
-0.67
POSITIVE LOGITS
interests
1.55
interest
1.34
profit
1.30
convenience
1.24
agendas
1.22
profits
1.16
selfish
1.16
gratification
1.12
interest
1.12
greed
1.12
Activations Density 0.519%