Explanations
instances of monetary transactions and payment-related phrases
Negative Logits
spending   -0.16
rell       -0.16
oje        -0.15
xcf        -0.15
egrate     -0.15
andom      -0.15
³          -0.15
oks        -0.14
stag       -0.14
Spending   -0.14
Positive Logits
dividends     0.22
attention     0.20
Attention     0.19
lip           0.18
attention     0.18
bills         0.18
homage        0.18
compliment    0.17
compliments   0.16
respects      0.16
Activations Density 0.107%