Explanations
amounts of money and financial figures in a context related to spending or funding
Negative Logits
दर          -0.16
eli         -0.15
377         -0.15
amon        -0.15
anywhere    -0.14
977         -0.14
anything    -0.14
leigh       -0.14
ortal       -0.14
"[%         -0.14
Positive Logits
worth       0.42
Worth       0.33
worth       0.32
sworth      0.27
of          0.23
in          0.22
toward      0.20
towards     0.19
-w          0.16
ในการ       0.16
Activations Density 0.068%