INDEX
Explanations
monetary values and their associated numerical contexts
Negative Logits
thumbs      -0.18
zero        -0.16
909         -0.15
951         -0.15
usz         -0.15
937         -0.15
itself      -0.14
793         -0.14
billions    -0.14
949         -0.14
Positive Logits
เตอร        0.19
意义        0.16
odo         0.15
adera       0.15
àng         0.15
idi         0.14
full        0.14
さら        0.14
odor        0.14
ataka       0.14
Activations Density 0.067%