INDEX
Explanations
mentions of monetary values and related numerical data
New Auto-Interp
Negative Logits
ATEGORY
-0.14
£
-0.14
ilden
-0.14
رÙģØª
-0.14
elan
-0.13
uien
-0.13
áp
-0.13
umberland
-0.13
Daniels
-0.13
èĦĤ
-0.13
POSITIVE LOGITS
200
0.54
bush
0.28
Û²Û°Û°
0.27
Bush
0.26
®
0.22
Bush
0.21
300
0.18
data
0.18
ï¼Ĵï¼IJ
0.18
400
0.17
Activations Density 0.039%