INDEX
Explanations
dollar signs indicating currency or financial references
New Auto-Interp
Negative Logits
••••
-0.81
lati
-0.70
eſt
-0.70
Vio
-0.69
wand
-0.67
ly
-0.65
esti
-0.65
iſt
-0.65
Hame
-0.64
DUN
-0.64
POSITIVE LOGITS
}$
1.27
}}$
1.25
}$
1.22
$)$
1.20
})$
1.17
]$
1.16
]}$
1.16
)}$
1.14
\}$
1.14
}]$
1.13
Activations Density 0.180%