INDEX
Explanations
acronyms or specific keywords
the name of a specific brand or company
Negative Logits
yip         -0.77
udeb        -0.71
maid        -0.67
beware      -0.67
punitive    -0.66
quartered   -0.66
ename       -0.66
aukee       -0.65
isconsin    -0.65
license     -0.64
Positive Logits
<           0.97
crow        0.85
Rubin       0.79
Ches        0.69
Kraft       0.63
<+          0.63
Hein        0.62
Ceres       0.62
ATL         0.62
Mets        0.61
Activations Density 0.000%