INDEX
Explanations
mentions of commercial activities or contexts
Negative Logits
ascript         -0.78
oha             -0.68
utenberg        -0.68
acea            -0.66
Tro             -0.65
Compl           -0.64
Compensation    -0.64
ruct            -0.63
cffff           -0.62
advertisement   -0.62
Positive Logits
clos            0.66
Hes             0.64
izing           0.63
ginger          0.62
grocer          0.61
courts          0.60
lihood          0.60
FSA             0.59
derog           0.59
ization         0.58
Activations Density: 0.043%