Explanations
descriptive phrases related to a negative assessment or criticism
Negative Logits
catentry            -0.89
Recommend           -0.79
ItemImage           -0.77
ourses              -0.76
ructure             -0.75
Import              -0.75
ilitation           -0.74
isSpecialOrderable  -0.73
ODUCT               -0.72
Initially           -0.69
Positive Logits
gigg    1.16
dudes   1.07
jokes   1.06
boobs   1.05
nerds   1.05
dick    1.05
shit    1.03
fries   1.02
booze   1.02
crap    1.01
Activations Density: 6.886%