INDEX
Explanations
references to actions, conditions, or attributes related to products and their specifications
after "be" or "being"
classifications or states
New Auto-Interp
Negative Logits
intermédiaire
-0.65
illustrazione
-0.63
sufficiente
-0.63
Lordships
-0.62
suprême
-0.62
alternativo
-0.61
rând
-0.59
quelcon
-0.59
normaux
-0.59
titolata
-0.59
POSITIVE LOGITS
vegetarian
0.79
non
0.78
female
0.77
male
0.73
UnusedPrivate
0.71
unisex
0.69
vegan
0.67
bilingual
0.66
FEMALE
0.64
Vegetarian
0.63
Activations Density 0.883%