INDEX
Explanations
phrases indicating the age of individuals, particularly in years
New Auto-Interp
Negative Logits
urry
-0.15
GGLE
-0.15
onica
-0.15
.twimg
-0.14
urr
-0.14
езд
-0.14
isse
-0.14
ÙĨدگÛĮ
-0.14
اÙĨات
-0.14
nette
-0.14
POSITIVE LOGITS
enu
0.16
hete
0.16
Char
0.14
pok
0.14
Lunch
0.14
-fashioned
0.14
gratuiti
0.13
picker
0.13
TObject
0.13
lunch
0.13
Activations Density 0.024%