INDEX
Explanations
names of nationalities or cultures
names of various cuisines or types of food
Negative Logits
odder       -0.96
ueller      -0.75
ertodd      -0.75
ividual     -0.74
perty       -0.71
lease       -0.71
abilities   -0.70
olicy       -0.69
vable       -0.69
isconsin    -0.68
Positive Logits
immigrant    1.07
immigrants   1.06
ancestry     1.02
heritage     0.96
oslov        0.92
proverb      0.92
descent      0.90
cuisine      0.90
istani       0.89
apolis       0.83
Activations Density: 0.153%