Explanations
anthropology Ruth Benedict Edward Sapir
Negative Logits
répondre    0.46
Gao         0.42
ataya       0.41
sız         0.40
Jiangsu     0.40
Peny        0.40
鉛          0.38
Suc         0.37
ця          0.37
Db          0.36
Positive Logits
anthropological    0.67
ethn               0.63
anthropology       0.63
ethnographic       0.63
anthropologists    0.61
anthropologist     0.55
Ethn               0.55
Anthropology       0.54
Ethn               0.54
Anthrop            0.53
Activations Density 0.002%