INDEX
Explanations
biographical information about individuals, specifically details about their origins and family
Negative Logits
abis         -0.14
ross         -0.13
BI           -0.13
lider        -0.13
ocs          -0.13
اید          -0.13
orate        -0.12
σχ           -0.12
richt        -0.12
Wikispecies  -0.12
Positive Logits
born         0.44
raised       0.43
native       0.43
raised       0.38
born         0.38
Raised       0.36
natives      0.36
native       0.36
originally   0.35
Born         0.35
Activations Density 0.170%