Explanations
references to the birth or origin of individuals
instances of the word "born."
Negative Logits
tiss        -0.74
phis        -0.74
awaru       -0.73
uyomi       -0.70
psey        -0.69
acco        -0.68
olicy       -0.67
dden        -0.67
eredith     -0.66
vernment    -0.66
Positive Logits
lings       0.88
born        0.87
born        0.82
forms       0.77
days        0.75
Born        0.74
ゴ          0.73
abad        0.70
birth       0.70
ック        0.69
Activations Density: 0.017%