INDEX
Explanations
occurrences of the word "bir" or its variations, which relate to birth or biracial topics
New Auto-Interp
Negative Logits
esan
-0.16
aos
-0.15
Dispose
-0.15
Sed
-0.15
issing
-0.15
CELL
-0.14
ncia
-0.14
ti
-0.14
tem
-0.14
edy
-0.14
POSITIVE LOGITS
thing
0.26
acial
0.26
ken
0.20
git
0.20
ational
0.19
foon
0.18
thers
0.18
bir
0.18
ght
0.18
dbl
0.17
Activations Density 0.005%