INDEX
Explanations
mentions of birth-related terms and concepts
references to birth and related documentation
Negative Logits
eredith      -0.83
ornings      -0.79
atile        -0.77
vernment     -0.72
aunder       -0.72
olulu        -0.72
ourning      -0.70
awaru        -0.70
;;;;;;;;     -0.69
Flavoring    -0.69
Positive Logits
days         1.26
birth        0.94
canal        0.94
rate         0.93
stones       0.88
flies        0.87
date         0.83
stone        0.82
forms        0.82
marks        0.80
Activations Density: 0.020%