INDEX
Explanations
references to adoption and associated discussions
New Auto-Interp
Negative Logits
çĬ¯
-0.15
ÑĪÑĮ
-0.14
nackte
-0.14
stro
-0.14
XD
-0.14
_SYN
-0.13
аÑĢод
-0.13
ROID
-0.13
ilton
-0.13
ubb
-0.13
POSITIVE LOGITS
adoption
0.45
adopt
0.42
Adoption
0.42
Adopt
0.33
adopting
0.33
adopted
0.32
adopt
0.31
adopts
0.28
birth
0.25
agency
0.24
Activations Density 0.015%