INDEX
Explanations
elements related to personal relationships and family dynamics
New Auto-Interp
Negative Logits
becoming
-0.19
bec
-0.18
worden
-0.17
becomes
-0.17
ingleton
-0.16
wordt
-0.16
uncio
-0.15
Lester
-0.15
shouldBe
-0.15
Become
-0.15
POSITIVE LOGITS
dar
0.17
LEN
0.17
ann
0.16
abb
0.16
marketplace
0.15
OnError
0.14
abl
0.14
anh
0.14
ford
0.14
wash
0.14
Activations Density 0.023%