INDEX
Explanations
references to family and relationships
New Auto-Interp
Negative Logits
Friend
-0.15
deniz
-0.14
ÑģÑĤоÑĢ
-0.14
ugi
-0.14
iben
-0.14
rescia
-0.14
ancestor
-0.13
754
-0.13
ahir
-0.13
ANJI
-0.13
POSITIVE LOGITS
whom
0.23
ages
0.22
grown
0.21
adopted
0.21
attend
0.20
twins
0.17
named
0.17
attending
0.17
(named
0.17
age
0.17
Activations Density 0.179%