INDEX
Explanations
the term "mother" and its related context
New Auto-Interp
Negative Logits
manner
-0.16
istine
-0.15
overall
-0.15
å¤
-0.15
-valu
-0.14
aldo
-0.14
tul
-0.14
yb
-0.14
overall
-0.14
Overall
-0.14
POSITIVE LOGITS
ignon
0.16
0.15
оÑģÑĥд
0.15
untas
0.15
ahi
0.15
pf
0.15
دث
0.15
Unters
0.15
inen
0.15
olas
0.14
Activations Density 0.013%