INDEX
Explanations
phrases indicating significant emotional or legal situations related to familial relationships
Negative Logits
Token         Logit
borg          -0.14
ãĤ±ãĥĥãĥĪ     -0.14
sel           -0.13
Stam          -0.13
elsey         -0.13
yaparak       -0.13
nyder         -0.13
ople          -0.12
SEL           -0.12
_SF           -0.12
Positive Logits
Token         Logit
there         0.23
there         0.18
thì           0.17
we            0.16
we            0.15
713           0.15
untu          0.14
åīĩ           0.14
214           0.14
od            0.14
Activations Density 0.542%