INDEX
Explanations
references to familial relationships and marriages
NEGATIVE LOGITS
itſelf        -1.06
Theſe         -0.96
Efq           -0.92
myſelf        -0.89
themſelves    -0.88
ſche          -0.86
виправивши    -0.85
Beſ           -0.85
Houſe         -0.84
Anſ           -0.84
POSITIVE LOGITS
S             0.47
EndContext    0.47
con           0.46
nhiêu         0.44
E             0.43
J             0.43
Le            0.43
Mor           0.43
O             0.43
(             0.43
Activations Density: 0.055%