Explanations
references to familial relationships, particularly concerning grandchildren and nephews
Negative Logits
dr      -0.59
stay    -0.57
buy     -0.55
Bab     -0.55
खरी     -0.51
bes     -0.51
Mom     -0.49
δου     -0.48
der     -0.47
بوابة   -0.47
Positive Logits
grandchildren   1.43
grandkids       1.29
grandchild      1.25
grandchildren   1.22
grandsons       1.05
granddaughter   1.02
grandson        0.95
日閲覧          0.85
kaynağından     0.84
nephews         0.81
Activations Density: 0.022%