INDEX
Explanations
references to same-sex relationships and marriage equality
United States, parents, bill, dollar
New Auto-Interp
Negative Logits
reuung
-0.48
brengen
-0.47
genodigd
-0.47
coloridos
-0.44
ędziesz
-0.44
rachtet
-0.41
stoel
-0.41
esserung
-0.41
erba
-0.41
kamers
-0.40
POSITIVE LOGITS
Italijani
0.65
Autoritní
0.59
""],
0.53
Савезне
0.49
متعلقه
0.48
***!
0.47
Diwedd
0.45
jspb
0.45
SequentialGroup
0.45
تكبرها
0.45
Activations Density 0.046%