INDEX
Explanations
references to policies or regulations
New Auto-Interp
Negative Logits
nacionalidad
-0.38
AssemblyTitle
-0.36
oredCriteria
-0.34
binatang
-0.32
olduk
-0.31
olup
-0.30
hewan
-0.30
tilbake
-0.30
potrze
-0.30
suaminya
-0.30
POSITIVE LOGITS
policy
2.05
policy
1.85
Policy
1.82
Policy
1.78
POLICY
1.77
policies
1.70
Policies
1.59
Policies
1.55
POLICY
1.52
policies
1.50
Activations Density 0.050%