INDEX
Explanations
references to policies and policy makers
Negative Logits
__':          -0.79
Демографія    -0.77
تقاوى         -0.69
متعلقه        -0.64
]--;          -0.62
+#+#          -0.56
shire         -0.55
NSCoder       -0.54
gainera       -0.53
propOrder     -0.53
Positive Logits
makers        0.91
making        0.89
maker         0.88
formulation   0.71
maker         0.69
Formulation   0.67
makers        0.66
decisions     0.65
holder        0.64
direction     0.63
Activations Density 0.169%