INDEX
Explanations
terms related to political and social power dynamics
New Auto-Interp
Negative Logits
ĻĤ
-0.60
++++++++++++++++
-0.59
¶
-0.58
MSN
-0.57
ibrary
-0.55
ãĤ·ãĥ£
-0.55
Profile
-0.54
©¶æ
-0.52
à¥
-0.52
********************************
-0.52
POSITIVE LOGITS
thereof
1.02
thereto
1.01
alike
1.01
consequ
0.98
thereafter
0.92
versa
0.92
respectively
0.91
therein
0.89
subsequent
0.88
consequently
0.83
Activations Density 0.646%