INDEX
Explanations
mentions of support and advocacy in political and social contexts
New Auto-Interp
Negative Logits
REA
-0.13
rupa
-0.13
azzo
-0.13
ewidth
-0.13
ãģĵãĤĵ
-0.12
exus
-0.12
_guid
-0.12
ÑģÑĤÑĢаÑħ
-0.12
_periods
-0.12
sembled
-0.12
POSITIVE LOGITS
cause
0.78
causes
0.67
Cause
0.63
cause
0.63
Cause
0.55
Causes
0.53
causa
0.50
causing
0.40
caused
0.36
efforts
0.35
Activations Density 0.249%