INDEX
Explanations
mentions of individuals being members of various organizations or groups
references to membership in organizations or groups
New Auto-Interp
Negative Logits
itals
-0.79
eneg
-0.75
ancies
-0.72
itudes
-0.71
rums
-0.71
urers
-0.69
asks
-0.67
ources
-0.65
ankind
-0.65
iasm
-0.64
POSITIVE LOGITS
of
1.17
OF
0.92
OF
0.92
thereof
0.87
Of
0.87
Of
0.82
Marginal
0.67
Joined
0.67
of
0.64
maker
0.63
Activations Density 0.101%