INDEX
Explanations
mentions of individuals being members of a specific group or committee
references to membership in various groups or organizations
New Auto-Interp
Negative Logits
enegger
-0.80
bows
-0.70
urses
-0.65
ources
-0.62
uras
-0.62
anche
-0.61
veyard
-0.61
ricanes
-0.61
obiles
-0.60
asions
-0.59
POSITIVE LOGITS
hips
0.97
of
0.87
board
0.87
thereof
0.79
member
0.77
doms
0.76
holder
0.75
boards
0.73
ridge
0.70
Koen
0.69
Activations Density 0.037%