INDEX
Explanations
mentions of people holding specific positions or affiliations in various organizations
articles frequently associated with notable figures and their roles or attributes
New Auto-Interp
Negative Logits
andon
-0.66
mares
-0.64
oops
-0.63
fixes
-0.63
ifts
-0.63
words
-0.62
osity
-0.62
anism
-0.62
idi
-0.62
girls
-0.60
POSITIVE LOGITS
member
1.25
contributor
1.12
supporter
1.09
proponent
1.07
fixture
1.07
participant
1.06
frequent
1.05
trustee
1.03
collaborator
0.98
shareholder
0.98
Activations Density 0.159%