INDEX
Explanations
references to people in a specific group that the document's author is part of
references to peers and contemporaries in various contexts
New Auto-Interp
Negative Logits
rament
-0.69
ruction
-0.68
Jew
-0.60
Werewolf
-0.60
trap
-0.59
TRUMP
-0.59
Salvation
-0.59
Arena
-0.59
Sold
-0.58
lot
-0.58
POSITIVE LOGITS
peers
4.21
peer
2.10
contemporaries
2.04
classmates
1.96
counterparts
1.81
colleagues
1.77
brethren
1.58
peer
1.49
superiors
1.48
mentors
1.48
Activations Density 0.009%