INDEX
Explanations
phrases related to harmful incidents or accidents involving individuals
references to male individuals and their actions or experiences
Negative Logits
endif         -0.89
peak          -0.78
Interest      -0.69
Dominion      -0.68
soType        -0.62
Crusade       -0.61
icion         -0.59
Revolution    -0.59
Rapt          -0.59
problem       -0.58
Positive Logits
'd              1.11
underwent       1.03
unsuccessfully  1.00
participated    1.00
feared          0.99
witnessed       0.97
received        0.96
testified       0.93
alleges         0.92
encountered     0.91
Activations Density 0.229%