INDEX
Explanations
mentions of political news or social events
topics related to politics, social issues, and civil rights
New Auto-Interp
Negative Logits
oneself
-0.63
ardless
-0.53
anwhile
-0.53
ividually
-0.52
orks
-0.51
Azerb
-0.51
cair
-0.50
hovah
-0.50
ichever
-0.49
vertising
-0.47
POSITIVE LOGITS
brethren
0.72
counterparts
0.67
counterpart
0.65
woes
0.62
holdings
0.61
arsenal
0.60
iest
0.57
portfolio
0.55
buddies
0.55
career
0.54
Activations Density 0.564%