INDEX
Explanations
names of notable individuals and organizations mentioned in news articles
mentions of important public figures and their actions
Negative Logits
Newsletter       -1.02
SPONSORED        -1.02
'';              -0.78
Tsukuyomi        -0.77
Advertisement    -0.72
anyways          -0.67
};               -0.65
};               -0.65
âĨij             -0.65
';               -0.64
Positive Logits
has              1.03
believes         0.87
wants            0.83
says             0.82
warns            0.80
intends          0.79
denies           0.79
alleges          0.78
accuses          0.76
may              0.75
Activations Density: 0.269%