Explanations
words related to societal and political issues, such as political debates, human rights, and controversial topics
discussions surrounding political issues
Negative Logits
ammers      -0.82
urses       -0.81
ramid       -0.80
uner        -0.77
ramids      -0.77
ittle       -0.73
ellow       -0.72
ãĥ³ãĤ¸      -0.72
glas        -0.72
ongyang     -0.71
Positive Logits
flared       0.88
confronting  0.88
raised       0.86
relating     0.81
facing       0.79
pertaining   0.79
arising      0.78
affecting    0.78
plag         0.78
resolutions  0.77
Activations Density: 0.050%