INDEX
Explanations
phrases related to political and social issues, particularly focusing on the topics of justice, government systems, and societal struggles
New Auto-Interp
Negative Logits
"""
-0.83
cade
-0.83
arya
-0.78
ĺħ
-0.74
eday
-0.73
Journals
-0.73
Lessons
-0.73
Voy
-0.73
Majority
-0.72
anmar
-0.72
POSITIVE LOGITS
spaced
0.97
overlapping
0.85
protected
0.84
constructed
0.84
insignificant
0.82
combust
0.81
shaped
0.80
disparate
0.80
diverse
0.80
cohesive
0.78
Activations Density 7.487%