INDEX
Explanations
terms related to justice, fairness, and compassion
themes related to justice, fairness, and inclusivity in societal contexts
New Auto-Interp
Negative Logits
aunts
-0.96
hops
-0.87
planes
-0.85
events
-0.82
steps
-0.82
andals
-0.80
posts
-0.80
casts
-0.80
stones
-0.79
notes
-0.78
POSITIVE LOGITS
environment
1.17
atmosphere
1.09
solution
1.06
relationship
1.04
outlook
1.02
outcome
1.02
approach
1.00
attitude
0.97
response
0.96
system
0.94
Activations Density 0.319%