Explanations
phrases expressing the potential for making a positive impact through various actions, such as speaking up, voting, offering support, and sharing feedback
expressions related to influence or making a difference
Negative Logits
Token       Logit
Purg        -0.87
Brun        -0.75
hid         -0.71
BSD         -0.69
olitan      -0.65
Vul         -0.65
ylum        -0.64
Relax       -0.63
Abstract    -0.63
Gleaming    -0.60
Positive Logits
Token         Logit
impact        1.37
influence     1.33
impact        1.29
influencing   1.29
Influence     1.24
impacting     1.18
mattered      1.09
invaluable    1.06
affect        1.06
Impact        1.06
Activations Density 0.568%