Explanations
phrases related to fighting, resisting, and taking action for a cause or belief
phrases related to advocacy and fighting for causes
Negative Logits
glances        -0.74
additions      -0.71
averages       -0.70
averaging      -0.67
calculations   -0.67
comments       -0.64
elig           -0.63
applicants     -0.63
Lot            -0.63
flashes        -0.62
Positive Logits
defend         1.61
liberate       1.50
protect        1.45
overthrow      1.42
oppose         1.39
eradicate      1.37
uphold         1.35
preserve       1.35
topple         1.32
restore        1.31
Activations Density: 0.246%