INDEX
Explanations
terms related to alleviating issues, particularly poverty and health problems, via social justice initiatives
New Auto-Interp
Negative Logits
freely
-0.70
cringe
-0.66
Origin
-0.65
boldly
-0.65
Odin
-0.65
ultras
-0.65
oats
-0.64
passionately
-0.62
unprotected
-0.62
blindly
-0.61
POSITIVE LOGITS
uating
1.27
icating
1.16
uate
1.12
uated
1.11
uates
1.10
ving
1.08
iating
1.06
pling
1.06
inished
1.05
ating
1.05
Activations Density 0.079%