INDEX
Explanations
actions related to protection, sacrifice, and rescue
New Auto-Interp
Negative Logits
gebra
-0.69
accelerator
-0.68
solicitation
-0.67
venants
-0.66
slang
-0.65
Wave
-0.63
OUT
-0.63
visual
-0.62
quizz
-0.62
Analy
-0.62
POSITIVE LOGITS
dignity
1.11
unborn
1.05
humankind
1.05
taxpayers
1.03
endangered
0.99
grandchildren
0.97
livelihood
0.97
innoc
0.96
mankind
0.95
innocent
0.94
Activations Density 3.007%