INDEX
Explanations
terms related to non-profit organizations or movements
references to non-profit organizations or entities
New Auto-Interp
Negative Logits
flush
-0.81
tack
-0.80
lav
-0.72
touch
-0.69
digging
-0.68
gut
-0.67
rush
-0.67
bricks
-0.67
trooper
-0.67
heart
-0.66
POSITIVE LOGITS
Non
2.94
Non
1.86
NON
1.34
Semi
1.15
Female
1.06
Negative
1.06
Minor
1.02
Same
1.01
Partial
1.01
non
1.01
Activations Density 0.021%