INDEX
Explanations
references to charitable actions and fundraising activities
Negative Logits
ymoon      -0.16
DMIN       -0.16
zel        -0.15
_closure   -0.15
zzo        -0.14
izzo       -0.13
eton       -0.13
xit        -0.13
-radius    -0.13
floating   -0.13
Positive Logits
charities       0.42
charity         0.39
causes          0.30
charitable      0.29
organizations   0.28
Charity         0.27
organisations   0.23
children        0.23
Char            0.23
char            0.23
Activations Density: 0.246%