INDEX
Explanations
phrases discussing charity, donations, and their implications
New Auto-Interp
Negative Logits
ismet
-0.15
igon
-0.14
attice
-0.14
.gdx
-0.13
isd
-0.13
afx
-0.13
ymoon
-0.13
'{@-0.13
theid
-0.13
ydk
-0.13
POSITIVE LOGITS
charity
0.62
donation
0.54
charitable
0.54
charities
0.50
philanth
0.49
Charity
0.49
donations
0.48
donating
0.45
fundraising
0.42
donate
0.41
Activations Density 0.021%