INDEX
Explanations
phrases related to charitable giving and donations
New Auto-Interp
Negative Logits
yll
-0.16
wan
-0.16
å½¹
-0.15
ourmet
-0.15
otti
-0.15
aber
-0.14
Strand
-0.14
hev
-0.14
ern
-0.14
ic
-0.14
POSITIVE LOGITS
ÄįnÄĽ
0.15
ERRU
0.15
IMITIVE
0.15
-append
0.15
Dod
0.15
itive
0.14
UPER
0.14
é¡į
0.14
.mozilla
0.14
Äįka
0.14
Activations Density 0.026%