Explanations
references to charitable donations and acts of giving
Negative Logits
ern      -0.16
yll      -0.15
ourmet   -0.15
ingen    -0.14
hev      -0.14
charm    -0.14
ERM      -0.14
ings     -0.14
anc      -0.14
otti     -0.13
Positive Logits
èħ                   0.15
UPER                 0.15
ERRU                 0.15
é¡į                  0.15
itive                0.15
AGED                 0.15
.mozilla             0.14
ApplicationBuilder   0.14
-append              0.14
andom                0.14
Activations Density 0.031%