INDEX
Explanations
quantifiable data related to donations and economic metrics
New Auto-Interp
Negative Logits
ider
-0.17
tr
-0.16
lec
-0.15
called
-0.14
ga
-0.14
dipl
-0.14
her
-0.14
idge
-0.14
-
-0.14
family
-0.14
POSITIVE LOGITS
Glover
0.16
润
0.15
ê³µ
0.15
akter
0.15
ìĭľìĺ¤
0.15
flix
0.15
moden
0.15
prov
0.15
nrw
0.14
stup
0.14
Activations Density 0.135%