INDEX
Explanations
generous and selfless giving
New Auto-Interp
Negative Logits
皎
0.68
One
0.63
ak
0.63
واحد
0.63
واحدة
0.62
지가
0.61
其他
0.60
الإنسان
0.60
म
0.60
इतर
0.60
POSITIVE LOGITS
excretion
0.77
currency
0.75
money
0.72
lavish
0.72
donation
0.70
spending
0.68
shower
0.67
hoax
0.64
excre
0.64
secretion
0.64
Activations Density 0.063%