INDEX
Explanations
phrases and terms related to generosity and support in various contexts
New Auto-Interp
Negative Logits
eday
-0.16
orro
-0.16
si
-0.15
oplan
-0.15
ÑĨеÑģ
-0.15
ternet
-0.15
indr
-0.14
омен
-0.14
shint
-0.14
313
-0.14
POSITIVE LOGITS
lest
0.15
ifter
0.15
plr
0.15
hetto
0.15
583
0.14
amente
0.14
ëħ
0.13
ยาà¸Ļ
0.13
ously
0.13
liest
0.13
Activations Density 0.016%