Explanations
words related to gift-giving and appreciation
Negative Logits
Token         Logit
tics          -0.77
idents        -0.73
Occupations   -0.70
utsu          -0.67
arios         -0.65
ANE           -0.64
666           -0.64
ane           -0.62
odan          -0.62
tera          -0.62
Positive Logits
Token         Logit
giving        1.17
baskets       0.99
gift          0.98
gifts         0.97
recipient     0.94
basket        0.94
bestowed      0.91
recipients    0.90
wra           0.83
Gift          0.82
Activations Density: 0.031%
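
The source does not define the density figure. One common reading is the fraction of token positions on which the feature activates at all. A minimal Python sketch under that assumption follows; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [0.8, 1.2, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.031% would mean the feature fires on roughly 3 in every 10,000 tokens, which fits a narrow feature tied to gift-related words.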