INDEX
Explanations
references to cultural or religious holidays and their significance
New Auto-Interp
Negative Logits
stile
-0.15
ubu
-0.14
holidays
-0.14
ayet
-0.14
Sinai
-0.14
PTR
-0.13
odate
-0.13
ikers
-0.13
rox
-0.13
è¯
-0.13
POSITIVE LOGITS
decor
0.24
exchanging
0.22
decorations
0.21
exchange
0.19
cards
0.19
Decor
0.19
decor
0.18
decoration
0.18
Decor
0.18
decorated
0.18
Activations Density 0.065%