INDEX
Explanations
phrases expressing feelings of luck or fortunate circumstances
NEGATIVE LOGITS
iaux        -0.18
enda        -0.16
tent        -0.16
iginal      -0.14
alion       -0.14
елеф        -0.14
ーネ        -0.14
utas        -0.13
IData       -0.13
kategori    -0.13
POSITIVE LOGITS
enough      0.24
timing      0.23
charms      0.22
Timing      0.20
lucky       0.20
charm       0.20
indeed      0.19
恵          0.19
fortunate   0.18
Enough      0.18
Activations Density: 0.025%