INDEX
Explanations
references to luck or being lucky
New Auto-Interp
Negative Logits
hibit
-0.68
upaten
-0.60
<<<<<<<<<<<<<<
-0.59
hibition
-0.58
entar
-0.58
tellten
-0.57
esehen
-0.56
acchar
-0.55
ъем
-0.54
pham
-0.53
POSITIVE LOGITS
luck
2.98
LUCK
2.75
lucky
2.61
Luck
2.60
luck
2.46
Luck
2.44
lucky
2.39
Lucky
2.23
Lucky
2.11
LUCK
2.11
Activations Density 0.069%