INDEX
Explanations
references to luck
occurrences of the word "luck."
New Auto-Interp
Negative Logits
ainers
-0.74
ĨĴ
-0.65
sis
-0.65
helle
-0.65
rients
-0.64
arag
-0.63
issions
-0.62
bern
-0.61
pta
-0.60
anon
-0.59
POSITIVE LOGITS
luck
0.98
charms
0.92
istically
0.83
iest
0.81
charm
0.80
flix
0.77
iness
0.75
lucky
0.74
idious
0.74
fully
0.72
Activations Density 0.009%