INDEX
Explanations
references to luck or fortunate circumstances
references to feeling fortunate or in a privileged position
Negative Logits
Token     Logit
andise    -0.73
anon      -0.72
helle     -0.69
aton      -0.69
arer      -0.69
owe       -0.68
adish     -0.68
hemat     -0.67
along     -0.65
alach     -0.64
Positive Logits
Token          Logit
enough         1.07
lucky          0.98
fortunate      0.95
circumstance   0.84
unlucky        0.83
charms         0.83
few            0.79
circumstances  0.76
luck           0.75
charm          0.69
Activations Density: 0.052%