INDEX
Explanations
references to fate and its impact on individuals and their circumstances
New Auto-Interp
Negative Logits
ekil
-0.17
voks
-0.17
ynthia
-0.16
ibold
-0.15
票
-0.14
iggins
-0.14
ynamo
-0.14
uegos
-0.14
Nation
-0.14
oningen
-0.14
POSITIVE LOGITS
fully
0.17
Fate
0.16
lessly
0.16
une
0.15
LAB
0.15
happ
0.15
destiny
0.14
ure
0.14
unes
0.14
ubl
0.14
Activations Density 0.015%