INDEX
Explanations
words related to overcoming challenges and perseverance
phrases that express contrast or contradiction
New Auto-Interp
Negative Logits
actionDate
-0.75
çͰ
-0.73
ãĤ®
-0.69
ãĥĨ
-0.66
WIND
-0.65
UD
-0.64
Moon
-0.62
soDeliveryDate
-0.62
ghetto
-0.60
EO
-0.60
POSITIVE LOGITS
fortunately
1.25
luckily
1.24
nonetheless
1.18
nevertheless
1.16
thankfully
1.08
hey
0.96
positives
0.87
suffice
0.84
tolerated
0.83
relent
0.79
Activations Density 0.393%