INDEX
Explanations
phrases related to personal stories or experiences, especially those involving personal conflict or struggles
Negative Logits
Token        Logit
Travels      -0.65
aw           -0.64
activ        -0.61
eele         -0.59
exerc        -0.55
ize          -0.55
slee         -0.54
advoc        -0.54
conven       -0.54
culminated   -0.54
Positive Logits
Token        Logit
nor          1.91
Nor          1.59
nor          1.57
Instead      1.36
Nor          1.36
Instead      1.27
yet          1.26
Neither      1.20
anymore      1.18
unless       1.13
Activations Density: 2.398%