INDEX
Explanations
phrases related to personal struggles or challenges
expressions of frustration or dissatisfaction
New Auto-Interp
Negative Logits
ppings
-0.64
geries
-0.64
Bowen
-0.58
Chao
-0.56
subsequent
-0.54
Combine
-0.53
repeated
-0.53
rompt
-0.53
ries
-0.52
Milton
-0.52
POSITIVE LOGITS
âĢ
1.29
âĺ
1.26
ðŁij
1.15
âľ
1.10
âĿ
1.08
ðŁij
1.05
.ãĢį
1.05
ðŁ
1.04
â
1.01
âĢ
1.00
Activations Density 0.373%