INDEX
Explanations
phrases related to conflict, betrayal, and subjective value
significant contrasts or shifts in tone within narratives
New Auto-Interp
Negative Logits
withdrawal
-0.80
withdraw
-0.79
sacked
-0.78
hement
-0.77
pensions
-0.75
abwe
-0.73
illance
-0.73
exchanged
-0.72
legally
-0.70
bloc
-0.70
POSITIVE LOGITS
Favorite
1.17
Advertisement
1.12
Episode
1.04
Anyway
1.00
advertisement
0.95
Fun
0.91
SPONSORED
0.90
Includes
0.89
Recommended
0.89
Related
0.89
Activations Density 0.448%