INDEX
Explanations
questions starting with "Why" followed by a verb in past tense
questions about political action and social relevance
New Auto-Interp
Negative Logits
disg
-0.61
EStreamFrame
-0.59
marrow
-0.58
confir
-0.55
illet
-0.54
redeem
-0.54
glim
-0.53
redeemed
-0.53
qt
-0.53
breeze
-0.52
POSITIVE LOGITS
instead
1.10
Matters
0.99
instead
0.98
?:
0.94
rather
0.91
so
0.85
anymore
0.85
nowadays
0.80
rather
0.80
Instead
0.80
Activations Density 0.350%