INDEX
Explanations
phrases or sentences related to official statements or declarations
instances of frustration or criticism regarding certain actions or decisions
Negative Logits
)?      -0.79
').     -0.76
)--     -0.75
!).     -0.73
?).     -0.72
}.      -0.70
)!      -0.69
!'      -0.67
?),     -0.66
.--     -0.65
Positive Logits
"…              0.80
"               0.78
"...            0.75
"'              0.71
"[              0.67
Blumenthal      0.66
wcs             0.66
underestimated  0.65
"#              0.62
misunderstood   0.62
Activations Density: 1.603%