INDEX
Explanations
statements made in formal settings, specifically within legal and governmental contexts
New Auto-Interp
Negative Logits
underestimate
-0.85
lately
-0.82
underestimated
-0.79
mismatch
-0.77
setups
-0.77
monop
-0.76
landscape
-0.76
unpredict
-0.75
sideways
-0.75
sometimes
-0.74
POSITIVE LOGITS
Statement
0.96
Refer
0.92
<|endoftext|>
0.90
Refer
0.89
Shares
0.88
Comment
0.84
Additional
0.83
"...
0.83
Additionally
0.82
Accessed
0.82
Activations Density 0.361%