INDEX
Explanations
phrases indicating the speaker's viewpoint or explanation
phrases indicating opinions or assertions from various sources
New Auto-Interp
Negative Logits
cffffcc
-0.76
empt
-0.73
plug
-0.68
acts
-0.63
enriched
-0.63
bailed
-0.63
acad
-0.63
riched
-0.63
aven
-0.63
icates
-0.62
POSITIVE LOGITS
Polly
0.73
Compass
0.73
historian
0.72
Jonathan
0.71
NYT
0.70
Stef
0.69
pace
0.69
Mattis
0.68
Joel
0.68
Laura
0.68
Activations Density 0.049%