INDEX
Explanations
statements of explanation or justification
the presence of specific verb forms that indicate ongoing or significant actions or states
Negative Logits
Token         Logit
Highlights    -0.76
icter         -0.76
Cosponsors    -0.74
regrets       -0.72
vantage       -0.70
racuse        -0.69
osion         -0.68
til           -0.68
Cause         -0.68
cause         -0.66
Positive Logits
Token            Logit
technically      1.06
ostensibly       1.01
purely           0.93
essentially      0.92
supposed         0.91
already          0.89
inherently       0.88
geographically   0.88
predominantly    0.87
strictly         0.87
Activations Density: 0.298%