INDEX
Explanations
direct quotations
quotation marks or speech indicators in the text
Negative Logits
accomp              -0.85
carrier             -0.84
removable           -0.82
cleanup             -0.81
adjud               -0.78
disproportionately  -0.77
cubic               -0.76
dominate            -0.75
flared              -0.75
replacement         -0.75
Positive Logits
We          1.46
Absolutely  1.44
There       1.42
I           1.41
Whoever     1.40
It          1.38
If          1.36
Obviously   1.36
You         1.35
Personally  1.34
Activations Density: 0.126%