INDEX
Explanations
anomalies or discrepancies within a text, as indicated by non-standard characters
instances of contextually significant quotes or statements about various topics
Negative Logits
commissions     -0.74
cens            -0.71
enlist          -0.70
censored        -0.69
honored         -0.69
seams           -0.69
joint           -0.68
wound           -0.68
discontinued    -0.67
subsidized      -0.67
Positive Logits
"[              1.45
"(              1.44
"               1.33
"'              1.28
Refer           1.24
READ            1.24
Read            1.23
Advertisement   1.23
Shape           1.21
Article         1.21
Activations Density 0.108%