INDEX
Explanations
phrases involving written documents or agreements
instances of the word "written" in various contexts
Negative Logits
tics      -0.85
ertodd    -0.84
alian     -0.82
alos      -0.81
apo       -0.81
nels      -0.81
cius      -0.81
olit      -0.80
rir       -0.78
hov       -0.77
Positive Logits
agreement          1.06
submission         1.00
correspondence     0.99
message            0.95
consent            0.95
presentation       0.95
acknowledgement    0.94
communication      0.92
acknowledgment     0.91
permission         0.91
Activations Density: 0.101%