Explanations
phrases or terms that involve written communication or documentation
references to written documentation or formal statements
Negative Logits
tics      -0.86
olen      -0.84
rir       -0.81
aho       -0.80
alos      -0.79
ertodd    -0.78
olit      -0.78
alian     -0.78
Ĭ±        -0.76
alach     -0.76
Positive Logits
statement          1.18
declaration        1.12
explanation        1.10
agreement          1.09
invitation         1.09
apology            1.09
acknowledgment     1.08
acknowledgement    1.05
indication         1.02
communication      1.02
Activations Density: 0.068%