INDEX
Explanations
references to specific legal clauses or provisions
legal clauses
New Auto-Interp
Negative Logits
getN
-0.50
wouldn
-0.47
doesn
-0.44
Amer
-0.43
Sher
-0.43
Watertown
-0.42
Kristi
-0.42
Histor
-0.42
getR
-0.41
didn
-0.41
POSITIVE LOGITS
Clause
1.11
clause
1.10
Clause
1.05
clause
1.04
clauses
0.94
Clauses
0.91
clauses
0.87
Phrase
0.65
Clau
0.63
Clubhouse
0.62
Activations Density 0.005%