INDEX
Explanations
phrases related to legal or procedural statements
occurrences of the article "a"
Negative Logits
lines      -0.69
lasses     -0.66
pter       -0.66
chart      -0.65
lins       -0.64
bite       -0.63
LIN        -0.62
erver      -0.62
Orlando    -0.62
reports    -0.61
Positive Logits
usterity   1.47
cknow      0.96
irst       0.95
esthetic   0.95
verages    0.88
couple     0.85
ption      0.85
lot        0.84
qua        0.84
ird        0.83
Activations Density: 0.082%