INDEX
Explanations
references to legal infractions and violations
words related to infractions or violations of laws and codes
Negative Logits
lining          -0.70
soDeliveryDate  -0.66
kn              -0.65
building        -0.65
ned             -0.65
nas             -0.64
wills           -0.64
itability       -0.63
ning            -0.62
anas            -0.61
Positive Logits
raction     0.97
ractions    0.88
Citation    0.83
oons        0.69
Extras      0.69
Forbidden   0.68
citation    0.67
Jagu        0.67
terness     0.67
ij士        0.67
Activations Density 0.016%