Publication: Law Smells Detection with Machine Learning
No Thumbnail Available
Open/View Files
Date
2024-05-14
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Dechtiar, Moriya. 2024. Law Smells Detection with Machine Learning. Master's thesis, Harvard University Division of Continuing Education.
Research Data
Abstract
While major advancements have been achieved in many fields utilizing artificial
intelligence for a variety of tasks, some specialized areas remain difficult to tackle. The
legal domain is one such area. It is often said that legal language is a dialect of English
and one that requires a Law degree to be fluent in. In this work we examined parallels
between software engineering and legal drafting to develop definitions for contract
smells, quick indications for potential issues with legal contracts. We created an auto
labelled dataset of these contracts smells using engineered prompts and demonstrated
how even a small set of human labels can significantly improve auto labelling results
with few shots techniques. We demonstrated using bi-directional deep learning models
that these contract smells can indeed be successfully detected automatically with high
accuracy after further fine tuning BERT as well as LegalBert. This work underscores
the feasibility of applying advanced NLP techniques to automate aspects of legal
contract review and provides a strong foundation to further develop models for this
purpose.
Description
Other Available Sources
Keywords
Auto Labeling, BERT, Contract Smells, LegalBert, Machine Learning, Natural Language Processing, Computer science, Law, Artificial intelligence
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service