New Paper Just Published: Natural Language Processing in the Legal Domain – Date Written: January 24, 2023


Daniel Martin Katz

Illinois Tech – Chicago Kent College of Law; Bucerius Center for Legal Technology & Data Science; Stanford CodeX – The Center for Legal Informatics; 273 Ventures

Dirk Hartung

Bucerius Law School – Center for Legal Technology and Data Science; Stanford University – Stanford Codex Center

Lauritz Gerlach

Bucerius Law School

Abhik Jana

University of Hamburg; Language Technology Group, Department of Informatics, Universität Hamburg

Michael James Bommarito

273 Ventures; Licensio, LLC; Bommarito Consulting, LLC; Michigan State College of Law; Stanford Center for Legal Informatics

Date Written: January 24, 2023


In this paper, we summarize the current state of the field of NLP and Law with a specific focus on recent technical and substantive developments. To support our analysis, we construct and analyze a corpus of more than six hundred NLP and Law related papers published over the past decade. Our analysis highlights several major trends. Namely, we document an increasing number of papers written, tasks undertaken, and languages covered over the course of the past decade. We observe an increase in the sophistication of the methods which researchers deployed in this applied context. Slowly but surely, Legal NLP is beginning to match the methodological sophistication of general NLP. We believe this to be a positive trend for the future of the field, but many questions in both the academic and commercial sphere still remain open.


Keywords: Legal Data, NLP, NLProc, Legal NLP, Legal Analytics, Evaluation, AI, Legal Engineering, Legal Science, Engineering, Neural NLP, Legal Tech

JEL Classification: C45, C55, K49, O33, O30

Katz, Daniel Martin and Hartung, Dirk and Gerlach, Lauritz and Jana, Abhik and Jana, Abhik and Bommarito, Michael James, Natural Language Processing in the Legal Domain (January 24, 2023). Available at SSRN: