Law and Corpus Linguistics
   HOME

TheInfoList



OR:

Law and corpus linguistics (LCL) is a new academic sub-discipline that uses large databases of examples of language usage equipped with tools designed by linguists called
corpora Corpus is Latin for "body". It may refer to: Linguistics * Text corpus, in linguistics, a large and structured set of texts * Speech corpus, in linguistics, a large set of speech audio files * Corpus linguistics, a branch of linguistics Music * ...
to better get at the meaning of words and phrases in legal texts (statutes, constitutions, contracts, etc.). Thus, LCL is the application of corpus linguistic tools, theories, and methodologies to issues of legal interpretation in much the same way
law and economics Law and economics, or economic analysis of law, is the application of microeconomic theory to the analysis of law, which emerged primarily from scholars of the Chicago school of economics. Economic concepts are used to explain the effects of law ...
is the application of economic tools, theories, and methodologies to various legal issues.


History

A 2005 law review article by Lawrence Solan noted in passing that corpus linguistics had potential for its application to interpreting legal texts. But the first systematic exploration and advocacy of applying the tools and methodologies of corpus linguistics to legal interpretive questions of law and corpus linguistics came in the fall of 2010, when the BYU Law Review published a note by Stephen Mouritsen, entitled ''The Dictionary is Not a Fortress: Definitional Fallacies and a Corpus-Based Approach to Plain Meaning''. The note argued that dictionaries are the primary linguistic tool used by judges to determine the plain or ordinary meaning of words and phrases, and highlighted the deficiencies of such an approach. In its stead, the note proposed using corpus linguistics. And the note would be later cited by
Adam Liptak Adam Liptak (born September 2, 1960) is an American journalist, lawyer and instructor in law and journalism. He is the Supreme Court correspondent for '' The New York Times''. Liptak has written for '' The New Yorker'', ''Vanity Fair'', '' Rolli ...
in a ''
New York Times ''The New York Times'' (''the Times'', ''NYT'', or the Gray Lady) is a daily newspaper based in New York City with a worldwide readership reported in 2020 to comprise a declining 840,000 paid print subscribers, and a growing 6 million paid ...
'' article on statutory construction. Law and corpus linguistics (LCL) gained greater legitimacy in July 2011 with the first judicial opinion in American history utilizing corpus linguistics to determine the meaning of a legal text: ''In re the Adoption of Baby E.Z.''266 P.3d (Utah 2011). Available at https://www.utcourts.gov/opinions/supopin/InReEZ071911.pdf. In a concurrence in part and in the judgment, Justice Thomas Lee wrote to put forth an alternative ground for the majority's holding—interpreting the phrase "custody determination" by using corpus linguistics. Justice Lee looked at 500 randomized sample sentences from the Corpus of Contemporary American English (COCA) and found that the most common sense of "custody" was in the context of divorce rather than adoption. Further, he found that "custody" is ten times more likely to co-occur (or collocate) with "divorce" than with "adoption". From that evidence Justice Lee concluded that he "would find that the custody proceedings covered by the Act are limited to proceedings resulting in the modifiable custody orders of a divorce", rather than the broader range of custody proceedings. Other jurisprudence and scholarship would follow. In a 2015 concurrence in ''State v. Rasabout'', Justice Lee used a COCA search to determine that "discharge" when used with a firearm (or one of its synonyms) overwhelmingly referred to a single shot rather than emptying the entire magazine of the weapon. And in 2016, four of the five justices joined a footnote in a majority opinion by Justice Lee commending a party for using corpus linguistics in its briefing even though the Court found it unnecessary to resolve the related question. Finally, in 2016 the
Michigan Supreme Court The Michigan Supreme Court is the highest court in the U.S. state of Michigan. It is Michigan's court of last resort and consists of seven justices. The Court is located in the Michigan Hall of Justice at 925 Ottawa Street in Lansing, the sta ...
became the first court to use a linguist-designed corpus in a majority opinion (COCA), with both the majority and the dissent turning to COCA to determine the meaning of the word "information". In 2020, courts desiring to bolster the legal theory of
original intent Original intent is a theory in law concerning constitutional and statutory interpretation. It is frequently used as a synonym for originalism; while original intent is indeed one theory in the originalist family, it has some salient differenc ...
have sought the opportunity to undertake analyses of statutes utilizing corpus linguistics. In a
Ninth Circuit Court of Appeals The United States Court of Appeals for the Ninth Circuit (in case citations, 9th Cir.) is the U.S. federal court of appeals that has appellate jurisdiction over the U.S. district courts in the following federal judicial districts: * District ...
case, Jones, et al. v. Becerra, et al, (9th Cir. Case No. 20-56174), a case involving the
Second Amendment The second (symbol: s) is the unit of time in the International System of Units (SI), historically defined as of a day – this factor derived from the division of the day first into 24 hours, then to 60 minutes and finally to 60 seconds each ...
and the constitutionality of a
California California is a state in the Western United States, located along the Pacific Coast. With nearly 39.2million residents across a total area of approximately , it is the most populous U.S. state and the 3rd largest by area. It is also the m ...
statute which bans the sale of firearms to individuals under the age of 21, a Ninth Circuit panel requested that the parties address three questions: 1) “What is the original public meaning of the Second Amendment phrases: ‘A well regulated Militia’; ‘the right of the people’; and ‘shall not be infringed’? 2) How does the tool of corpus linguistics help inform the determination of the original public meaning of those Second Amendment phrases?” 3) How do the data yielded from corpus linguistics assist in the interpretation of the constitutionality of age-based restrictions under the Second Amendment? As to scholarship, in 2012, Mouritsen followed up his original work with an article in the Columbia Science and Technology Law Review, where he further refined and promoted the use of corpus-based methods for determining questions of legal ambiguity. Additionally, in 2016 two essays and an article on law and corpus linguistics were published. The Yale Law Journal Forum published ''Corpus Linguistics & Original Public Meaning: A New Tool to Make Originalism More Empirical''. Written by Justice Lee and two co-authors, the essay urged originalists to turn to corpus linguistics to improve the rigor and accuracy of originalist scholarship. And in response, the Forum published an essay by Lawrence Solan (a Brooklyn Law professor with a PhD in linguistics), ''Can Corpus Linguistics Help Make Originalism Scientific?'' The Boston University Public Interest Law Journal published ''The Merciful Corpus: The Rule of Lenity, Ambiguity and Corpus Linguistics'' by Daniel Ortner. In the article Ortner applied corpus linguistics to determining whether sufficient ambiguity exists to trigger the rule of lenity in five Supreme Court cases. Looking forward, in 2017 two more articles are slated for publication. Lee Strang focuses on corpus linguistics and originalism in the U.C. Davis Law Review, and Lawrence Solan and Tammy Gales explore corpus linguistics in the context of finding ordinary meaning in statutory interpretation in the International Journal of Legal Discourse. Lawyers and journalists have also taken notice of corpus linguistics at it relates to the law. In 2010, Neal Goldfarb filed the first known brief in the Supreme Court using corpus linguistics (COCA) to determine whether the ordinary meaning of "personal" referred to corporations in the case '' FCC v. AT&T''. The amicus brief looked at the top collocates (words that co-occur) of "personal" in COHA as well as BYU's Time Magazine Corpus. And writing for
The Atlantic ''The Atlantic'' is an American magazine and multi-platform publisher. It features articles in the fields of politics, foreign affairs, business and the economy, culture and the arts, technology, and science. It was founded in 1857 in Boston, ...
,
Ben Zimmer Benjamin Zimmer (born 1971) is an American linguist, lexicographer, and language commentator. He is a language columnist for ''The Wall Street Journal'' and contributing editor for ''The Atlantic''. He was formerly a language columnist for ''The ...
took note of this new trend, referring to corpus linguistics in the courts as "Like Lexis on Steroids". On the academic front, in 2013 BYU Law School started the first class on law and corpus linguistics, co-taught by Mouritsen, Lee, and (now Dean)
Gordon Smith Gordon Smith may refer to: In politics *Gordon H. Smith (born 1952), former U.S. Senator from Oregon, and current Area Authority for the LDS Church * Gordon Elsworth Smith (1918–2005), Canadian politician * Gordon Smith (academic) (1927–2009), ...
. The class is currently in its fourth year. And in February 2016, BYU Law School hosted the inaugural conference on LCL, with over two dozen legal and linguistic scholars from around the country discussing and debating the next steps forward for the growing academic movement. A second conference is scheduled for February 2017. At the conference BYU Law School announced its plans and progress on the Corpus of Founding Era American English (COFEA), a corpus that will cover 1760–1799.See "Current Projects"
/ref> To date 120 million words have been collected from founding era letters, diaries, newspapers, non-fiction books, fiction, sermons, speeches, debates, legal cases, and other legal materials.


References

{{reflist Corpus linguistics