From Networks to Named Entities and Back Again
Exploring Classical Arabic Isnad Networks
DOI:
https://doi.org/10.25517/jhnr.v8i1.135Keywords:
name disambiguation, network analysis, natural language processing, hadithAbstract
This paper explores new methods for disambiguating the identity of individuals in classical Arabic citations (isnāds) using a network-based approach. After training a model to extract name mentions from classical Arabic, we embed these mentions in vector space using fine-tuned BERT representations and use community detection to infer clusters of coreferent mentions. The best-performing clustering approach reduces error on the CoNLL metric by 30%. Then, as a case study, we examine the problem of determining the number of direct transmitters to Ibn ʿAsākir (d. 1176) in a set of isnāds taken from the 12th century historical text Taʾrīkh Madīnat Dimashq (TMD, History of Damascus), using our method to replicate human judgement.

Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Ryan Muther, David Smith, Sarah Bowen Savant

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.
A Creative Commons Attribution-NoDerivatives 4.0 (CC BY-ND 4.0) license applies to all contents published in JHNR. While articles published on JHNR can be copied by anyone for noncommercial purposes if proper credit is given, all materials are published under an open-access license with authors retaining full and permanent ownership of their work. For details please consult the Open Access and Copyright Notice.