About · Provenance
About the hadith data
Every hadith you read here arrives through a chain — from the Prophet ﷺ down through narrators, scholars, editors, and finally the digital corpora we ingest. This page is the short version of that chain.
Corpus: 16 readable collections (5 with per-scholar gradings) · 48,826 hadiths · 65,939 per-scholar rulings across 9 muhaddithūn.
Sources & licences
We do not maintain our own manuscript archive. The corpus is assembled from three external sources, each carrying its own licence (or, in one case, its own licence gap) and its own strengths.
Collections & editions
Every hadith collection available on TalibNotes, graded collections first. "With gradings" is the number of hadiths in that collection that carry at least one per-scholar ruling. Ṣaḥīḥ al-Bukhārī and Ṣaḥīḥ Muslim are present for reading but have no per-scholar column — the compilers' sahih screening is universally accepted, so scholarly grade lists for these two collections are not maintained in the fawazahmed0 corpus.
| Collection | Hadiths | With gradings | Notes |
|---|---|---|---|
| Sunan an-Nasa'iسنن النسائي | 5,704 | 5,625 | Compiled by al-Nasāʾī (d. 303/915). Sughrā recension; print edition: ʿAbd al-Fattāḥ Abū Ghudda (al-Mujtabā). |
| Sunan Abi Dawudسنن أبي داود | 5,283 | 5,084 | Compiled by Abū Dāwūd al-Sijistānī (d. 275/889). Print edition: Muḥammad Muḥyī al-Dīn ʿAbd al-Ḥamīd (Maktabat al-ʿAṣriyya). |
| Sunan Ibn Majahسنن ابن ماجه | 4,413 | 4,287 | Compiled by Ibn Mājah (d. 273/887). Print edition: Muḥammad Fuʾād ʿAbd al-Bāqī. |
| Jami' at-Tirmidhiجامع الترمذي | 4,220 | 3,774 | Compiled by al-Tirmidhī (d. 279/892). Print edition: Bashshār ʿAwwād Maʿrūf (Dār al-Gharb al-Islāmī). |
| Muwatta Malikموطأ مالك | 1,985 | 1,779 | Compiled by Imām Mālik (d. 179/795). Narration of Yaḥyā ibn Yaḥyā al-Laythī; print edition: Muḥammad Fuʾād ʿAbd al-Bāqī. |
| Sahih al-Bukhariصحيح البخاري | 7,669 | — | Compiled by Imām al-Bukhārī (d. 256/870). Print edition: Muḥammad Fuʾād ʿAbd al-Bāqī (Dār Ṭūq al-Najāh). |
| Sahih Muslimصحيح مسلم | 7,527 | — | Compiled by Imām Muslim (d. 261/875). Print edition: Muḥammad Fuʾād ʿAbd al-Bāqī (Dār Iḥyāʾ al-Turāth al-ʿArabī, Beirut). |
| Mishkat al-Masabihمشكاة المصابيح | 4,427 | — | Mishkāt al-Maṣābīḥ — cross-collection anthology by al-Khaṭīb al-Tabrīzī (d. 741/1340), expanding al-Baghawī's Maṣābīḥ al-Sunna. No edition cross-reference ingested. |
| Sunan al-Darimiسنن الدارمي | 2,757 | — | Compiled by al-Dārimī (d. 255/869). No edition cross-reference ingested. |
| Bulugh al-Maramبلوغ المرام | 1,767 | — | Bulūgh al-Marām min Adillat al-Aḥkām — legal-evidence anthology by Ibn Ḥajar al-ʿAsqalānī (d. 852/1449). No edition cross-reference ingested. |
| Al-Adab Al-Mufradالأدب المفرد | 1,326 | — | Al-Adab al-Mufrad — etiquette compilation by Imām al-Bukhārī (d. 256/870). No edition cross-reference ingested. |
| Riyad as-Salihinرياض الصالحين | 1,217 | — | Riyāḍ al-Ṣāliḥīn — topical anthology by al-Nawawī (d. 676/1277). No edition cross-reference ingested. |
| Shama'il at-Tirmidhiالشمائل المحمدية | 401 | — | Al-Shamāʾil al-Muḥammadiyya — Prophetic-description hadiths compiled by al-Tirmidhī (d. 279/892). No edition cross-reference ingested. |
| An-Nawawi's Forty + Ibn Rajab's Ziyādātالأربعون النووية وزيادات ابن رجب | 50 | — | Al-Arbaʿūn al-Nawawiyya — forty foundational hadiths selected by al-Nawawī (d. 676/1277). Numbering is canonical. |
| Forty Hadith Qudsiالأحاديث القدسية | 40 | — | Forty Ḥadīth Qudsī — compiled by al-Nawawī and Ibn Rajab, collecting divine-speech narrations. Numbering is canonical. |
| Shah Waliullah's Fortyأربعون شاه ولي الله | 40 | — | Al-Arbaʿūn — forty hadiths selected by Shāh Walīullāh al-Dihlawī (d. 1176/1762). No edition cross-reference ingested; LK corpus numbering is used. |
Al-Nawawī's Forty, the Qudsī Forty, and Shāh Walīullāh's Forty are readable but deliberately excluded from per-scholar grading ingest — matn-based mapping adds no signal for short canonical compilations already universally memorised, so they carry "—" in the gradings column.
Grading scholars (muhaddithūn)
The 9 scholars whose rulings fawazahmed0 ingested. Each ruling is attributed to a specific published work so different editions by the same scholar do not collapse into each other.
Zubair ʿAlī Zaʾī· 1957–2013 CE
زبير علي زئي
18,649 rulings
Primary works: Tahqiqi Sunan an-Nasa'i (Zubair Ali Zai) (5,580); Tahqiqi Sunan Abi Dawud (Zubair Ali Zai) (5,041); Tahqiqi Sunan Ibn Maja (Zubair Ali Zai) (4,275)
Muḥammad Nāṣir al-Dīn al-Albānī· 1914–1999 CE
محمد ناصر الدين الألباني
18,536 rulings
Primary works: Sahih/Daif Sunan an-Nasa'i (Albani) (5,596); Sahih/Daif Sunan Abi Dawud (Albani) (5,063); Sahih/Daif Sunan Ibn Maja (Albani) (4,254)
Shuʿayb al-Arnaʾūṭ· 1928–2016 CE
شعيب الأرناؤوط
6,254 rulings
Primary works: Tahqiq Sunan Ibn Maja (Arnaut) (3,206); Tahqiq Sunan Abi Dawud (Arnaut) (3,048)
ʿAbd al-Fattāḥ Abū Ghuddah· 1917–1997 CE
عبد الفتاح أبو غدة
5,570 rulings
Primary works: Sunan an-Nasa'i (Abu Ghuddah ed.) (5,570)
Muḥammad Muḥyī al-Dīn ʿAbd al-Ḥamīd· 1900–1972 CE
محمد محيي الدين عبد الحميد
4,984 rulings
Primary works: Sunan Abi Dawud (Muhyi al-Din Abdul Hamid ed.) (4,984)
Muḥammad Fuʾād ʿAbd al-Bāqī· 1882–1968 CE
محمد فؤاد عبد الباقي
4,260 rulings
Primary works: Sunan Ibn Maja (Fuad Abd al-Baqi ed.) (4,260)
Aḥmad Muḥammad Shākir· 1892–1958 CE
أحمد محمد شاكر
3,581 rulings
Primary works: Tahqiq Sunan al-Tirmidhi (Ahmad Shakir) (3,581)
Bashshār ʿAwwād Maʿrūf· b. 1940 CE
بشار عواد معروف
2,326 rulings
Primary works: tirmidhi (Bashar Awad Maarouf — unspecified source) (2,326)
Salīm al-Hilālī· b. 1957 CE
سليم الهلالي
1,779 rulings
Primary works: Muwatta Malik (Salim al-Hilali ed.) (1,779)
About the hadith numbering
Two equally first-class numbering systems, switchable from settings.
Every hadith you read here carries two reference numbers, both of which point to the same Arabic matn. Neither is “the” number; they answer different questions, so the reader lets you choose which one rides on top. The print-edition canonical number (ʿAbd al-Bāqī for the Sahihayn, Ibn Mājah and the Muwaṭṭaʾ; Bashshār ʿAwwād for al-Tirmidhī; Muḥyī al-Dīn for Abū Dāwūd; Abū Ghuddah for al-Nasāʾī) is the default primary badge across the site — the same number you will find cited in scholarly works and on sunnah.com. The LK Hadith Corpus number is the alternate view. Use Settings → Numbering in the reader's settings drawer to switch between the two modes.
Print-edition number
The number readers see cited externally
The hadith number from each collection's widely-cited print edition. This is the number you will find quoted in scholarly works, sharḥ texts, classroom handouts, and on sunnah.com (which follows the same print editions and serves the same numbers).
Per-collection editions: Muḥammad Fuʾād ʿAbd al-Bāqī for al-Bukhārī (Dār Ṭūq al-Najāh), Muslim (Dār Iḥyāʾ al-Turāth al-ʿArabī), Ibn Mājah, and the Muwaṭṭaʾ; Bashshār ʿAwwād Maʿrūf (Dār al-Gharb al-Islāmī) for al-Tirmidhī; Muḥammad Muḥyī al-Dīn ʿAbd al-Ḥamīd (Maktabat al-ʿAṣriyya) for Abū Dāwūd; ʿAbd al-Fattāḥ Abū Ghudda for al-Nasāʾī (al-Mujtabā).
LK Hadith Corpus number
The academic citation we ship under
The position of the hadith inside the Leeds + King Saud (LK) academic corpus — the dataset Altammami, Atwell & Alsalka assembled and that we cite as the immediate source of every matn and translation on this site (Altammami et al., IJASAT / IMAN 2019). Stable across editions; useful when you want the same numbering scheme our underlying tooling uses for grading, sharḥ alignment, and search.
The two numbers agree in some collections and diverge in others. They line up on ≈ 68 % of Ṣaḥīḥ al-Bukhārī hadiths, ≈ 44 % of Sunan Ibn Mājah, and as little as 3 % of Ṣaḥīḥ Muslim. When they differ, the reader shows the primary number per the active mode and the other as a muted subtitle so you always have both in view.
Provenance. For Sahih Muslim, the print number is sourced from the OpenITI/0275AH pre-clean branch (a digitisation of ʿAbd al-Bāqī's Dār Iḥyāʾ al-Turāth edition; CC BY-SA), which preserves the print's parenthesised hadith numbering. For the other 8 mapped collections, the print number is derived by pairing each LK matn against fawazahmed0/hadith-api — which mirrors the print editions — using the matn-based Jaccard mapping described in the methodology section. We are migrating each collection in turn to a direct OpenITI re-derivation against its own print edition.
Deep links. /hadith/<slug>/r/<number> resolves the print-edition number (e.g. /hadith/tirmidhi/r/3386). Anchors in shared links stay stable regardless of which numbering mode you are viewing in. For the ~1.1 % of print numbers that don't resolve to a unique LK row, the reader falls back to the print-edition matn itself; those rows carry an attribution badge and don't show grades or sanad breakdowns (those features depend on LK-corpus tooling).
LK numbering is unavailable for seven supplementary compilations (Sunan al-Dārimī, Shamāʾil, Riyāḍ al-Ṣāliḥīn, Bulūgh al-Marām, al-Adab al-Mufrad, Mishkāt, and Shāh Walīullāh's Forty) where no external print-edition mapping has been ingested — those collections show the LK number directly, with no toggle.
How to cite
For classroom handouts, footnotes, and papers. Cite the print edition so a reader with the physical book can follow you; mention TalibNotes only where the secondary aggregation matters.
Short reference — Ṣaḥīḥ Muslim 1 (ʿAbd al-Bāqī ed.). The collection name plus the print-edition number plus a short editor tag is what matches print and what a reader can look up on sunnah.com.
Full citation — Muslim, Ṣaḥīḥ Muslim, no. 1 (Muḥammad Fuʾād ʿAbd al-Bāqī ed., Dār Iḥyāʾ al-Turāth al-ʿArabī, Beirut), via TalibNotes: talibnotes.com/hadith/muslim/1/1.
Academic citation (LK) — when citing the dataset rather than the hadith, use the LK paper: Altammami, Atwell & Alsalka, The Arabic–English Parallel Corpus of Authentic Hadith, IJASAT / IMAN 2019. The LK number is the corpus-internal identifier you will want to reference in that context.
Arabic text note. Matns are minimally vocalized throughout the corpus; full tashkīl is not ingested. Ṣaḥīḥ al-Bukhārī is the one collection whose text was manually annotated by the LK team — the rest are automatically segmented and should be cross-checked against a print copy for anything high-stakes.
Methodology
How our hadiths line up with upstream sources, how we decide when a match is real, and what we do when it isn't.
Matn-based matching. We do not assume fawazahmed0's per-collection hadith number lines up with our own. Numbering conventions drift between editions, and an off-by-one in one chapter can silently misattribute every ruling from that chapter onwards. Instead we compare the normalised Arabic matn of each hadith token-by-token (Jaccard similarity after tashkīl strip, stopword removal, and quote normalisation) and require both a similarity threshold and a gap to the second-best candidate before we accept a pairing.
Independent cross-check. Once the pairing set is built, we re-validate a random slice against the same hadith in OpenITI — a corpus with independent editorial lineage — so a systematic drift in our primary sources would not pass silently.
QA pipeline. Five progressive automated passes over the ambiguous bucket (edit-distance on tokens, stopword-adjusted overlap, normalised SHA comparison, length parity, and a final human-curated allow-list), followed by a manual review of remaining edge cases before any row is ingested. Acceptance gates per collection: unique + ambiguous ≥ 60%, no_match ≤ 40%, ambiguous ≤ 5%. Every collection surfaced on this page cleared all three gates before any row was ingested.
Full write-up and per-collection QA sheets live in the repo under docs/hadith-enrichment/.
What this page doesn't cover (yet)
- Rijāl biographies. Narrator-level lookups are not part of the corpus. No current plans to add them.
- Sharḥ (classical commentary). Two sharḥ rails ship today, both keyed to the ʿAbd al-Bāqī canonical numbering: al-Nawawī's al-Minhāj on Ṣaḥīḥ Muslim and Ibn Ḥajar al-ʿAsqalānī's Fatḥ al-Bārī on Ṣaḥīḥ al-Bukhārī. Other sharḥ source works (ʿUmdat al-Qārī, Tuḥfat al-Aḥwadhī, ʿAwn al-Maʿbūd, the Sindī ḥāshiyas, etc.) are planned as follow-up passes.
- Takhrīj clusters. Cross-references ("also narrated by...") are stashed in the grading metadata but not yet surfaced as a browseable graph. Also planned.
- Per-ruling attributions. Sunnah.com's parentheticals like "(Al-Albānī)" and "(Darussalam)" were dropped by their CSV export. We point readers to the per-scholar chips instead — but we may recover these parentheticals later to tighten the legacy "Sunnah.com" chip's provenance.
Typography
The hadith reader ships two Arabic typefaces, switchable from the page-settings drawer.
- KFGQPC Uthman Taha Naskh — the default. The King Fahd Glorious Qur'ān Printing Complex's Naskh cut. Used across the reader for matn and sharḥ body.
- Kitab — by Khaled Hosny / The Katib Project Authors, derived from SIL Scheherazade. Distributed under the SIL Open Font License. A more classical Naskh tuned for running prose; we ship it from nuqayah/kitab-font under its OFL terms; the licence text lives at
/fonts/hadith/Kitab-OFL.txt.
Corrections
Spot a misattribution, a broken pairing, a typo in a grade label, or a source we should cite differently? Please tell us. Good corrections help everyone who reads this corpus after you.