UAX #41: Common References for UAXs
Unicode® Standard Annex #41
Common References for Unicode Standard Annexes
Version
Unicode 17.0.0
Editors
Ken Whistler
Date
2025-07-30
This Version
Previous Version
Latest Version
Latest Proposed Update
Revision
36
Summary
This annex presents a common set of references for the Unicode Standard Annexes.
Status
This document has been reviewed by Unicode members and other
interested parties, and has been approved for publication by the Unicode
Consortium. This is a stable document and may be used as reference
material or cited as a normative reference by other specifications.
A Unicode Standard Annex (UAX) forms an integral part of the
Unicode Standard, but is published online as a separate document. The
Unicode Standard may require conformance to normative content in a Unicode
Standard Annex, if so specified in the Conformance chapter of that version
of the Unicode Standard. The version number of a UAX document corresponds
to the version of the Unicode Standard of which it forms a part.
Please submit corrigenda and other comments with the online reporting
form [Feedback].
For the latest version of the Unicode Standard, see [Unicode].
For a list of current Unicode Technical Reports, see [Reports].
For more information about versions of the Unicode Standard, see [Versions].
For any errata which may apply to this annex, see [Errata].
Contents
References to Publications by the Unicode Consortium
References to Other Standards
Other References
References to Publications by the Unicode Consortium
Publications may be listed more than once under different headings.
Blocks
Character Block Property Data File
Latest version:
Version 17.0.0:
Charts
Character Code Charts
Latest version:
Index of character names with links to the corresponding code charts:
Charts14
Charts for the Unicode Line Breaking Algorithm Test Files
Latest version:
Version 17.0.0:
Charts15
Normalization Charts
Latest version:
Charts29
Charts for the Unicode Text Segmentation Test Files
Latest version:
Version 17.0.0:
CJKRadicals
CJK Radicals Data File
Latest version:
Version 17.0.0:
CLDR
Unicode Common Locale Data Repository
Code9
Reference Implementations of the Unicode Bidirectional Algorithm
C reference code:
Java reference code:
Code14
Sample Implementation of the Unicode Line Breaking Algorithm
Corrections
Normalization Corrections Data File
Latest version:
Version 17.0.0:
Corrigendum1
Corrigendum #1: UTF-8 Shortest Form
Corrigendum2
Corrigendum #2: Yod with Hiriq Normalization
Corrigendum3
Corrigendum #3: U+F951 Normalization
Corrigendum4
Corrigendum #4: Five CJK Canonical Mapping Errors
Corrigendum5
Corrigendum #5: Normalization Idempotency
Corrigendum6
Corrigendum #6: Bidi Mirroring
Corrigendum7
Corrigendum #7: UAX #14, Unicode Line Breaking Algorithm, rule LB8
Corrigendum8
Corrigendum #8: Bidi_Class Fix for U+070F Syriac Abbreviation Mark
Corrigendum9
Corrigendum #9: Clarification About Noncharacters
Data9
Unicode Bidirectional Algorithm Property Data Files
Latest version:
Version 17.0.0:
Data11
East Asian Width Property Data File
Latest version:
Version 17.0.0:
Data14
Unicode Line Breaking Algorithm Property Data File
Latest version:
Version 17.0.0:
Data14Derived
Unicode Line Breaking Algorithm Derived Property Data File
Latest version:
Version 17.0.0:
Data24
Unicode Script Property Data File
Latest version:
Version 17.0.0:
Data34
Unicode Named Character Sequences Data File
Latest version:
Version 17.0.0:
Data45
U-Source Ideographs Data File
Latest version:
Version 17.0.0:
Data50
Unicode Vertical Text Layout Property Data File
Latest version:
Version 17.0.0:
Data51
Unicode Emoji Data Files
Latest version:
Version 17.0.0:
DataProv
Provisional Named Sequences Data File
Latest version:
Version 17.0.0:
Demo9
Online Demo of the Unicode Bidirectional Algorithm
DerivedBIDI
Derived Bidirectional Type Property Data File
Latest version:
Version 17.0.0:
EquivalentUnifiedIdeograph
List of Unified Ideographs Equivalent to CJK Radicals or CJK Strokes
Latest version:
Version 17.0.0:
Errata
Updates and Errata
Exclusions
Composition Exclusion Table
Latest version:
Version 17.0.0:
FAQ
Frequently Asked Questions
Answers to common questions on technical issues:
Feedback
Contact Form
Error reporting and information requests:
Glossary
Glossary of Unicode Terms
Glyphs45
U-Source Ideographs Glyph Table
Latest version:
Version 17.0.0:
HangulST
Hangul Syllable Type Property Data File
Latest version:
Version 17.0.0:
NormProps
Derived Normalization Properties Data File
Latest version:
Version 17.0.0:
Policies
Unicode Consortium Policies
Props
Unicode Text Segmentation Property Data Files
Latest version:
Version 17.0.0:
PropValue
Property Value Aliases Data File
Latest version:
Version 17.0.0:
Reports
Unicode Technical Reports
List of Unicode Standard Annexes, Technical Standards, and Technical Reports:
RSChart45
U-Source Radical-Stroke Index
Latest version:
Version 17.0.0:
Stability
Unicode Character Encoding Stability Policy
StandardizedVariants
Standardized Variation Sequences Data File
Latest version:
Version 17.0.0:
Tests9
Unicode Bidirectional Algorithm Test Data File
Latest version:
Version 17.0.0:
Tests14
Unicode Line Breaking Algorithm Test Data File
Latest version:
Version 17.0.0:
Tests15
Unicode Normalization Forms Test Data File
Latest version:
Version 17.0.0:
Tests29
Unicode Text Segmentation Test Data Files
Latest version:
Version 17.0.0:
UAX9
Unicode Standard Annex #9:
Unicode Bidirectional Algorithm
Latest version:
Version 17.0.0:
UAX11
Unicode Standard Annex #11:
East Asian Width
Latest version:
Version 17.0.0:
UAX14
Unicode Standard Annex #14:
Unicode Line Breaking Algorithm
Latest version:
Version 17.0.0:
UAX15
Unicode Standard Annex #15:
Unicode Normalization Forms
Latest version:
Version 17.0.0:
UAX24
Unicode Standard Annex #24:
Unicode Script Property
Latest version:
Version 17.0.0:
UAX29
Unicode Standard Annex #29:
Unicode Text Segmentation
Latest version:
Version 17.0.0:
UAX31
Unicode Standard Annex #31:
Unicode Identifiers and Syntax
Latest version:
Version 17.0.0:
UAX34
Unicode Standard Annex #34:
Unicode Named Character Sequences
Latest version:
Version 17.0.0:
UAX38
Unicode Standard Annex #38:
Unicode Han Database (Unihan)
Latest version:
Version 17.0.0:
UAX41
Unicode Standard Annex #41:
Common References for Unicode Standard Annexes
Latest version:
Version 17.0.0:
UAX42
Unicode Standard Annex #42:
Unicode Character Database in XML
Latest version:
Version 17.0.0:
UAX44
Unicode Standard Annex #44:
Unicode Character Database
Latest version:
Version 17.0.0:
UAX45
Unicode Standard Annex #45:
U-Source Ideographs
Latest version:
Version 17.0.0:
UAX50
Unicode Standard Annex #50: Unicode Vertical Text Layout
Latest version:
Version 17.0.0:
UAX53
Unicode Standard Annex #53: Unicode Arabic Mark Rendering
Latest version:
Version 17.0.0:
UAX57
Unicode Standard Annex #57: Unicode Egyptian Hieroglyph Database (Unikemet)
Latest version:
Version 17.0.0:
UCD
About the Unicode Character Database
For detailed documentation, see [UAX44].
Unicode
The Unicode Standard
Latest version:
Version 17.0.0:
Unicode3.0
The Unicode Consortium,
The Unicode Standard, Version 3.0.0
defined by:
The Unicode Standard, Version 3.0
(Reading, MA: Addison-Wesley, 2000. ISBN 0-201-61633-5),
Unicode3.1
The Unicode Consortium,
The Unicode Standard, Version 3.1.0
defined by:
The Unicode Standard, Version 3.0
(Reading, MA: Addison-Wesley, 2000. ISBN 0-201-61633-5),
as amended by the
Unicode Standard Annex #27: Unicode 3.1
Unicode3.2
The Unicode Consortium,
The Unicode Standard, Version 3.2.0
defined by:
The Unicode Standard, Version 3.0
(Reading, MA: Addison-Wesley, 2000. ISBN 0-201-61633-5),
as amended by the
Unicode Standard Annex #27: Unicode 3.1
and the
Unicode Standard Annex #28: Unicode 3.2
Unicode4.0
The Unicode Consortium,
The Unicode Standard, Version 4.0.0
defined by:
The Unicode Standard, Version 4.0
(Boston, MA: Addison-Wesley, 2003. ISBN 0-321-18578-1),
Unicode4.0.1
The Unicode Consortium,
The Unicode Standard, Version 4.0.1
defined by:
The Unicode Standard, Version 4.0
(Boston, MA: Addison-Wesley, 2003. ISBN 0-321-18578-1),
as amended by
Unicode 4.0.1
Unicode4.1
The Unicode Consortium,
The Unicode Standard, Version 4.1.0
defined by:
The Unicode Standard, Version 4.0
(Boston, MA: Addison-Wesley, 2003. ISBN 0-321-18578-1),
as amended by
Unicode 4.0.1
and
Unicode 4.1.0
Unicode5.0
The Unicode Consortium,
The Unicode Standard, Version 5.0.0
defined by:
The Unicode Standard, Version 5.0
(Boston, MA: Addison-Wesley, 2007. ISBN 0-321-48091-0),
Unicode5.1
The Unicode Consortium,
The Unicode Standard, Version 5.1.0
defined by:
The Unicode Standard, Version 5.0
(Boston, MA: Addison-Wesley, 2007. ISBN 0-321-48091-0),
as amended by
Unicode 5.1.0
Unicode5.2
The Unicode Consortium,
The Unicode Standard, Version 5.2.0
defined by:
The Unicode Standard, Version 5.2
(Mountain View, CA: The Unicode Consortium, 2009. ISBN 978-1-936213-00-9),
Unicode6.0
The Unicode Consortium,
The Unicode Standard, Version 6.0.0
(Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6)
Unicode6.1
The Unicode Consortium,
The Unicode Standard, Version 6.1.0
(Mountain View, CA: The Unicode Consortium, 2012. ISBN 978-1-936213-02-3)
Unicode6.2
The Unicode Consortium,
The Unicode Standard, Version 6.2.0
(Mountain View, CA: The Unicode Consortium, 2012. ISBN 978-1-936213-07-8)
Unicode6.3
The Unicode Consortium,
The Unicode Standard, Version 6.3.0
(Mountain View, CA: The Unicode Consortium, 2013. ISBN 978-1-936213-08-5)
Unicode7.0
The Unicode Consortium,
The Unicode Standard, Version 7.0.0
(Mountain View, CA: The Unicode Consortium, 2014. ISBN 978-1-936213-09-2)
Unicode8.0
The Unicode Consortium,
The Unicode Standard, Version 8.0.0
(Mountain View, CA: The Unicode Consortium, 2015. ISBN 978-1-936213-10-8)
Unicode9.0
The Unicode Consortium,
The Unicode Standard, Version 9.0.0
(Mountain View, CA: The Unicode Consortium, 2016. ISBN 978-1-936213-13-9)
Unicode10.0
The Unicode Consortium,
The Unicode Standard, Version 10.0.0
(Mountain View, CA: The Unicode Consortium, 2017. ISBN 978-1-936213-16-0)
Unicode11.0
The Unicode Consortium,
The Unicode Standard, Version 11.0.0
(Mountain View, CA: The Unicode Consortium, 2018. ISBN 978-1-936213-19-1)
Unicode12.0
The Unicode Consortium,
The Unicode Standard, Version 12.0.0
(Mountain View, CA: The Unicode Consortium, 2019. ISBN 978-1-936213-22-1)
Unicode12.1
The Unicode Consortium,
The Unicode Standard, Version 12.1.0
(Mountain View, CA: The Unicode Consortium, 2019. ISBN 978-1-936213-25-2)
Unicode13.0
The Unicode Consortium,
The Unicode Standard, Version 13.0.0
(Mountain View, CA: The Unicode Consortium, 2020. ISBN 978-1-936213-26-9)
Unicode14.0
The Unicode Consortium,
The Unicode Standard, Version 14.0.0
(Mountain View, CA: The Unicode Consortium, 2021. ISBN 978-1-936213-29-0)
Unicode15.0
The Unicode Consortium,
The Unicode Standard, Version 15.0.0
(Mountain View, CA: The Unicode Consortium, 2022. ISBN 978-1-936213-32-0)
Unicode15.1
The Unicode Consortium,
The Unicode Standard, Version 15.1.0
(South San Francisco, CA: The Unicode Consortium, 2023. ISBN 978-1-936213-33-7)
Unicode16.0
The Unicode Consortium,
The Unicode Standard, Version 16.0.0
(South San Francisco, CA: The Unicode Consortium, 2024. ISBN 978-1-936213-34-4)
Unicode17.0
The Unicode Consortium,
The Unicode Standard, Version 17.0.0
(South San Francisco, CA: The Unicode Consortium, 2025. ISBN 978-1-936213-35-1)
Unihan
Unihan Database
Latest version:
Version 17.0.0:
UTC
Unicode Technical Committee
UTN5
Unicode Technical Note #5:
Canonical Equivalence in Applications
UTN43
Unicode Technical Note #43:
Unihan Database Property "kStrange"
UTN45
Unicode Technical Note #45:
Unihan Property History
UTN50
Unicode Technical Note #50:
KP-Source Property Value History
UTN54
Unicode Technical Note #54:
Annotated Line Breaking Algorithm
UTR17
Unicode Technical Report #17: Unicode Character Encoding Model
Latest version:
UTR23
Unicode Technical Report #23: The Unicode Character Property Model
Latest version:
UTR25
Unicode Technical Report #25: Unicode Support for Mathematics
Latest version:
UTR33
Unicode Technical Report #33: Unicode Conformance Model
Latest version:
UTR36
Unicode Technical Report #36: Unicode Security Considerations
Latest version:
UTR53
Unicode Technical Report #53: Unicode Arabic Mark Rendering
Latest version:
UTR54
Unicode Technical Report #54: Unicode Mongolian 12.1 Snapshot
Latest version:
UTS6
Unicode Technical Standard #6: A Standard Compression Scheme for Unicode
Latest version:
UTS10
Unicode Technical Standard #10: Unicode Collation Algorithm
Latest version:
Version 17.0.0:
UTS18
Unicode Technical Standard #18: Unicode Regular Expressions
Latest version:
UTS22
Unicode Technical Standard #22: Unicode Character Mapping Markup Language (CharMapML)
Latest version:
UTS35
Unicode Technical Standard #35: Unicode Locale Data Markup Language (LDML)
Latest version:
UTS37
Unicode Technical Standard #37: Unicode Ideographic Variation Database
Latest version:
UTS39
Unicode Technical Standard #39: Unicode Security Mechanisms
Latest version:
Version 17.0.0:
UTS46
Unicode Technical Standard #46: Unicode IDNA Compatibility Processing
Latest version:
Version 17.0.0:
UTS51
Unicode Technical Standard #51: Unicode Emoji
Latest version:
Version 17.0.0:
UTS55
Unicode Technical Standard #55: Unicode Source Code Handling
Latest version:
Versions
About Versions of the Unicode Standard
Information on version numbering, and citing and referencing the Unicode Standard, the Unicode Character Database, and Unicode Technical Reports:
References to Other Standards
10646
International Organization for Standardization,
ISO/IEC 10646:2020: Information Technology – Universal Coded Character Set (UCS), Sixth Edition
Available from the ISO/IEC ITTF website:
CharMod
Martin J. Dürst, et al.,
Character Model for the World Wide Web 1.0: Fundamentals, W3C Recommendation
CSS3Writing
Elika J. Etemad / fantasai, Koji Ishii,
CSS Writing Modes Level 3, W3C Candidate Recommendation
HTML5
Steve Faulkner, et al.,
HTML 5.2, W3C Recommendation, 14 December 2017
IDNA2008
The IDNA2008 specification is defined by a cluster of IETF RFCs:
Internationalized Domain Names for Applications (IDNA): Definitions and Document Framework
Internationalized Domain Names in Applications (IDNA): Protocol
The Unicode Code Points and Internationalized Domain Names for Applications (IDNA)
Right-to-Left Scripts for Internationalized Domain Names for Applications (IDNA)
There is also an informative document:
Internationalized Domain Names for Applications (IDNA): Background, Explanation, and Rationale
ISO15924
International Organization for Standardization,
ISO 15924:2004: Information and Documentation – Codes for the Representation of Names of Scripts
ISO19757
International Organization for Standardization,
ISO/IEC 19757-2:2008: Information Technology – Document Schema Definition Language (DSDL) –
Part 2: Regular-Grammar-Based Validation – RELAX NG, Second Edition
Available from the ISO/IEC ITTF website:
JIS
Japanese Standards Association,
JIS X 4051:2004: Formatting Rules for Japanese Documents
『日本語文書の組版方法』
KSX1026
Korean Agency for Technology and Standards,
KS X 1026-1:2007: Information Technology – Universal Multiple Octet Coded Character Set – Hangul –
Part 1: Hangul Processing Guide for Information Interchange
XML
Tim Bray, et al.,
Extensible Markup Language (XML) 1.0, Fifth Edition, W3C Recommendation
Other References
CharLint
Martin J. Dürst,
Charlint – A Character Normalization Tool
CharMatch
Addison Phillips,
Character Model for the World Wide Web: String Matching and Searching, W3C Working Draft
CharNorm
François Yergeau, et al.,
Character Model for the World Wide Web 1.0: Normalization, W3C Working Draft
JLREQ
Hiroyuki Chiba, et al.,
Requirements for Japanese Text Layout, W3C Working Group Note
Knuth78
Donald E. Knuth, et al.,
Breaking Lines into Paragraphs, republished in Digital Typography, CSLI 78
(Stanford, CA: CSLI Publications, 1997)
Suign98
Michel Suignard,
Worldwide Typography and How to Apply JIS X 4051-1995 to Unicode
Proceedings of the Twelfth International Unicode / ISO 10646 Conference (Tokyo, Japan: 1998)
TEX
Donald E. Knuth,
TeX, the Program, Volume B of Computers & Typesetting
(Reading, MA: Addison-Wesley, 1986)
UnicodeXML
Martin Dürst, Asmus Freytag,
Unicode in XML and other Markup Languages, W3C Working Group Note
© 2006–2025 Unicode, Inc. This publication is protected by copyright, and permission must be obtained from Unicode, Inc. prior to any reproduction, modification, or other use not permitted by the Terms of Use. Specifically, you may make copies of this publication and may annotate and translate it solely for personal or internal business purposes and not for public distribution, provided that any such permitted copies and modifications fully reproduce all copyright and other legal notices contained in the original. You may not make copies of or modifications to this publication for public distribution, or incorporate it in whole or in part into any product or publication without the express written permission of Unicode.
Use of all Unicode Products, including this publication, is governed by the Unicode Terms of Use. The authors, contributors, and publishers have taken care in the preparation of this publication, but make no express or implied representation or warranty of any kind and assume no responsibility or liability for errors or omissions or for consequential or incidental damages that may arise therefrom. This publication is provided “AS-IS” without charge as a convenience to users.
Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries.