<<

I N T E R N E T C O R P O R AT I O N F O R A SS I G N E D N A M E S A N D N U M B E R S

IDNs: Internationalized Domain Names Did you know?

· Internationalized Domain Names (IDNs) are domain names with characters other than a, b…, z; 0, 1,…, 9; and “-“. · IDNs on the second and third levels exist in some generic top-level domains (gTLDs) and in some country code top-level domains (ccTLDs). TLD

registries determine the choice of characters available under these TLDs. · Some languages, like Arabic, are written from right to left. IDNs at the second level are not very useful for the communities that use these languages because users would need to change both the type direction and script in the middle of entering a . · IDN TLDs will be made available through two separate processes, initially through the New gTLD Program and the IDN ccTLD Fast Track Process. · 60 percent of users are non-English speakers, while the dominant language used on the Internet is English (see Global Reach at www.glreach.com).

One of the most significant innovations in the Internet since its incep- tion is the introduction of Internationalized Domain Names (IDNs) in the Internet’s address space. You may be familiar with some of the larger top- level domains: dot-com, dot-cn, and dot-org are just three examples. IDN top-level domain names will offer many new opportunities and benefits for Internet users around the world by allowing them to establish and use domains in their native languages and scripts.

ICANN, the Internet Corporation for Assigned Names and Numbers, is responsible for managing and coordinating the

(DNS) to ensure that every address is unique and that all of us who use the Internet can find all valid Internet addresses. It does this by overseeing the Although many websites contain multilingual content, if you look distribution of unique IP (Internet Protocol) addresses and domain names. closely at the site’s web address on your browser, you will find IDNs It also ensures that each domain name maps to the correct IP address. represented only on the second level or lower levels. This representation happens in some gTLDs and some ccTLDs, and each registry decides One of ICANN’s top priorities is its IDN Program. This program is made up which characters are available for registration in domain names. of several initiatives that involve a significant cross-section of the global A map showing the IDN implementation across TLDs is available at Internet community and are aimed at providing a secure method for im- http://www.icann.org/en/maps/idntld.htm. plementing internationalized top level domains in the root zone. Through this program, users will be able to register and use domain names based Soon users around the globe will have access to IDN TLDs made up of on their local language scripts. This includes users of languages based on non-ASCII characters to the right of the dot. ICANN is carrying out three right-to-left scripts such as Arabic and users of languages based on non- initiatives to bring this about. Two are being implemented and one is at alphabetic scripts such as Mandarin Chinese. End users will be able to the policy development stage. These three processes are described briefly more easily identify localized online content by using fully localized identi- in the graphic on page 2. fiers. One example is http:// (“example.test” in Korean Hangul 실례.테스트 IDN usage in —or, more specifically, in the domain name part of the script), which is used for testing and illustration purposes only. address—is not yet fully available. The technical standard that will Most of the program’s objectives are related to: make this possible is being developed by the Internet Engineering Task Force, or IETF. Some implementations are now available for users to try out. · The secure and stable technical and operational deployment of IDNs in the Domain Name System (DNS) at the root level. More information and test environments can be found at: · Engaging the global Internet community in the policy processes that · http://cslab.kr/eai-test/index.php/Main-Page will guide the deployment, allocation and secure management of IDNs. · http://global-email.info/intro.html · http://www.ietf.org/html,charters/eai-charter.html What are IDNs? IDN is the short name for an Internationalized Domain Name. These do- main names are represented by local language characters. Such domain names could contain characters with diacritical marks as required by many European languages, or characters from non-Latin scripts (for example, Arabic or Chinese). Why are IDNs important? In Implementation: IDN ccTLD Fast Track Process IDNs are important for many reasons. The most important is the • To introduce a limited number of IDN TLDs in non-Latin scripts growing number of Internet users around the world for whom it is • Requires strings to be meaningful representations of country or difficult to use ASCII characters. The fact is, the Internet is accessed by territory names • Only for countries or territories represented in the ISO 3166-1 list more people who do not use Latin languages and scripts than those who do. This means that it is difficult for them to recognize ASCII In Policy Development: IDN ccTLDs – Long Term characters and reproduce them on keyboards or use software to enter • Full policy that caters for all website addresses in browsers. • Follows the ccNSO policy development process • Was initiated formally on 13 April 2009 by the ccNSO Council For example: http://www.icann.org/en/announcements/announcement-13apr09- en.htm • IDNs in your native language make navigating the Internet much easier. If you read an online newspaper in Russian and there is a link In Implementation: New Generic TLD Program to more information, then it can be a challenge to reproduce an ASCII • New process for applying for new gTLDs web address. It also makes more sense to have a Cyrillic-based ad- • Includes IDN TLDs dress for a site with Russian content. • Technical string requirements are the same for all IDN TLDs • If you see a billboard advertisement that contains a link or email ad- • Includes a mechanism for avoiding confusingly similar strings

dress, it is more useful if the website or email address is based on char- acters that you can reproduce. In other words, in many countries and territories this link must be in scripts other than the basic Latin script. What is the IDN ccTLD Fast Track Process? • Suppose your child comes home from school with an assignment that The IDN ccTLD Fast Track Process focuses on meeting a near-term demand requires obtaining information online on a specified website. What if by allocating a limited number of IDN ccTLDs based on non-Latin scripts the website address is in a different script than your native language? while a full IDN ccTLD policy is being developed. The process is further It may be hard to recognize the characters and reproduce the address limited to countries and territories represented in the ISO 3166-1 list. The in a browser. Fast Track Process is expected to be launched in Q4 2009, and a separate • Sometimes a website address is not a name that makes sense or has factsheet for this process is available at www.icann.org/topics/idn. any meaning attached. In such cases it is even more important to use the script or alphabet that intended users can recognize and be able What is the IDN ccTLD Policy Development? to reproduce on a computer keyboard. The IDN ccTLD policy development process is a long-term process for Certainly, IDNs have clear advantages in targeting local markets and non- countries and territories that do not qualify for the Fast Track Process. English speaking Internet users, and for local promotions and advertise- The first step in developing a long-term process for introducing IDN ments. So if you are a user not familiar with the basic Latin script, or if you ccTLDs is to develop an IDN ccTLD policy. ICANN’s Country Code Names wish to reach such users (with information, products or services), then Supporting Organization (ccNSO) is doing this. As soon as the process IDNs will be very important for your ability to reach these markets. is completed, implementation efforts will begin for all IDN ccTLDs. The

long-term process will incorporate the experience gained in the Fast How can I register an IDN? Track Process. If you want an IDN at the second level, or lower levels, you should select a TLD that meets your purpose. Check with you registrar and registry What is the New gTLD Applicant Guidebook? whether IDNs and the characters you wish to use are available. For a good The New gTLD Program focuses on expanding choices for generic top- overview, go to http://www.icann.org/en/maps/idntld.htm. This map shows level domains, or gTLDs. This expansion includes IDNs at the top level in where IDNs are available. Currently, more than 47 TLD registries and order to meet growing diversity and encourage competition for more registry managers support IDNs. innovation, choice and change to the Internet’s addressing system. If you want to register a domain name with the entire string in local char- acters, or if you want to apply to become an IDN TLD operator, you can ICANN is developing both a process for registries to apply for new gTLDs choose from three possible programs. Each program has its own specific and an Applicant Guidebook that takes applicants through the process qualification requirements, rules and launch time. and explains the implications of the many complicated issues surround- ing these new top-level domains. The Internet community is currently The diagram here gives an overview of the three programs available. As working on resolving string contention, protecting intellectual property these initiatives continuously progress, please view the latest status at: rights, handling internationally recognized issues of morality and public • IDN ccTLD Fast Track Process: http://www.icann.org/en/topics/idn/fast- order, and the geographical names process, among other issues. After track/ The shorthand term IDN ccTLDs refers to new top-level domains going through several rounds of public comment, the process and the associated with entries in the ISO 3166-1 list. guidebook are expected to be fully implemented in 2010. • New gTLD Program (including IDN gTLDs): http://www.icann.org/en/ topics/new-gtld-program.htm For more information about the Applicant Guidebook and the New gTLD Program, go to http://icann.org/en/topics/new-gtld-program.htm. • IDN ccNSO PDP: http://ccnso.icann.org/

2 Where can I try IDNs? What are the IDN Guidelines? ICANN created a dedicated IDNwiki facility to introduce users to the The guidelines list general standards for IDN registration in addition to testing of IDN top-level domains. The test is based on eleven IDN TLDs those in the IDNA standard. Registries must follow the guidelines’ policies representing “example.test” in non-Latin based scripts. The table below and practices for IDN TLDs to minimize the risk of cybersquatting and con- shows the languages and scripts initially selected for the IDNwiki. You can sumer confusion, and to ensure that the interests of local languages and enter the Wiki by typing in any of the illustrated domains, or by going to character sets are respected. Registries that deploy IDNs will be required http://idn.icann.org. to follow these guidelines.

Public participation in the evaluation of these domains is an important In addition all new TLD managers, either through the IDN ccTLD Fast Track Pro- aspect of the project. To participate, access one or more of the addresses cess or the New gTLD Program, will be required to meet all existing or future in the table and report your experience on the wiki discussion pages. versions of the IDN Guidelines. The latest version of the guidelines is available at http://www.icann.org/en/topics/idn/implementation-guidelines.htm. More languages and scripts are being added as requests come in from the various language communities. The IDN Guidelines will be revised right after the IDNA protocol revision is completed to ensure they are consistent.

Brief history of internationalization and the

Internet’s Domain Name System

Internationalization measures, including IDNs, might be the largest change in Internet operation since the TCP/IP (Transmission Control

Protocol/Internet Protocol) was introduced. Furthermore, IDN deployment might be more significant and complex than the original DNS introduc-

tion. To fully understand the difficulties of internationalization of the domain name space, it is important and useful to give a brief history of

IDN technology development.

The idea of internationalizing domain names goes back to the devel- What is the IDNA protocol revision? opmental stages of the Internet in the 1970s. Discussions centered The IETF is creating the standards for using non-ASCII characters in the on the usability of languages and scripts. However, the technology Domain Name System. In 2003, an international IDN working group devel- deployed in the Domain Name System today, which allows practically oped the IDNA standards, which since have been used by TLD registries any character in the registration of a domain name, was not developed and application developers. Since then, the IETF IDN working group has at that time. Therefore, the characters available for registering domain been evaluating the experience gained in implementing the IDNA names was limited to ASCII characters: the letters a-z, the digits 0-9, and protocol. They have identified several key areas of work now under way. the hyphen “-“. The core components still being revised include: In 2003, technical bodies raised the topic again, and the Internet Engi- • Eliminating version dependencies, thereby permitting more neering Task Force released the standards (RFCs 3490, 3491, and 3492), characters to be used automatically in IDNs now and in the future. also often referred to as IDNA, or IDNs in Application. These standards • Clear defining valid IDN labels. provide technical guidelines for successful deployment of IDNs; that is, a • Fixing the complications and errors in the existing protocol for many mechanism to handle non-ASCII characters in domain names in a stan- right-to-left domains. These domains will become valid when the dard fashion. It works by converting names with non-ASCII characters to revised protocol is implemented. ASCII labels that the Domain Name System will understand. The issues with the current IDN model that led to the revision work are discussed in RFC4690. The Internet drafts with proposed revisions are Later, ICANN and several top-level domain registries released version available at http://icann.org/topics/idn/rfc.htm. 1.0 of the IDN Guidelines. These guidelines were created for generic TLD (gTLD) registries to follow when implementing IDNs at the second level These proposed revisions are currently going through Last Call in the only. The current guidelines, version 2.1, is available at http://icann.org/ IETF, after which they will become final standards and can be used by topics/idn/implementation-guidelines.htm. TLD registries and application developers that want to be IDN capable. ICANN is expecting the standards to be completed in time for the launch Work is under way to revise the IDNA protocol, based on experience since of IDN TLDs. Although additional time will be needed to implement the the 2003 introduction. Additional work is under way by an IDN TLD registry standards in applications, ICANN will follow the technical standards in working group to amend the guidelines further to ensure that the guide- the IDN TLD string requirements right away. line directions will be used deeper into the DNS hierarchy, particularly as a set of principles for implementing internationalized top-level labels.

For a more detailed history of technical development see http://www.isoc. org/pubpolpillar/docs/i18n-dns-chronology.pdf.

3 Intrnet Corporation for Assigned Names and Numbers (September 2009)

. . “

-

-

-

is

- of

a

- a

xn IDN on to has sole more

en

8649 2714 7913

with

that

s

7848 and

this ASCII

in Inter local

( role

p1b

be ]

823 429 The The http:// -- confusion application and

ICANN doesn’t 234 8236 doesn’t was

way

local an 2 2 the

at registrants xn For

it

al provides provides 310 202 will

understand

impact because languages

forward

web address

address s

is:

i collaborative

+1 +1 +32 +61 a more more

avoid

of to include Without

s name.

F F F F

each other. handling type and that

ICANN ICANN makers, IDNs as to name found

-

participate That looks .

of corporation parties.

to

for

critical that

competition keeping all

be to

name standard. (DNS)

coordination

[

is each

world. variety

level spam

n to http://www.unicode.org

domain important example need the the need

policy

of

its

have pints scheme can

a

expand ICANN how

domain

the

find Internet. benefit an

capable we

visit

-

wide of number.

stop to

identifiers. System domai you to a solutions

“example.test”) “example.test”)

interested interested

a

organization founded to founded organization Unicode

code that registry

promotes e

characters

9358 7900 2407

of approach version version only 7870

or have experts,

It th

form form

across www.icann.org

through the

dedicated global Program Program is f the public

the Internet.

823

429

resolve for

encoding

up

o

Name 234 8236 -

cannot

across across global Internet, of

where

unique ASCII conventions. 2 2 at

n

to

continue continue But 310 202

It

does

Internet IDN

the one

name DNS

hold of

stakeholders. stakeholders. visit: version version

it nonprofit

+1 +32 +61 +1 use profit

world a

of

- single T T T T

a

ersio - linguistic able the the

punycode decoded

know

v forums for all for forums The The bottom

is Domain tables have for

identifiers e the requires requires

cultural -

be other

on

reserved character Tables script, the

The intention is intention The

and

this

used

more information on information more

please

Internet’s Internet. od

Internet.

to

interoperable.

c system, not open

sequence and promote over

These These a see

For ny

each the the the often . evolution

unique is DNS and computer more information, please information, more all

the pu

person

is names. network

[email protected].

wouldn’t and

to

characters.

IDNs. for

It

on computers

Unicode on

e

is users

example, example,

technical

and the

never never of participative, participative, order for the for order Consortium

th naming

and --11b5bs3a9aj6g.

the

we so facilitate

Devanagari

commonly

your

r

of from these

The The

in

local For a o stable This prefix end for are digitalized. are

to f in will a fully internationalized secure secure identified.

1998.

information

ICANN USA USA Australia Belgium access

extend

policy is is contact x

another

i

number into f open, in

content into this

e and r

unique Unicode users scripts. manage

p with

Hindi,

more secure, Internet’s expansion

e

registration reach

achieve be

h

6ci4b4b3a.xn the characters. characters. containing containing purpose and coded and Punycode characters character unique and membership Unicode Hence develop, develop, in T The stable activities, ICANN’s icann.org/topics/idn/ continuing continuing about information Additional About To For languages, characters, and characters, languages, To developers, developers, involvement address to coordinates coordination formed participants net develops control deal the the

-

-

r

o

20006

90292 labels only

for

terms for

easier s is Code script).

Such

only

2000 used DC CA

“LDH and

-Latin as

level

. that TLD

can be characters. I code

here

name NSW

Brussels

speech characters

non

converts names to are provided are term related

all such all

applications Hangul -

Internet

understand.

by marks ASCI only

1040 representation - in Standard -

alternative the o

207.151.159.3, Engineering were allowedin for Sydney Los Angeles, B Washington,

local

IDN t from from

characters). the

IDNA “

these - or languages domain glossary n

name) “ of characters used

ASCII domain name

numerical

domain

ASCII and

Computers Computers represented by local by represented Glossary

convert

tio

for Greek). with

a to before internationaliza before The The

using

typing are --“, to -ASCII

in are

el “ diacritical accurately

Internet Application) protocol Application)

r ASCII

possible

numerical

text.

of

various

(American n

hyphen it being

i in 5 that domain

330

characters

(non

glossary the characters. names

alphabets the Originally, Originally, IDNs d the

with

common Bt.

makes the with (“example.test”

Name the

is

a

of by fact

Floor

]

(the Suite hyphen languages be availablewill languages

use

ASCII

characters is name communications around interna around communications

instead

ASCII makes

5th

Explanations -

Languages DNS Names and online understand. the

in

work write the Way, for level remains in remains level

code subset.

Name)

So domain hen

9,

to Schuman, characters. to NW,

– subset to non ASCII ASCII that letters W 테스트

characters The

-

0 Hunter Street . . Chinese) world’s

)

characters IDNA isIDNA not able top ” and

] (“example.test” ] that of A this RFC3490

use the DNS can can DNS the use space. space. of domain

48

Code ASCII reference for internationalized top internationalized for reference Point

or

-9 “@ Street widely

in to

non

2, I Domain Admiralty with

r 0 Domain refers refers used the corresponding

dynamic

실례

an [ o

Domain digits address.

simpler that

Rond Alphabets

Hence

subset z, name

are entire entire string so short |

IP 1875 Level 6 4676 System)

contain contain this “a the devices

with

used

labels.

language

refers refers s

a reference for for reference while digits Hyphen) more

a

names topic defined

this a the used the in used the

are

Standard h

-z, Interchange)

a other to could to ASCII using

example:

Name names.

arcane Scripts

strings this level, level, παράδειγμα.test names, names, suc

domain

Scripts familiar

www.internic.net.

[

Digit, Digit, letters

usually r

| by numbers,

e

http://www.ietf.org and For applications into

(

t http://www.icann.org/en/topics/idn/idn-glossary.htm

the broader,

Usually

domain Usually

the

c

protocol the letters

DC

a a type

names make at names.

name

of systems.

a

r

acronyms (for example, Arabic Arabic example, (for

that done referring referring second (Internationalized (Internationalized (American (American

domain is

on to points” restricted

(Domain

(Letter,

Force

only (Internationalized (Internationalized

SLDs TLDs can

is cha

example:

Information

handle a the

Angeles f

characters. labels by allowing a allowing by IDN DNS tion For ASCII domain instead o were were at IDN code contains contains computers understand or languages, European many by required scripts LDH you Task domain writing this for IDNA IDNA to local by represented communities. Languages IDN labels labels domain ASCII Not all characters use in domain domain in use characters

based below tionalizing available

Various Internationalized Sydney Los Washington, Brussels