Comprehensive history of grandfathered (disallowed) unicode character general top-level (gTLD) domain names.

Introduction

Unicode domain names, also known as Internationalized Domain Names (IDNs), allow the use of non-ASCII characters, such as emojis and symbols from various scripts, in internet domain names. Unlike traditional domain names restricted to 37 ASCII characters (English letters, digits 0-9, and the hyphen), unicode domain names expand possibilities by incorporating characters from diverse unicode character sets, like currencies (€), mathematical operators (÷), and emojis (😉).

To function within the internet’s infrastructure, these special characters are encoded into an ASCII-compatible format called punycode, prefixed with "xn--". For example, "music♫.com" becomes "xn--music-ht1c.com" in punycode.

With the exception of the information symbol (ℹ), trademark symbol (™️) and "m" emoji (Ⓜ️), a unicode character or emoji must be converted into punycode to work as a domain name. Punycode is a character encoding method used for IDNs. This representation is used when registering domains containing special characters. The ASCII representation starts with the prefix "xn--" and is followed by the unicode character(s) or emoji(s) encoded as punycode. For example, "xn--qei" is "❤" and "xn--yp8h" is "👍" when converted back to Unicode.

Each emoji has a unique punycode representation. For example, "😉" is represented as "xn--n28h" and "☚" as "xn--b4h". There are several tools on the internet that help with converting unicode characters and emojis to punycode and back, but the most popular is punycoder.com.

While it is possible to register disallowed unicode character and emoji domain names on some country-code top-level domains (ccTLDs) to this day, it is NOT POSSIBLE (since 2010) to register disallowed unicode character or emoji domain names on general top-level domains (gTLDs).

This is because all gTLDs must follow a strict set of rules and restrictions imposed by Internet Corporation for Assigned Names and Numbers (ICANN) and Internet Engineering Task Force (IETF). ccTLDs are not required to adhere to all of these registration restrictions, as it is up to each country to decide what is best for it's respective, assigned ccTLD.

For more details, go to datatracker.ietf.org/doc/html/rfc5891.

For a specific list of disallowed unicode characters, go to datatracker.ietf.org/doc/html/rfc5892.

Notable Events

Prior to April 2001, domain names were limited to 37 characters: 26 English letters, 0 to 9, and the hyphen (-).

On 19 April 2001, ICANN and Verisign expanded the allowed set of characters to include a limited range of symbols drawn from a few different Unicode character sets below:

At least four single-character unicode domains were created on this day: ☻.com (xn--84h.com), ♨️.com (xn--j6h.com), ♨️.net (xn--j6h.net), ☮️.com (xn--v4h.com).

In 2003, i♥you.com (xn--iyou-5u3b.com), i♥.com (xn--i-n3p.com), and over a thousand other heart-related unicode domains were registered by Menno de Ruiter for the first mass-scale emoji project on the internet in conjunction with the two principal programmers for the punycode/IDN project at VeriSign and Galcomm ICANN registrar. VeriSign would later delete the majority of these heart-related unicode domains, without notice, leaving the project owner out of hundreds of thousands of dollars.

In 2005, ICANN deleted all .org domain names that contained disallowed unicode characters without notice, leaving owners confused, upset, and empty-handed.

In 2008, the IDNA2008 standard is initiated by ICANN and advises against emoji domains due to security and usability concerns.

In 2009, ICANN introduced IDNs, allowing domain names in non-Latin scripts to be registered.

In 2010, the Unicode Consortium formally recognized 722 emojis and adopted 33 of these unicode characters as emojis (unicode.org/reports/tr51/).

The IDNA2008 is published and accepted by ICANN, effectively banning any further emoji-based .com and .net registrations (unicode.org/reports/tr46/).

Currently Registered

On 7 July 2017, the people over at i❤️.ws (Domain Research Group) were granted access to the raw .com zone file containing all registered .com domain names in existence. Of the 305,782,825 registered domain names at the time, only 233 contained disallowed unicode characters. Regrettably, there is no guaranteed way to know exactly how many of these historical domain names haven't been renewed throughout the last years.

Below are .com and .net lists of disallowed unicode character domain names that are currently known to be registered:

List of 166 known, registered, disallowed unicode character .COM domain names as of May 2025:

Not able to display

List of 82 known, registered, disallowed unicode character .NET domain names as of May 2025:

Not able to display

For punycode-only lists, historical lists, or if you are having trouble viewing the lists on this page, go to the GitHub text file folder.

Note: These lists may not be complete. If you find a discrepancy, please email [email protected].

Emoji Domains Today

As of May 2025, only country-code top-level domains (ccTLDs) are able to allow the registration of unicode/emoji domains. Of the 308 ccTLDs in existence today, only 8 actively allow for unicode and/or emoji character domain registrations:

ccTLD Registrars with Functioning Emoji Registrations
.fm (Federated States of Micronesia) regery.com, dot.fm, dynadot.com, *
.gg (Bailiwick of Guernsey) ** porkbun.com, channelislandhosting.net, sav.com, *
.je (Bailiwick of Jersey) ** porkbun.com, channelislandhosting.net, sav.com, *
.kz (Kazakhstan) ps.kz, *
.st (São Tomé and Príncipe) gandi.net, istanco.net, nic.st, *
.to (Tonga) istanco.net, directnic.com, *
.uz (Uzbekistan) webname.uz, ahost.uz, pscloud.uz, billur.com, *
.ws (Samoa) porkbun.com, regery.com, *

* The registrar lists above may not be 100% accurate or complete. Most likely, there are other registrars that allow the registration of unicode/emoji ccTLD domains on their website. Plus, some of these registrars my change their registration restrictions over time.

** The .gg and .je ccTLDs are unique regarding their unicode/emoji character registration acceptance. They allow for unicode characters (xn--123) to be registered, NOT emoji characters (xn--1234).

The registration of unicode or emoji character domain can difficult since it is sometimes not possible to enter special characters into the registration text fields at some registrars. To get around this, try using the punycode version of the character(s) instead.

The availability of a unicode/emoji domain name can be verified by checking the WHOIS (who.to) data of the punycode (punycoder.com) version of the domain.

References and Sources

[Wikipedia, Emoji domain]
(wikipedia.org/wiki/Emoji_domain)

[♨️.com] (xn--j6h.com)

[All the DotCom Emoji Domains]
(emoji-domains.medium.com/all-the-dotcom-emoji-domains-5ad2f638b090)

[There are 52 Emoji DotNet Domains]
(emoji-domains.medium.com/there-are-52-emoji-dotnet-domains-16e66ba71919)

[The Ultimate Guide to Emoji Domains]
(emojidomaininfo.ws)

[Emoji Domains: The Definitive Buyer’s Guide]
(le.land/emoji-domains/)

[Emoji Domains 🎉]
(emoji.domains/)

[I bought 300 emoji domain names from Kazakhstan and built an email service]
(tinyprojects.dev/projects/mailoji)

[Mapping Characters for Internationalized Domain Names in Applications (IDNA) 2008]
(datatracker.ietf.org/doc/html/rfc5895)