[Home]   [Full version]  

NIST tool helps Internet master top-level domains

May 16 ,Technology


At the request of a worldwide Internet organization, a computer scientist at the National Institute of Standards and Technology developed an algorithm that may guide applicants in proposing new “top-level domains”—the last part of an Internet address, such as .com, that people type in navigating the Web.

As new top-level domains are added to the familiar .com, .info and .net, the algorithm* checks whether the newly proposed name is confusingly similar to existing ones by looking for visual likenesses in its appearance. Having visually distinct top-level domain names may help avoid confusion in navigating the ever-expanding Internet and combat fraud, by reducing the potential to create malicious look-alikes: .C0M with a zero instead of .COM, for instance.

Later this year, the Internet Corporation for Assigned Names and Numbers (ICANN) plans to launch the process for proposing a new round of “generic” top-level domains (gTLDs), strings such as .net, .gov and .org meant to indicate organizations or interests. In preparing for newly proposed gTLDs, ICANN reached out to various algorithm developers, including NIST’s Paul E. Black, as among those engaged to “provide an open, objective, and predictable mechanism for assessing the degree of visual confusion” in gTLDs.

Black’s algorithm compares a proposed gTLD with other TLDs and generates a score based on their visual similarities. For example, the domain .C0M scores an 88 percent visual similarity with the familiar .COM. The resulting scores may help indicate whether the newly proposed domain name looks too much like existing ones.

To make its assessments, the algorithm rates the degree of similarity between pairs of alpha-numeric characters. Some pairs, such as the numeral “1” and its dead-ringer, the lowercase letter “l,” are assigned the highest scores for visual similarity while other pairs, such as “h” and “n”, are given lower scores. The algorithm takes other considerations into account, for example how certain pairs of letters, like “c” and “l,” can join to look like a third letter (“d”), as in the case of “close” and “dose.”

Employing these scores and considerations, the algorithm computes the “cost” of transforming one string of characters into another, such as “opel” into “apple.” Lower cost means higher visual similarity. The algorithm then adjusts for the relative lengths of the two strings (different lengths increase their distinctiveness) and converts the final cost into a percent similarity.

ICANN is considering future enhancements to the algorithm, such as having it check for visual confusion between existing domains and future planned Internet top-level domain names in scripts such as Cyrillic.

Source: National Institute of Standards and Technology

Related stories:

Ask.com hopes to make search faster, more relevant
(AP) -- Assuming your company's name isn't a verb synonymous with looking things up online, how do you get Web surfers to not just try your search engine, but also frequent it?
UCLA group discovers humongous prime number
(AP) -- Mathematicians at UCLA have discovered a 13 million-digit prime number, a long-sought milestone that makes them eligible for a $100,000 prize.
IBM Accelerates Virtual Desktop With Breakthrough Solution
IBM today announced a powerful new solution to help organizations slash virtual desktop infrastructure storage requirements by up to 80 percent, allowing them to take advantage of new cloud computing models at significantly reduced costs while increasing energy efficiency.
What a sleep study can reveal about fibromyalgia
Research engineers and sleep medicine specialists from two Michigan universities have joined technical and clinical hands to put innovative quantitative analysis, signal-processing technology and computer algorithms to work in the sleep lab. One of their recent findings is that a new approach to analyzing sleep fragmentation appears to distinguish fibromyalgia patients from healthy controls.
Subliminal learning demonstrated in the human brain
Although the idea that instrumental learning can occur subconsciously has been around for nearly a century, it had not been unequivocally demonstrated. Now, a new study published by Cell Press in the August 28 issue of the journal Neuron used sophisticated perceptual masking, computational modeling, and neuroimaging to show that instrumental learning can occur in the human brain without conscious processing of contextual cues.
New Algorithm Significantly Boosts Routing Efficiency of Networks
(PhysOrg.com) -- A time-and-money-saving question shared by commuters in their cars and networks sharing ever-changing Internet resources is: "What's the best way to get from here to there?"
New immunization strategy could halve the doses for stopping computer virus spreading
Researchers have developed a new immunization strategy that requires up to 50% fewer immunization doses compared with the current most efficient strategy. The new strategy could be used to prevent the spread of human epidemics and computer viruses, and it applies to a wide variety of networks.
Multithreaded supercomputer seeks software for data-intensive computing
The newest breed of supercomputers have hardware set up not just for speed, but also to better tackle large networks of seemingly random data. And now, a multi-institutional group of researchers has been awarded $4.0 million to develop software for these supercomputers. Applications include anywhere complex webs of information can be found: from internet security and power grid stability to complex biological networks.

News discussion:

Technology news

[Home]   [Full version]