The first release of JRC-Names (September 2011) contained the names of about 205,000 distinct known entities, plus about the same amount of variant spellings for these entities. Additionally, it contains a number of morphologically inflected variants of these names. By March 2016, the resource has grown to 307,000 distinct entities plus 333,000 variants.
EMM identifies new names every day, and a file including also the most recently found names and name spellings is available for daily download from the JRC's web pages.
As of July 2011, the database included names spelt in 27 different scripts. The most frequently used scripts are Latin (including English and most other European languages), Cyrillic (e.g. Russian and Bulgarian), Arabic (including Farsi), Japanese (Han, Hiragana and Katakana) and Chinese Han (simplified variant).
64% of the names in JRC-Names do not have additional spelling variants. For 28% of the names, JRC-Names knows two or three spellings. There are 3760 entities with ten spellings or more, and 37 entities with over 100 spelling variants. The names with the most spelling variants are Muammar Gaddafi (413 spellings), Mikhail Saakashvili (256) and Mahmoud Ahmadinejad (246) (status July 2011).