The citation standard defined here offers proper recognition to authors as well as permanent identification through the use of global, persistent identifiers in place of URLs, which can change frequently. Use of universal numerical fingerprints (UNFs) guarantees to the scholarly community that future researchers will be able to verify that data retrieved is identical to that used in a publication decades earlier, even if it has changed storage media, operating systems, hardware, and statistical program format.
Following is an authentic example of a replication data-set citation (from International Studies Quarterly, King and Zeng, 2007: PDF, p.209):
Gary King; Langche Zeng, 2006, "Replication Data Set for 'When Can History be Our Guide? The Pitfalls of Counterfactual Inference'" hdl:1902.1/DXRXCFAWPK UNF:3:DaYlT6QSX9r0D50ye+tXpA== Murray Research Archive [distributor]
This citation has six components. Three are readable by humans: the author, title and year. Two components are machine-readable, and one is optional. Of the machine-readable components to this citation, the unique global identifier begins with "hdl" (this refers to the international HANDLE.NET system). The universal numerical fingerprint begins with "UNF". This identifier is designed to persist even if URLs--or the web itself--are replaced with something else. When the citation appears online, the identifier is hot-linked to the URL that references the identifier, which works in browsers available today. In print, the URL is also included in the citation.
Four features make the UNF especially useful:
Citations also can have optional features in a standard format, such as "Murray Research Archive [distributor]", which lists a network type in square brackets that is selected from a given, controlled vocabulary.
Learn more: Micah Altman and Gary King. 2007. "A Proposed Standard for the Scholarly Citation of Quantitative Data," D-Lib Magazine, Vol. 13, No. 3/4 (March). (Abstract: HTML | Article: PDF)