This guide supports version 1.2 of the Dataverse Network application.
You start viewing and using data on the Datverse Network homepage. These topics describe how to use the Dataverse Network:
When you log on to the Dataverse Network, you see the Network homepage. This homepage has two tabs: Now Available and Coming Soon.
Any user can browse, search, and download files from dataverses on the Now Available tab. These dataverses are designated as released. If a study contains data files that are subsettable, you can subset and analyze those data files, and then download the subsetted data. This tab is the default view of the Network homepage.
Dataverses listed on the Coming Soon tab are designated as not released, and the contents are not available to the public. You cannot search or download studies in these dataverses.
Continue to the next topic to see how to navigate the menus.
All users can perform the following by using the main menu options:
Note: A login timeout period exists. After you log in, if you do not use the interface for a short period of time, the Network might prompt you to log in again when you do start using it.
The Dataverse Network hosts many distinct types of dataverses. Each dataverse can contain any number of studies, and each study can contain any number of data set files. Within a dataverse, studies can be organized into collections. A dataverse also can contain links to collections of studies in other dataverses.
You can browse, search, and view contents of released dataverses within the Network, or of any dataverse or study that you have permission to access. To view the contents of a dataverse, study, or collection, click the title that you choose to view. The contents are expanded and displayed below the selected entity.
Authors and administrators control access to content by using dataverse, study, and file settings and Terms of Use. For example, you can view data files for any study that is set as Released and is available to the public, or for any study that you have permission to access.
For each study you can view two distinct sets of information: the Cataloging Information and the list of study files.
Status and permission settings can apply independently to the following: Dataverses, Studies, and Study files
In addition, Terms of Use can apply at any level: Network, dataverse, or study.
When a dataverse is created, it is set as not released. A dataverse that is not released can be viewed or accessed only by a Dataverse Admin, Curator, or Contributor for that dataverse, or by any user granted permission explicitly to access it. It appears on the Coming Soon tab in the Dataverse Network. When work on the dataverse is complete and the site is set to released, it then appears on the Now Available tab in the Dataverse Network. Any user can view or access the dataverse after it is released.
Permissions can be granted to access a dataverse that is not released.
Note: When a user browses or searches a Dataverse Network, all studies and collections in a dataverse that is not released are omitted from any list of search results.
A study is created within a dataverse, and the initial status of that study is New. When the Contributor (author) determines that work on the study is complete, that user changes the status to In Review. A Curator or Dataverse Admin then reviews the study and changes the status to Released. Any user can view or access that study.
For each study within a dataverse, permissions can be set to Restricted or Public for access to that study. Permissions can be granted to specific users or groups to access Restricted studies, and a setting can be enabled to enable users to request access to Restricted studies.
For each file within a study, permissions can be set to Restricted or Public for access to that file. Permissions can be granted to specific users or groups to access Restricted files.
Dataverses, studies, and data files can have user restrictions applied. If prompted to accept Terms of Use, click the check box and then click the Continue button to view or download the information you chose.
Terms of Use are customizable and can apply at any of three levels:
If a Network has Terms of Use applied to general use, then each time you log in to the Network you must accept the Terms before you can access any options.
If a Network has Terms of Use applied to data uploads or downloads, then any study in that Network includes those Terms. When you select a study from a Network with Terms applied to data downloads, you accept the Terms to download the files. You also must accept Terms when you add a study or file to a dataverse in a Network with Terms applied to study creation.
If a dataverse has Terms of Use applied to data uploads or downloads, then any study in that dataverse includes those Terms. You must accept the Terms to add or download files.
When Terms of Use are applied at the Network or dataverse level, for the first study file that you view or add in the Network or dataverse you must accept the Terms one time per session in the Network.
Individual studies also can have Terms of Use applied. When you select a study with Terms applied and view the Cataloging Information tab, a Terms of Use section appears at the bottom of the tab. Click the blue down-arrow to view the Terms of Use on this tab. If you choose to view, subset, or download any study files or data sets from a study with Terms applied, first you must accept the Terms.
When Terms are applied at the study level, for the first file that you view or download in the study you must accept the Terms one time per session in the Network.
When a study is created, a set of metadata is associated with that study. This metadata is called the Cataloging Information for the study. When you select a study to view it, you first see the Cataloging Information tab listing the metadata associated with that study. This is the default view of a study.
Cataloging Information contains numerous fields that help to describe the study. The amount of information you find for each study varies, based on what was entered by the author (Contributor) or Curator of that study. For example, one study might display the distributor, related material, and geographic coverage. Another study might display only the authors and the abstract. Every study includes the Citation Information fields in the Cataloging Information.
Note: A comprehensive list of all Cataloging Information fields is provided in the List of Metadata.
Cataloging Information is divided into four sections. These sections and their details are displayed only when the author (Contributor) or Curator provides the information when creating the study. Sections consist of the following:
When you view a study, click the Documentation, Data and Analysis tab to view a list of all electronic files associated with the study that were provided by the author or Curator. See the Legend at the bottom of the Documentation, Data and Analysis tab to interpret any icons associated with these files.
A study might contain documentation files and data files. When you upload data files of the type .dta, .sav, or .por to the Network, they are converted to .tab tab-delimited files. These .tab files are subsettable, and can be subsetted and analyzed online by using the Dataverse Network application.
You can identify a subsettable data file by the analysis icon and the number of variables and categories listed next to the file name. Other files that also contain data might be associated with a study, but the Dataverse Network application does not recognize them as data (or subsettable) files.
To find a study or data set, you can search or browse studies offered in any released dataverse on the Now Available tab. Each dataverse offers a hierarchical organization comprising one or more collections of data sets with a particular theme. Most dataverses allow you to search for data within their files, or you can start browsing at the dataverse closest to your substantive interests.
Dataverses are served by DVNs. To view a live DVN installation, go to the IQSS Dataverse Network and browse or search our dataverses.
Keep reading to find out more about these subjects:
You can browse all public dataverses from the Network homepage Now Available tab. Click the title of a dataverse to browse that dataverse's collections and studies. Click the title of a collection to view a list of studies and subcollections for that selection. Click the title of a study to view the Cataloging Information and study files for that selection.
When you select a dataverse to view its contents, the homepage opens to the root collection, and the dataverse's studies are displayed directly under the root collection name. If the root collection contains other collections, then those collections are listed and not the studies within them. You must select a collection title to view the studies contained within it.
Note: If a dataverse includes links to collections from another dataverse and the root collection does not contain other collections, the homepage opens to a list of the root and linked collections.
You can search for studies across the entire DVN from the Network homepage, or search within a dataverse from the dataverse homepage. When you search across the Network, studies from restricted dataverses are not included in the search. If an entire study is restricted (both metadata and files), it is not included in search results unless you have access to that data. After your search is complete, you can further narrow your list of data by searching again in the results. See Search Tips for search examples and guidelines.
When you enter more than one term in the search text field, the results list contains studies that have these terms near each other within the study fields searched. For example, if you enter United Nations, the results include studies where the words United and Nations are separated by no more than four words in the same study field, such as abstract or title.
You can restrict a search to content in the following study fields by using the basic Search drop-down list:
Use the following guidelines to search effectively within a Network or a dataverse:
AND logic within individual fields. That is, if you enter more than one term, the search engine looks for all terms within a single field, such as title or abstract.United Nations report, the results list any studies that include the terms United, Nations, and report within a single metadata field.10 year in a basic search. If a study includes the string 10 millions deaths per year within a metadata field, such as abstract, that study is not included in the search results. A study that contains the string 10 per year within the abstract field is included in the search results.Gary King; Will Lowe, 2003, "10 Million International Dyadic Events", hdl:1902.1/FYXLAWZRIA UNF:3:um06qkr/1tAwpS4roUqAiw== Murray Research Archive [Distributor]
If you type King, 10 Million in the Search field and click Search, you see 0 matches were found in the Results field. If you type 10 in the Search field and click Search, you see something like 1621 matches were found in the Results field. But if you first type King in the Search field and click Search, then type 10 Million in the Search field and click Search again, you see something like 4 matches were found in the Results field.
In an advanced search, you can refine your criteria by choosing which Cataloging Information fields to search. You also can apply logic to the field search. For text fields, you can specify that the field searched either contains or does not contain the text that you enter. For date fields, you can specify that the field searched is either later than or earlier than the date that you enter. Refer to the Documentation page for Query Syntax at the Lucene web site for full syntax details.
To perform an advanced search, click the Advanced Search link at the top-right of the Search panel. You can search the following study metadata fields by using the Search Scope drop-down list:
elections.North America.state.survey data, census/enumeration data, or aggregate data.When your search is complete, the results page lists studies that met the search criteria in order of relevance. For example, a study that includes your search term within the Cataloging Information in ten places appears before a study that includes your search term in the Cataloging Information in only one place.
You can sort search results by title, study ID, or number of downloads (that is, the number of times users downloaded any file belonging to that study). Click the Sort By drop-down list to choose your sort order.
When you browse a collection, the studies contained within the collection are listed alphabetically by title.
You can download any of the following within a study:
The default format for all subsettable data file downloads is tab-delimited. When you download one or more subsettable files in tab-delimited format, the file contains a header row. When you download one subsettable file, you can select from the following formats in addition to tab-delimited:
Note: Studies and data files often have user restrictions applied. If prompted to accept Terms of Use for a study or file, check the I Accept box and then click the Continue button to view or download the file.
If you download all data files within a study, the files are downloaded in a zipped archive, and the individual files are in tab-delimited format. You must unzip the archive to view or use the individual, tab-delimited data files.
To download all data sets associated with a study:
When files are uploaded to a study, the Contributor assigns a category to the file: Documentation or Data File.
If you download all data files within a category, the files are downloaded in a zipped archive, and the individual files are in tab-delimited format. You must unzip the archive to view or use the individual, tab-delimited data files.
To download all data sets within a category:
If you download an individual data file, you can select from several file formats in which to download the data. If you select tab-delimited format, the file is downloaded directly. If you select any other format, the file is downloaded in a zipped archive. You must unzip the archive to view or use the individual data file.
To download one full data set without subsetting or analyzing the contents:
Data files (subsettable files) can be subsetted and analyzed online by using the Dataverse Network application. For analysis, the Dataverse Network offers a user interface to Zelig, a powerful, R-based statistical computing tool. A comprehensive set of statistical analysis models are provided.
After you find the data set that you want, access the Subset and Analysis options to use the online tools. Then, you can subset data by variables or observations, translate it into a convenient format, download subsets, and apply statistics and analysis.
Review the Data Subset and Recode Tips before you start.
You can apply any of the following advanced statistical models to all or some variables in a data set:
You can subset and analyze data files before you download the file or your subsets.
To access the Subset and Analysis options for a data set:
Review the Data Subset and Recode Tips before you start work with a study's files.
To subset and recode variables within a data set:
The variable Name you entered is found among the existing variables; enter a new variable name.Continue to download a subset.
Use the following guidelines when working with data files:
3.(--3.-3-).(-3, 3).[]) for closed ranges.0,[7-9].You can download a subset of variables within a study file. You also can recode a subset of variables and download the recoded subset, if you choose.
To download a subset of variables:
To apply descriptive statistics to a data set or subset:
To run statistical models for selected variables:
Incompatible type error message.