Data

Data hosted at Institutions Hub

Clio Infra wishes to express its gratitude to those who allowed us to host their data. When you use these data, please cite the authors’ articles or papers.

Topic: Political institutions and governance
Datasets:
    Adjusted state antiquity dataset
    Extraction ratio
    Government revenues relative to GDP
    Latent democracy indicator, 1850-2000

Topic: Conflicts and wars
Datasets:
    Conflict Catalog (Violent Conflicts 1400 A.D. to the Present in Different Regions of the World)
    Homicide dataset, 1800-2000

Topic: Other institutions
Datasets:
    Foundation of universities dataset

      Links to external data

      These data are not hosted on our server; we only supply links to their respective homepages. Please always consult the original homepage regarding contents, definitions, coverage and the terms of use.

       

      Topic: Political institutions and governance
      Datasets:
          Database of Political Institutions 2010
          World Governance Indicators
          Comparative Political Datasets I
          Comparative Political Datasets II
          Comparative Political Datasets III
          Polity IV
          Democracy and Dictatorship
          Vanhanen’s democracy (polyarchy) dataset
          State Antiquity (Statehist)
          IDEA (Institute for Democracy and Electoral Assistance) voters turnout database

      Topic: Legal system
      Datasets:
          Judicial Checks and Balances

      Topic: Conflicts and wars
      Datasets:
          UCDP/PRIO Armed Conflict Dataset
          UCDP Non-State Conflict Dataset
          Correlates of War, Inter-state wars dataset
          Correlates of War, Intra-state wars dataset

      Topic: Economic institutions
      Datasets:
          Economic Freedom of the World
          Institutional Characteristics of Trade Unions, Wage Setting, State Intervention and Social Pacts

      Topic: Colonial institutions
      Datasets:
          Geodist
          Colonial/Dependency Contiguity, 1816-2002
          Transatlantic slave trade

      Topic: Data on ethnicity, language and religion
      Datasets:
          Ethnic, Linguistic and Religious Fractionalization
          Religion adherence data

      Topic: Ethnographic data on societies
      Datasets:
          Ethnographic Atlas The Standard Cross-cultural Sample

           

           

          Political institutions and governance

          1. Adjusted state antiquity dataset 

          Authors: original state antiquity data: Louis Putterman and Valerie Bockstette, adjustments: Jan Luiten van Zanden
          Content: state antiquity scores 1801-1950, with adjustments for some countries.
          excel format: Click here for State Antiquity.xls hosted on our server
          data description: Click here for State Antiquity Dataset.doc hosted on our server

           

          2. Extraction ratio

          Authors: Jan Luiten van Zanden, Joerg Baten, Peter Foldvari, Bas van Leeuwen

          Content: Income inequality and the extraction ratio for benchmark years between 1820 and 2000. The extraction ratio is defined as the ratio of observed income inequality to the theoretical maximum of income inequality. The latter is estimated under the assumption that the elite (assumed here to be 0.1 or 1% of the population) can and does expropriate all incomes above the subsistence level (assumed to be 400 G-K dollars in 1990 prices) from the non-elite. The gap between observed inequality and this theoretical ceiling is a measure of the power of the elite.

          excel format: Click here for Extraction ratio.xls hosted on our server

          data description: the methodology of income inequality estimates is described here, the methodology of extraction ratio is described here.

          If you use this dataset please cite: Zanden, J. L. van, Baten, J., Foldvari, P. and Leeuwen, B. van (2011) The Changing Shape of Global Inequality 1820-2000: Exploring a new dataset, CGEH Working Paper No. 1
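
As a rough illustration of the extraction ratio definition above, the sketch below computes a theoretical maximum of inequality and the resulting ratio. It assumes that inequality is measured by the Gini coefficient and uses a simple two-group setup: every non-elite member receives exactly the subsistence income and the elite shares the remainder equally, which gives a maximum Gini of (1 - e)(m - s)/m for elite share e, mean income m and subsistence income s. The function names and the example numbers are hypothetical; the linked methodology documents remain the authoritative description of how the dataset was built.

```python
def max_feasible_gini(mean_income, subsistence=400.0, elite_share=0.01):
    """Maximum Gini when the non-elite keep only the subsistence income.

    Assumption for this sketch: two homogeneous groups (elite and non-elite),
    incomes in 1990 G-K dollars, elite_share given as a fraction (0.01 = 1%).
    """
    if not 0.0 < elite_share < 1.0:
        raise ValueError("elite_share must be a fraction between 0 and 1")
    if mean_income <= subsistence:
        raise ValueError("mean income must exceed the subsistence level")
    # Two-group Gini: population share of the non-elite minus their income share.
    return (1.0 - elite_share) * (mean_income - subsistence) / mean_income


def extraction_ratio(observed_gini, mean_income, subsistence=400.0, elite_share=0.01):
    """Observed inequality divided by the theoretical maximum."""
    ceiling = max_feasible_gini(mean_income, subsistence, elite_share)
    return observed_gini / ceiling


if __name__ == "__main__":
    # Hypothetical country: mean income 800 G-K dollars, observed Gini 0.396.
    print(round(max_feasible_gini(800.0), 3))
    print(round(extraction_ratio(0.396, 800.0), 3))
```

A ratio close to 1 means observed inequality is near the ceiling that the mean income allows; a low ratio means the elite extracts only a small part of the available surplus.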

           

          3. Government revenues relative to GDP

          Content: total government revenue, including taxes and excises, as percentage of GDP or comparable measure of aggregate economic activity

          OECD countries (1800-2007)
          Collected by Pim de Zwart
          excel format: click here to download the data in excel format (oecd.xls)
          data description: a description of sources and definitions can be found here (notes.doc)

          African (1950-2005) and Asian (1900-2005) countries
          collected by Peter Foldvari
          excel format: click here to download the data in excel format (asia,africa.xls)
          data source: Mitchell, Brian R., International Historical Statistics, Africa, Asia and Oceania: 1750-2005 (London: Palgrave Macmillan, 2007).

           

          4. Latent democracy indicator, 1850-2000

          Content: a latent democracy indicator, extracted from five components of the Polity IV project's dataset (XRCOMP, XROPEN, XCONST, PARREG, PARCOMP) and two components of Vanhanen's Index of Democracy (participation and competition) using a measurement-error factor model. The number of available countries varies between 38 (1850) and 139 (2000).
           

          excel format: click here to download the data in excel format (latentD.xls)
           

          data description: a description of sources and definitions can be found in Foldvari, P.: A latent democracy measure 1850-2000, Utrecht University, Centre for Global Economic History, Working paper no. 59, June 2014. When using this dataset please cite the above working paper.

           

          Conflicts and wars

          1. Conflict Catalog (Violent Conflicts 1400 A.D. to the Present in Different Regions of the World)

          Authors: Peter Brecke
          Contents: 3708 conflicts, data on parties, fatalities, date and duration.
          Link to data in excel format: Conflict Catalog 18 vars.xls hosted on our server
          Click here for data description.

           

          2. Homicide dataset, 1800-2000

          Content: The number of homicides per 100,000 inhabitants. The official definition of intentional homicide, “unlawful death deliberately inflicted on one person by another person” (OECD, 2011), is used. The dataset excludes civilian and military deaths inflicted during inter-state wars and deaths caused by civil wars (OECD, 2014).

          excel format: click here to download the data in excel format (homicide.xls)
           

          data description: click here to download the description in doc format (descriptionhomicide.doc)

          When using this dataset please cite: Baten, J., Bierman, W., Foldvari, P. and van Zanden, J. L.: Chapter 8, Personal security since 1820. In How Was Life? Global Well-being since 1820 (Jan Luiten van Zanden, Joerg Baten, Marco Mira d’Ercole, Auke Rijpma, Marcel Timmer, eds.), OECD, Paris, 2014.

           

          Other institutions

          1. Foundation of universities dataset

          Author: Peter Foldvari

          Content: The number of universities founded in a year within the current borders of a particular country. Coverage:  1500-2013, 95 countries.

          excel format: click here to download the data in excel format 

          data description: click here to download the description in docx format 

           

          Links to external data

          These data are not hosted on our server; we only supply links to their respective homepages. Please always consult the original homepage regarding contents, definitions, coverage and the terms of use.

           

          Political institutions and governance

          1. Database of Political Institutions 2010

          Authors: Thorsten Beck, George Clarke, Alberto Groff, Philip Keefer, and Patrick Walsh (hosted by World Bank)

          Contents: A wide range of indicators on political institutions for 180 countries, 1975-2010. Variables cover the executive power, the legislature and the election system.

          Click here for link to data (2009) in excel format.

          Click here for link to data (2010) in stata format.

          Click here for link to data description.

          If you use this dataset please cite: Thorsten Beck, George Clarke, Alberto Groff, Philip Keefer, and Patrick Walsh, 2001. "New tools in comparative political economy: The Database of Political Institutions." World Bank Economic Review 15:1, 165-176 (September).

           

          2. World Governance Indicators

          Authors: Daniel Kaufmann, Aart Kraay and Massimo Mastruzzi (hosted by World Bank)

          Contents: different governance indicators for 213 economies, 1996-2010

          Click here for link to data in excel format.

          Click here for link to data description. 

           

          3. Comparative Political Datasets I

          Authors: Klaus Armingeon, David Weisstanner, Sarah Engler, Panajotis Potolidis, Marlène Gerber, Philipp Leimgruber (Institut für Politikwissenschaft, University of Bern)

          Contents: 23 OECD countries, 1960-2009

          Click here for link to data in excel format.

          Click here for link to data in stata format.

          Click here for link to data in spss format.

          Click here for link to data description.

           

          4. Comparative Political Datasets II

          Authors: Klaus Armingeon, David Weisstanner, Sarah Engler, Panajotis Potolidis, Marlène Gerber, Philipp Leimgruber (Institut für Politikwissenschaft, University of Bern)

          Contents: 29 post-Communist countries, 1989-2007

          Click here for link to data in excel format.

          Click here for link to data in spss format.

          Click here for link to data description.

           

          5. Comparative Political Datasets III

          Authors: Klaus Armingeon, David Weisstanner, Sarah Engler, Panajotis Potolidis, Marlène Gerber, and Philipp Leimgruber (Institut für Politikwissenschaft, University of Bern)

          Contents: 35 OECD and EU countries, 1990-2009

          Click here for link to data in excel format.

          Click here for link to data in stata format.

          Click here for link to data in spss format.

          Click here for link to data description.

           

          6. Polity IV

          Authors: Monty G. Marshall, Keith Jaggers, and Ted Robert Gurr

          Contents: 164 countries, 1800-2010, autocracy, democracy index, ranging from -10 to 10

          Click here for link to data in excel format.

          Click here for link to data in spss format.

          Click here for link to data description.  

           

          7. Democracy and Dictatorship

          Authors: José Antonio Cheibub, Jennifer Gandhi and James Raymond Vreeland

          Contents: 204 countries, 1946-2008, types of

          Click here for link to dataset in excel format.

          Click here for link to dataset in spss format.

          Click here for link to data description.

          If you use this dataset, please cite: José Antonio Cheibub, Jennifer Gandhi and James Raymond Vreeland, "Democracy and dictatorship revisited", Public Choice, Volume 143, Numbers 1-2 (2010), 67-101.

           

          8. Vanhanen’s democracy (polyarchy) dataset

          Authors: Tatu Vanhanen

          Contents: 188 countries, 1810-2010, calculated from election outcomes.

          Click here for link to dataset in excel format.

          Click here for link to dataset in stata format.

          Click here for link to dataset in spss format.

          Click here for link to data description.

           

          9. State Antiquity (Statehist)

          Authors: Louis Putterman and Valerie Bockstette

          Contents: 149 countries, scores for the presence of a supra-tribal polity

          Click here for link to data in excel format (version 3).

          Click here for link to data description. 

           

          10. IDEA (Institute for Democracy and Electoral Assistance) voters turnout database

          Contents: data on voter turnout since 1945 from 170 countries

          Click here for the online data

           

          Legal system

          1. Judicial Checks and Balances

          Authors: Rafael La Porta, Florencio López-de-Silanes, Cristian Pop-Eleches, and Andrei Shleifer

          Contents: 71 countries, cross-section

          Click here for link to data in excel format

          Click here for link to data description.

          If you use this data, please cite: La Porta, Rafael, Florencio López-de-Silanes, Cristian Pop-Eleches and Andrei Shleifer. 2004. “Judicial checks and balances”. Journal of Political Economy 112 (April): 445-470.

           

          Conflicts and wars

          1. UCDP/PRIO Armed Conflict Dataset

          Authors: Gleditsch, Nils Petter, Peter Wallensteen, Mikael Eriksson, Margareta Sollenberg, and Håvard Strand

          Contents:  260 armed conflicts 1946-2010

          Click here for link to data in excel format.

          Click here for link to data description.

          If you use this data, please cite: Gleditsch, Nils Petter, Peter Wallensteen, Mikael Eriksson, Margareta Sollenberg, and Håvard Strand. 2002. “Armed Conflict 1946-2001: A New Dataset.” Journal of Peace Research 39(5).

           

          2. UCDP Non-State Conflict Dataset

          Authors: Ralph Sundberg, Kristine Eck and Joakim Kreutz

          Contents: 784 armed conflicts in which none of the parties was a government or state, 1989-2013

          Click here for link to data in excel format.

          Click here for link to data description/ codebook.

          If you use this data, please cite:  Sundberg, Ralph, Kristine Eck and Joakim Kreutz "Introducing the UCDP Non-State Conflict Dataset", Journal of Peace Research, March 2012, 49:351-362 

           

          3. Correlates of War, Inter-state wars dataset

          Authors: Meredith Reid Sarkees and Frank Wayman

          Contents: 95 inter-state wars (among states and governments), 1816-2000

          Click here for link to data in csv format.

          Click here for link to data description/ codebook.

          If you use this data, please cite:  Sarkees, Meredith Reid and Frank Wayman (2010). Resort to War: 1816 - 2007. CQ Press.

           

          4. Correlates of War, Intra-state wars dataset

          Authors: Meredith Reid Sarkees and Frank Wayman

          Contents: 95 intra-state wars (conflicts taking place within the boundaries of a state), 1816-2000

          Click here for link to data in csv format.

          Click here for link to data description/ codebook.

          If you use this data, please cite:  Sarkees, Meredith Reid and Frank Wayman (2010). Resort to War: 1816 - 2007. CQ Press.

           

          Economic institutions

          1. Economic Freedom of the World

          Authors: Fraser Institute

          Contents: 141 countries, 1970, 1975, 1980, 1985, 1990, 1995 and 2000-2009 annually.

          Click here for link to data and data description. You must install a software package (PC) that contains the data and data management tools. You can also export the required data to Excel.

           

          2.  Institutional Characteristics of Trade Unions, Wage Setting, State Intervention and Social Pacts (version 4)

          Authors: Jelle Visser

          Contents: Data on labour unions, collective bargaining, government intervention, minimum wages and strike regulations in 46 countries (OECD, EU, emerging economies), 1960-2011

          Click here for link to data in excel format and data description.

           

          Colonial institutions

          1. Geodist

          Authors: CEPII

          Contents: data on 225 countries, including their distance, official language and colonial past

          Click here for link to data in excel format.

          Click here for link to data in stata format.

           

          2. Colonial/Dependency Contiguity, 1816-2002 

          Authors: Paul Hensel

          Contents: All contiguity relationships between states in the international system through their colonies or dependencies, 1816-2002

          Click here for link to data in csv format.

          Click here for link to data description/ codebook.

          If you use this data, please cite: Correlates of War 2 Project. Colonial/Dependency Contiguity Data, 1816-2002. Version 3.0. Online: http://correlatesofwar.org.

           

          3. Transatlantic slave trade

          Authors: Voyages: The Trans-Atlantic Slave Trade Database by Emory University

          Contents: data on more than 30,000 voyages across the Atlantic, 1514-1866, including the number of slaves, the length of voyages and the percentage of males and children

          Click here for link to the online data.

          If you use this data please cite: David Eltis, “A Brief Overview of the Trans-Atlantic Slave Trade,” Voyages: The Trans-Atlantic Slave Trade Database http://www.slavevoyages.org/tast/assessment/essays-intro-01.faces (accessed April 27, 2008).

           

          Data on ethnicity, language and religion

          1. Ethnic, Linguistic and Religious Fractionalization

          Authors: Alberto Alesina, Arnaud Devleeschauwer, William Easterly, Sergio Kurlat, and Romain Wacziarg

          Contents: 190 countries, ethnic, linguistic and religious fractionalization, cross-section

          Click here for link to data in excel format.

          Click here for link to data description (the original paper).

          If you use this data, please cite: Alberto Alesina, Arnaud Devleeschauwer, William Easterly, Sergio Kurlat, and Romain Wacziarg. "Fractionalization" Journal of Economic Growth, vol. 8, no. 2, June 2003, pp. 155-194.
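
For readers unfamiliar with the measure, fractionalization in the cited paper is one minus the sum of squared group shares, which can be read as the probability that two randomly drawn individuals belong to different groups. The sketch below shows that calculation; the function name and the example shares are hypothetical and are not taken from the dataset.

```python
import math


def fractionalization(shares):
    """Return 1 minus the sum of squared group shares.

    `shares` are population fractions of each ethnic, linguistic or
    religious group in one country and are expected to sum to 1.
    """
    if any(share < 0.0 for share in shares):
        raise ValueError("shares must be non-negative")
    if not math.isclose(sum(shares), 1.0, abs_tol=1e-6):
        raise ValueError("shares must sum to 1")
    return 1.0 - sum(share * share for share in shares)


if __name__ == "__main__":
    # One homogeneous group: no fractionalization.
    print(fractionalization([1.0]))
    # Four equally sized groups (hypothetical).
    print(fractionalization([0.25, 0.25, 0.25, 0.25]))
```

The index is 0 for a fully homogeneous population and approaches 1 as the population splits into many small groups.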

           

          2. Religion adherence data

          Authors: Robert Barro, Rachel M. McCleary

          Contents: 213 countries, 1900, 1970, data on share of religions in population, Herfindahl indices of religious concentration.

          Click here for link to data in excel format.

           

          Ethnographic data on societies

          1. Ethnographic Atlas The Standard Cross-cultural Sample

          Authors: George P. Murdock and Douglas R. White

          Content: data on different cultural, ethnic and institutional aspects of 186 cultures. The data are corrected for the effects of regional diffusion and autocorrelation.

          Click here for the data in SPSS format.

          Click here for codebook.