A simple API for downloading and reading XML data directly from Lattes <http://lattes.cnpq.br/>.
**ATTENTION: The package is not working as of 2017-11-26. The Lattes website, where the XML files were available, is offline.**
Lattes is a unique platform for academic curricula, and the largest of its kind. There you can find information about the academic work of all Brazilian scholars, including the institution where the PhD was obtained, current employer, field of work, metadata for all publications, and more. It is a unique and reliable source of information for bibliometric studies.
I've been working with Lattes data for some time. Here I present a short list of papers that have used this data.
Is predatory publishing a real threat? Evidence from a large database study. Working paper
GetLattesData is a wrapper around functions I've been using to access the dataset. Its main innovation is the ability to download data directly from Lattes, without any manual work or captcha solving.
The package is available on CRAN:
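A minimal sketch of the CRAN installation, assuming the package is still listed there under the name `GetLattesData`:

```r
# Install the released version from CRAN
# (assumes the package is still available on CRAN under this name)
install.packages('GetLattesData')
```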
You can also install the development version from GitHub:
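A sketch of the GitHub installation follows. The repository path `msperlin/GetLattesData` is an assumption, since it is not stated in this text, and so is the use of the `devtools` package. Adjust both if the repository lives elsewhere.

```r
# Install devtools first if it is not already available
if (!requireNamespace('devtools', quietly = TRUE)) {
  install.packages('devtools')
}

# Install the development version from GitHub
# NOTE: the repository path below is an assumption, not confirmed by this document
devtools::install_github('msperlin/GetLattesData')
```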
    library(GetLattesData)

    # ids from EA-UFRGS
    my.ids <- c('K4713546D3', 'K4440252H7', 'K4783858A0', 'K4723925J2')

    # qualis for the field of management
    field.qualis <- 'ADMINISTRAÇÃO PÚBLICA E DE EMPRESAS, CIÊNCIAS CONTÁBEIS E TURISMO'

    l.out <- gld_get_lattes_data(id.vec = my.ids, field.qualis = field.qualis)

    tpublic <- l.out$tpublic
    dplyr::glimpse(tpublic)
Lattes website is offline. Online downloading of xml files is no longer possible.