Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Paid coding and scraping help for SSHRC-Funded Project (Dept of Sociology) #403

Open
CGuzman88 opened this issue Sep 4, 2018 · 1 comment

Comments

@CGuzman88
Copy link

Department of Sociology | University of Toronto
Cinthya Guzman | Ph.D. Student c.guzman@mail.utoronto.ca
on behalf of Dan Silver | Associate Professor dsilver@utsc.utoronto.ca
Job Opportunity

We're studying how the discipline of sociology has evolved over time, in particular how its ideas about sociological theory have changed — what authors to teach, what topics to emphasize, and so on. If you’re interested in the background, here’s a first paper, where we analyzed syllabi, along with some other data.
For the next phase, we are looking at the content and reach of textbooks, over time. The bottom line is that we’re looking for a little bit of help in creating a database from Worldcat listings. I’m wondering if you might be able to help us by writing code for taking it from their API, or scraping if need be. I understand you might be busy or this might not be of interest. If so, maybe you might be able to connect me with somebody else who could, perhaps a CS/Engineering undergrad? I’d of course compensate for the work.

Some more details: Broadly speaking, we are interested in tracing the history of sociological/social theory through the textbooks and books in circulation since the 1950s (or earlier if available). We are mainly interested in tracing how sociological theory has developed in North America, but also are looking to its reach and adaptation elsewhere. By looking into the books that were in circulation since the 1950s, we would be able to trace the canonization of sociological theory and when and how it began to take its current shape.
In the end, we would like to be able to map the presence and demand for these books. Once we have this data, we can conduct a closer examination of which theories and theorists are taught and shifts across time and place.
Worldcat tells us: i) the number of editions of a particular book, and their dates of publication, ii) the number of copies available across all their listed libraries, iii) the geographical location of these copies (e.g. university, city, state, country, lon/lat), iv) author and publisher information, v) language translations, and vi) whether the book has been digitalized.
There’s a few ways we could approach this
Option 1: Through the U of T library, we have already compiled a list of about 400 books we’d like to include. Using this list, we could compile all the relevant information in WorldCat about those books.
Option 2: requires a new term search using “sociological theory” on WorldCat and sets the parameters to only capture: Print book, Ebook, and Continually updated resources. We may need some additional search terms. Using the books this search returns, we would like the same information listed above.
Option 2 will produce a lot of noise, so probably we would go with option 1. For either, though, there’s the question about how to access and compile the data. There’s some info about their APIs here: https://www.worldcat.org though I’m not sure what to make of it.

This might be a big vague, and so happy to discuss further.

@chungkky
Copy link

chungkky commented Sep 4, 2018 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants