Skip to content

This is a project that scrapes PubChem data for GHS hazard codes given CAS numbers

Notifications You must be signed in to change notification settings

chadr1989/GHS-codes-from-CAS-numbers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation

GHS-codes-from-CAS-numbers

This is a project that scrapes PubChem data for GHS hazard codes. Given a spreadsheet file that contains a column named 'CAS' filled with valid CAS codes, the script will add columns for 'PubChem ID' and 'GHS Codes'. The PubChem ID is necessary to search NCBI data in the form of cascading text htmls with no formatting. Just follow the notes, even users with little to no Python experience should be able to run the script and return an edited spreadsheet file with the forementioned data added.