Create dictionary of statutes from UCCS schema #29

Open
newswim opened this issue May 21, 2023 · 4 comments

@newswim
Contributor

newswim commented May 21, 2023

Today, when we normalize statute data, we send a CSV containing the unprocessed charge data to the University of Michigan, and they send back a processed CSV. Since we have a record of the inputs and outputs, we should be able to create a dictionary that maps unprocessed strings to their processed strings, based on the case number and charge.

If the lookup fails with an error such as "Statute not found", we'll add that string to a new data set and send it to the University of Michigan for processing.

@emilyrbartha should have access to uccs_schema.csv.
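
As a starting point for whoever picks this up, here is a minimal sketch of the lookup described above. It assumes the raw and processed CSVs can be paired on two columns; the column names `case_number`, `charge_id`, `charge_raw`, and `charge_processed` are hypothetical placeholders, not the real headers of uccs_schema.csv, and the sample rows are made up for illustration.

```python
import csv
import io


def build_statute_dictionary(raw_file, processed_file):
    """Map each unprocessed charge string to its processed string.

    Rows are paired on (case_number, charge_id). Both column names are
    hypothetical and should be replaced with the real CSV headers.
    """
    processed_by_key = {
        (row["case_number"], row["charge_id"]): row["charge_processed"]
        for row in csv.DictReader(processed_file)
    }
    mapping = {}
    for row in csv.DictReader(raw_file):
        key = (row["case_number"], row["charge_id"])
        if key in processed_by_key:
            mapping[row["charge_raw"]] = processed_by_key[key]
    return mapping


def normalize_charges(charges, mapping):
    """Return (normalized, not_found) for a list of raw charge strings.

    Strings missing from the mapping are collected, without duplicates,
    so they can be sent out for processing later.
    """
    normalized, not_found = {}, []
    for charge in charges:
        if charge in mapping:
            normalized[charge] = mapping[charge]
        elif charge not in not_found:
            not_found.append(charge)
    return normalized, not_found


# Made-up sample data standing in for the sent and returned CSVs.
raw_csv = io.StringIO(
    "case_number,charge_id,charge_raw\n"
    "23-0001,1,ARSON\n"
    "23-0002,1,THEFT PROP\n"
)
processed_csv = io.StringIO(
    "case_number,charge_id,charge_processed\n"
    "23-0001,1,Arson\n"
    "23-0002,1,Theft of Property\n"
)

mapping = build_statute_dictionary(raw_csv, processed_csv)
normalized, not_found = normalize_charges(["ARSON", "BURGLARY"], mapping)
```

Collecting misses in a list instead of raising keeps one unknown charge from stopping the whole batch; the `not_found` list is what would go into the new data set.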

@newswim newswim added good first issue Good for newcomers help wanted Extra attention is needed labels May 21, 2023
@tpadmanabhan
Collaborator

Review with Nick and Nidhi

@tpadmanabhan
Collaborator

@newswim review with @nicolassaw

@nicolassaw

@newswim How do we send this data to the University of Michigan and receive it back? Is there an API we can send jobs to? I pulled the last week of data and ran it against the big UMich database (7-8k unique charge pairs) and found that many of the charges on the new cases are missing. This makes it hard to normalize charge type. Perhaps some formatting changes, like removing the count value from "(COUNT ONE) ARSON", might help.
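
A rough sketch of that formatting change, assuming the count always appears as a leading parenthetical starting with the word COUNT. The function name `strip_count_prefix` is made up for illustration, and the pattern has not been checked against the real charge data.

```python
import re

# Matches a leading "(COUNT ...)" group, e.g. "(COUNT ONE) " or "(count 2) ".
# The \b keeps words like "(COUNTY ...)" from being treated as a count.
COUNT_PREFIX = re.compile(r"^\s*\(\s*COUNT\b[^)]*\)\s*", re.IGNORECASE)


def strip_count_prefix(charge):
    """Remove a leading count marker and surrounding whitespace."""
    return COUNT_PREFIX.sub("", charge).strip()


cleaned = [
    strip_count_prefix(c)
    for c in ["(COUNT ONE) ARSON", "(count 2)  Theft", "ARSON"]
]
```

Running the cleaned string through the dictionary lookup, instead of the raw one, should let "(COUNT ONE) ARSON" and "ARSON" resolve to the same entry.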

@nicolassaw

Uploaded a JSON file of UMich charge data to the resources folder.

3 participants