Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Product Engineering Review: Marketing_Campaign_Effectiveness_Preditction_PY_SQL #582

Open
chetan-hirapara opened this issue Apr 8, 2024 · 0 comments
Assignees
Labels
enhancement Adding graphs and improving output

Comments

@chetan-hirapara
Copy link
Collaborator

chetan-hirapara commented Apr 8, 2024

Reviewer 1 comments:

  • warnings are suppressed.
  • SQL Queries are used or DataFrame creation.
  • Do not use external plots as it pulls the data to client.
  • SQL Queries are used for TD_PLOT

Reviewer 1 suggestions:

  • Avoid suppressing the warnings. it may mask some of the genuine issues.
  • notebook has a reference for query which has window aggregate on grouping data. It should be modified and use teradataml window aggregates.
  • Notebook uses a barplot from matplotlib. teradataml supports barplot. Please look at options and use inline plot if it works for you.
  • Notebook uses SQL Query for group by. use teradataml group by instead.
  • Use teradataml DataFrame.plot instead of SQL Query.
  • Used SQL query for TrainTestSplit instead of teradataml. Use teradataml analytic function.
  • Filtered the data based on SQL. Use teradataml dataframe filtering. If we want to store the filtered data, use to_sql on top of filtered DataFrame.
  • used SQL for XGBoost. Convert it to teradataml function.
  • used SQL for ClassificationEvaluator. Instead, use XGBoostModel.evaluate. That should automatically does evaluation.
  • used SQL for DecisionForest. Instead, use teradataml function.

Reviewer 2 comments:

  • in section 3.1 pandas shape and and sort used.
  • queries used for cast,sum,count function.
  • long sql queries used.

Reviewer 2 suggestions:

  • use teradataml functions wherever possible
@chetan-hirapara chetan-hirapara self-assigned this Apr 8, 2024
@chetan-hirapara chetan-hirapara added the enhancement Adding graphs and improving output label Apr 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Adding graphs and improving output
Projects
None yet
Development

No branches or pull requests

1 participant