Extract tabular data behind authentication using selenium
, process it, and push it to snowflake
.
selenium
pandas
snowflake
sqlalchemy
This is the basic workflow. All of the following steps are automated using a selenium webdriver.
Disclaimer The sensitive information is redacted for privacy reasons.
Enter login credentials to the website:
Once logged in, the website displays a list of stores. For every store click the second button (refer to the image below)
Clicking the button in 2, leads you to this page (image below). On this page, expand all the listings:
Once expanded, there are a total of 5 tables (Table #1, Table #2, etc.). Extract all these tables.
Then click on the the three red buttons (labeled 1, 2, 3) and you will a bunch of other tables. Extract all these tables as well.
The above data is extracted in a pandas
DataFrame
for further processing and finally pushed to snowflake
.