Skip to end of banner Go to start of banner

2024-09-19 Meeting notes

Skip to end of metadata

Created by Tyler Hill on Sept 19, 2024

Go to start of metadata

You are viewing an old version of this content. View the current version.

Compare with Current View Version History

\uD83D\uDDD3 Date

19 Sep 2024

\uD83D\uDC65 Participants

\uD83E\uDD45 Agenda

\uD83D\uDDE3 Discussion topics

Item

Notes

Overview

Josh will be adding more intro issues to the GitHub soon
Will be going through chromedp instead of continuing A Tour of Go

chromedp

in api-tools in main.go: runs all the scrapers, we’ll be looking at scrapeCoursebook
- initChromeDp first
- RunResponse runs a set of actions on the context
  - Run would not be able to handle page navigation/changes, like what you trigger by clicking a button
- use RunResponse to log in with your netID and password to scrape without being rate limited
  - ClearBrowserCookies (using ActionFunc which is like a custom action) to get a new token at the start of each new prefix which resets your rate limit
  - get headers from with cookies to use in later requests
- make a new request to coursebooks’s behind-the-scenes url for each course (this reverse engineering makes scraping faster and easier, don’t have to click thru so many dropdowns/buttons)
  - and then for each section in each course
  - also refresh the token every 30 seconds, just in case

⤴ Decisions

✅ Action items

No labels