Dockerfile and script for running crawl jobs in Google Cloud Run with captcha support #208
Conversation
Add additional python script for gcloud run
Updated dependencies
Commits 93f0907 to 966ce83
@@ -161,15 +161,35 @@ First build the image inside the project's root directory:
 $ docker build -t flathunter .
 ```

-**When running a container using the image, a config file needs to be mounted on the container at ```/config.yaml```.** The example below provides the file ```config.yaml``` off the current working directory:
+**When running a container using the image, a config file needs to be mounted on the container at ```/config.yaml``` or configuration has to be supplied using environment variables.** The example below provides the file ```config.yaml``` off the current working directory:
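For illustration, a run command matching that description could look something like the sketch below. The exact flags and paths are those of the README's own example, which is not shown in this hunk, so treat this as an assumption rather than the documented invocation:

```
# Sketch: mount a local config.yaml into the container at /config.yaml.
# Flags and paths are illustrative; the README's own example may differ.
$ docker run --rm -v "$(pwd)/config.yaml:/config.yaml" flathunter
```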
We should think about splitting the README into separate .md files and using the main file only as an index, because it is already so large.
Yeah. That would make sense. If you have a proposal for how you would like to see that split, I would welcome that. At least the basic getting-started instructions should maybe be at the top of the main README, and people who want to know more can dig a little.
This PR adds support for Google Cloud Run, and adds a Docker image that can be launched to trigger a one-time crawl (which can then be set up to run on a schedule).
I have refactored the config handling so that it is possible to pass the relevant config via environment variables.
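Roughly, something like this should then be possible. The variable names below are placeholders for illustration, not necessarily the names this PR introduces:

```
# Sketch: configure via environment variables instead of mounting config.yaml.
# FLATHUNTER_TARGET_URLS and FLATHUNTER_TELEGRAM_BOT_TOKEN are hypothetical
# placeholder names, not confirmed variable names from this PR.
$ docker run --rm \
    -e FLATHUNTER_TARGET_URLS="https://www.example-portal.de/search..." \
    -e FLATHUNTER_TELEGRAM_BOT_TOKEN="123456:ABC..." \
    flathunter
```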
This still needs a bunch of documentation - opening the PR for discussion.
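For the scheduling piece, one common pattern (an assumption here, not necessarily what this PR ends up implementing) is to deploy the image to Cloud Run and trigger it periodically with Cloud Scheduler. Service name, region, schedule, and URL below are placeholders:

```
# Sketch only: deploy the image as a Cloud Run service and trigger it on a
# schedule via Cloud Scheduler. All names, the region, the schedule, and the
# URL are placeholders; the PR's actual entry point and trigger may differ.
$ gcloud run deploy flathunter-crawl \
    --image gcr.io/MY_PROJECT/flathunter \
    --region europe-west1 \
    --no-allow-unauthenticated

$ gcloud scheduler jobs create http flathunter-crawl-job \
    --schedule "*/30 * * * *" \
    --uri "https://flathunter-crawl-<hash>-ew.a.run.app/" \
    --http-method GET \
    --oidc-service-account-email scheduler@MY_PROJECT.iam.gserviceaccount.com
```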