Proposal: Support for Stateless Crawler Execution #1219

Closed Answered by janbuchar
neviaumi asked this question in Q&A

Hi @neviaumi, and thanks for opening this discussion! The Crawlee storage system is currently undergoing a significant refactor (see #1194). Once that lands, you should be able to easily set up a crawler so that each MCP "call" (forgive me for not knowing the correct terminology) has its own non-persistent storage (datasets, key-value stores, and request queues).

In fact, you can already configure the MemoryStorageClient so that it does not write anything to the filesystem; see the highlighted code in https://crawlee.dev/python/docs/deployment/gcp-cloud-run-functions, for example.
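For readers who don't want to click through, here is a minimal sketch of what that configuration might look like. It assumes the pre-refactor Crawlee for Python API, where the global `Configuration` object exposes `persist_storage` and `write_metadata` fields; the helper name `disable_storage_persistence` is made up for illustration, and the field names may well change once #1194 lands.

```python
def disable_storage_persistence() -> None:
    """Keep Crawlee storages in memory only (sketch, assumed pre-refactor API)."""
    # Imported lazily so this module can be loaded even where Crawlee
    # is not installed; in a real project a top-level import is fine.
    from crawlee import service_locator

    configuration = service_locator.get_configuration()

    # Assumption: both fields exist on Configuration in the installed version.
    configuration.persist_storage = False  # do not dump storage data to disk
    configuration.write_metadata = False   # do not write metadata files either
```

The idea would be to call this once at startup, before any crawler or storage is created, so that the settings are in place when the storage client is first used.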

Replies: 2 comments 1 reply

Answer selected by neviaumi