Apify and Crawlee Official Forum
b
F
A
J
A
1,148
Apify Platform Forum
Crawlee JavaScript Forum
Crawlee for Python Forum
Sign up for Apify Platform here
Star Crawlee on GitHub
Star Crawlee for Python on GitHub
Powered by
Hall
Crawlee JavaScript Forum
0
Join
on Discord
save HTML file using crawlee
Crawlee JavaScript Forum
save HTML file using crawlee
0
Join
on Discord
N
Nyanmaru
2 months ago
Has anybody tried downloading the HTML file of the URL using Crawlee? Was wondering if Crawlee has a capacity of downloading the HTML file of the URL since I've just been using Crawlee and really loving the experience.
E
M
N
4 comments
Share
E
Exp
2 months ago
You can download HTML content of a webpages using Crawlee
M
Marco
2 months ago
It depends on which crawler you are using:
Cheerio:
https://cheerio.js.org/docs/api/classes/Cheerio#html
Playwright:
https://playwright.dev/docs/api/class-page#page-content
Puppeteer:
https://pptr.dev/api/puppeteer.page.content
N
Nyanmaru
2 months ago
Thanks for this awesome answer! Was wondering if Crawlee has examples on how to save it to a file?
M
Marco
last month
You can use the KeyValueStore:
https://crawlee.dev/api/core/class/KeyValueStore
. E.g., with Cheerio:
Plain Text
Copy
await store.setValue('my-html', $.html('html'), { contentType: 'text/html' });
Add a reply
Sign up and join the conversation on Discord
Join
on Discord