Apify

Apify and Crawlee Official Forum


Handling HTML structures of different websites

Hey,

I want to scrape multiple e-commerce web shops that have different HTML structures.

I was thinking about writing a separate handler for each shop, so that each handler deals with that shop's HTML on its own. In the end, every site should produce the same kind of data, such as price, title, in-stock sizes, etc. This is necessary because the data is processed afterwards, so every product has to match one schema.

Is this the best way to do it? I honestly don't know how to turn this into code yet; for now I'm mostly thinking about a good approach. I would like to hear if there is a better one 🙂
1 comment
Hi, using one Actor per site should be the cleanest approach. You can still share code between the Actors, for example by putting it in a private npm package.
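
To make the split a bit more concrete, here is a minimal sketch of what could be shared and what would stay site-specific. Everything in it is an assumption for illustration: the `Product` fields, the helper names (`parsePrice`, `toProduct`, `extractShopA`, `extractShopB`), and the HTML snippets of the two hypothetical shops. The regexes only stand in for whatever selectors you would actually use in each Actor (for example Cheerio selectors); the point is that each site returns raw strings and one shared function turns them into the common schema.

```typescript
// ---- Shared code (hypothetical; this part could live in the private npm package) ----

// The common schema every shop has to end up with.
interface Product {
  url: string;
  title: string;
  price: number;
  sizesInStock: string[];
}

// What a site-specific extractor hands over: raw, unnormalized strings.
interface RawProduct {
  title: string | undefined;
  price: string | undefined;
  sizesInStock: string[];
}

// Parse strings such as "€ 1.299,95" or "$49.90" into a number.
// Assumption: the last "." or "," is a decimal mark only when exactly
// two digits follow it; every other separator is a thousands separator.
function parsePrice(text: string): number {
  const cleaned = text.replace(/[^\d.,]/g, "");
  const lastSep = Math.max(cleaned.lastIndexOf(","), cleaned.lastIndexOf("."));
  const hasDecimals = lastSep !== -1 && cleaned.length - lastSep - 1 === 2;
  const intPart = (hasDecimals ? cleaned.slice(0, lastSep) : cleaned).replace(/[.,]/g, "");
  const fracPart = hasDecimals ? cleaned.slice(lastSep + 1) : "0";
  if (intPart === "") throw new Error(`Cannot parse price: "${text}"`);
  return Number(`${intPart}.${fracPart}`);
}

// Validate and normalize a raw record into the common schema.
function toProduct(url: string, raw: RawProduct): Product {
  const title = raw.title ? raw.title.trim() : "";
  if (title === "") throw new Error(`Missing title for ${url}`);
  if (!raw.price) throw new Error(`Missing price for ${url}`);
  return {
    url,
    title,
    price: parsePrice(raw.price),
    sizesInStock: raw.sizesInStock.map((s) => s.trim()).filter((s) => s.length > 0),
  };
}

// Collect the first capture group of every match of `pattern` in `html`.
function allMatches(html: string, pattern: RegExp): string[] {
  const re = new RegExp(pattern.source, "g"); // force the global flag so exec() advances
  const out: string[] = [];
  let m: RegExpExecArray | null;
  while ((m = re.exec(html)) !== null) out.push(m[1]);
  return out;
}

// ---- Site-specific code (one extractor per Actor; markup below is made up) ----

// Hypothetical shop A: data sits in element text.
function extractShopA(html: string): RawProduct {
  return {
    title: html.match(/<h1 class="product-name">([^<]*)<\/h1>/)?.[1],
    price: html.match(/<span class="price">([^<]*)<\/span>/)?.[1],
    sizesInStock: allMatches(html, /<li class="size in-stock">([^<]*)<\/li>/),
  };
}

// Hypothetical shop B: data sits in data-* attributes.
function extractShopB(html: string): RawProduct {
  return {
    title: html.match(/data-title="([^"]*)"/)?.[1],
    price: html.match(/data-price="([^"]*)"/)?.[1],
    sizesInStock: allMatches(html, /<option data-available="true">([^<]*)<\/option>/),
  };
}
```

In each Actor, the request handler would then only call its own extractor and pass the result through `toProduct` before storing it, so a broken selector on one site fails loudly with a "Missing title" or "Missing price" error instead of pushing incomplete items downstream.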