Apify and Crawlee Official Forum

Updated 4 months ago

Error: Dataset with id: [ACTOR_DEFAULT_DATASET_ID] does not exist.

Hello everyone,

I'm trying to use apify actor:push-data inside an actor. The command is inside a dockerized shell script [1] and as simple as this:
Plain Text
curl http://httpbin.org/ip | apify actor:push-data


But the run fails with:
Plain Text
2024-04-18T13:34:28.642Z     at DatasetClient.throwOnNonExisting (/usr/local/lib/node_modules/apify-cli/node_modules/@crawlee/memory-storage/resource-clients/common/base-client.js:15:15)
2024-04-18T13:34:28.643Z     at DatasetClient.pushItems (/usr/local/lib/node_modules/apify-cli/node_modules/@crawlee/memory-storage/resource-clients/dataset.js:158:18)
2024-04-18T13:34:28.644Z     at async PushDataCommand.run (/usr/local/lib/node_modules/apify-cli/src/commands/actor/push-data.js:25:9)
2024-04-18T13:34:28.644Z     at async PushDataCommand._run (/usr/local/lib/node_modules/apify-cli/node_modules/@oclif/command/lib/command.js:43:20)
2024-04-18T13:34:28.645Z     at async Config.runCommand (/usr/local/lib/node_modules/apify-cli/node_modules/@oclif/config/lib/config.js:173:24)
2024-04-18T13:34:28.646Z     at async Main.run (/usr/local/lib/node_modules/apify-cli/node_modules/@oclif/command/lib/main.js:28:9)
2024-04-18T13:34:28.647Z     at async Main._run (/usr/local/lib/node_modules/apify-cli/node_modules/@oclif/command/lib/command.js:43:20)
2024-04-18T13:34:28.648Z     at async /usr/local/lib/node_modules/apify-cli/src/bin/run:7:9


I think I exhausted all the debug option I found online, so I kindly ask for help here.

What am I missing?

[1] https://github.com/netmilk/actor-curl/blob/d31d4805e5be4a6ef06b3f2e291bbb44da517000/bin/start.sh#L6
P
n
4 comments
Hi ,
Thank you for your feedback I will reach out to our tooling team for more information.
Hey , thank you, I appreciate that. I can't wait to crack this one. πŸ™‚
Just a quick note. Please, prove me wrong, but I think I found out it couldn't ever work. This [1] condition is always true and the cli then always uses the MemoryStorage regardless the ENV variables exposed by the Run for the Actor. If I overrode the forceCloud to true it uses the ApifyClient adaptor, it works as expected and I can see the data in the "Run Detail > Storage > Dataset" tab in the Apify Cloud Console. I'll file a Github Issue later and let's follow up there. Ok?
[1] https://github.com/apify/apify-cli/blob/8f7b1fd84c4070a11b0af26b64c6a5002f7a7218/src/lib/actor.ts#L30
Add a reply
Sign up and join the conversation on Discord