Apify

Apify and Crawlee Official Forum

b
F
A
J
A
Members
!!!Joefree!!! πŸ‘‘
!
!!!Joefree!!! πŸ‘‘
Offline, last seen 19 hours ago
Joined August 30, 2024
I use to be able to put image on README complete with width and height

<img src="https://example.com/image.jpg" width="200" height="300">

but now it seem the width and height is ignored , which makes my README uglier than ever before.
2 comments
o
!
I’m not sure if anyone else has experienced this or if it's just me.

When I don’t use an Actor for a long time, about a month or so, I sometimes notice that when I revisit the Actor console, the input fields are pre-filled by someone else, and I don’t know who. This is my own Actor.

These inputs may contain sensitive data like passwords, as shown in the screenshot.

I can't reproduce this issue, but my guess is that when my saved inputs 'expire,' they are automatically replaced by someone else’s inputs.

I hope someone from @Apifyteam can look into this.

Thank You!
7 comments
S
!
A
The table Headers blocking the dropdown.
When a user share their run, it seem my theme preference is "infected" by the view
although I am using "Light Theme", I view the run in "Dark Theme"
1 comment
!
Hello Team! Does reducing Docker size will reduce loading time and make the Actor load faster ... or ... is it the same ?
Thank You!
5 comments
J
M
!
M
How to know if current Actor have StandBy mode enabled.
So.. the Actor can decide whether it should activate HTTP server or run normal Codes.
Is there any API or ENV variable that I could use ?

Thank You!
6 comments
!
L
I am trying to use RUN --mount inside my docker :
Plain Text
RUN --mount=from=builder,target=/__src__ /python3 /main.py --setup


but it failed :

Plain Text
ACTOR: ERROR: Docker image build failed: the --mount option requires BuildKit. Refer to https://docs.docker.com/go/buildkit/ to learn how to build images with BuildKit enabled


I am still not familiar with docker creation. any clue how to fix this ?
Thank You.
2 comments
!
L
I just realized some Actor maintained by Apify could have static DNS address example: https://super-scraper-api.apify.actor
Is my Actors also have this DNS name ? If yes it could make things easier, for example the OAuth Authentication process
Thank you. πŸ™πŸ»
2 comments
!
I set my Actor "Use custom SEO Details", and saved successfully.
But when return to the setting page, its return to unchecked.
1 comment
P
On API Console theres is example using CURL, but theres no example using the most used language in the world which is pure JS Fetch. Adding Fetch example may help users a lot.

eg:

Plain Text
const APIFY_TOKEN = "YOUR_APIFY_TOKEN"

var run_inputs  = { "location": "New York" }

var url     = "https://api.apify.com/v2/acts/9avOmybPQaoW1CDas/run-sync-get-dataset-items?token=" + APIFY_TOKEN
var headers     = { "content-type": "application/json" }
var options     = { method: "POST", headers, body: JSON.stringify(run_inputs) }

var response = await fetch(url, options).then(res => res.json())


add another language (PHP, ruby) example might useful to.
1 comment
o
When I want to empty secret input, error show up
Error: Expected value to be of type string but received type "null"
7 comments
!
R
Currenly the Apify API doesn't support Array as input.
When we request run API with ARRAY input eg:

requests.post('https://api.apify.com/v2/acts/actorId/run-sync?token=********', json=[{'location': 'indonesia'}])

we get error something like :

{'error': {'type': 'invalid-input', 'message': 'The input JSON must be object, got "array" instead.'}}

Instead error, the API should accept the input as Batch Execution request just like Apollo GraphQL can do: https://www.apollographql.com/docs/router/executing-operations/query-batching/

Advantage:
The Actor only load Docker file once (save load time), then execute Scraper Code multiple time with multiple INPUTS, creating multiple Datasets for each INPUTS.
The run output should also return ARRAY of run results.
1 comment
S
What is the best way to replicate this INPUTS into INPUT_SCHEMA.json
10 comments
!
S
A
M
May be I should wait for one more day ...πŸ€—
2 comments
!
H
I have my actor all setup, readme, input schema, output schema. etc.
But I am still missing one step to have my Actor in Full Strength .... but I am not sure what is it ?
1 comment
S
On dataset view, pagination is not working
2 comments
!
O
My actor Input Scheme have hundred of optional BOOLEAN parameters (literally). When user not selecting any of the options, the platform set the default values to False (all hundreds parameters). This could create confusion ? How to prevent this from happening ? How to make platform just set True only for users selected options ? Thanks.
3 comments
!
A
I am receiving this notification, not sure where it come from.
1 comment
L
when setting environment variable from forked build, received following errors:
Plain Text
Error: Environment variable could not be saved (Concurrent update has been detected (object ID: N3tfo0ZvLIcrOSSAA))
4 comments
!
L
Does a run ID or Dataset ID have a pattern to match (eg: regexp) so I know its a Run ID ?
Thanks.
2 comments
L
R
I am experimenting with this scenario:

I build an Actor, and create 3 fork/build. each have different INPUT_SCHEMA, README.md, etc.
When I change version via Version dropdown, I can see INFORMATION page changed.
But, the INPUT page does not change, without have to refresh the page, even so, it seem the input conficlting with each other (diffrent fork).
Can it fixed, so when I change versions, the INPUT page also changed.

Context: I am trying to build an Actor Bundle that can scrape multiple website with different input schema (utilizing platform fork features).
this should work via api: eg https://api.apify.com/v2/acts/actorId/run-sync?build=website_1.
I want this also work from web console, is this something doable ?
2 comments
!
v
Is it possible to get user client HTTP headers (user-agent etc.) anytime a user accessing the actor ?
2 comments
!
O
Is it possible for an Actor to have persistent files that always updated, and where should I put the files. For example file based database such SQLite, etc.
Thank You.
I am using api call to get user info:
Plain Text
user_info = await Actor.apify_client.user('me').get()


I saw an attribute isPaying = True. Does this status indicates the user paying for platform, or is it paying for current actor subscription ?
just want to make sure, thank You
2 comments
!
P