The
TASK_URL_READER
TASK_LM
TASK_URL_READER
Parameter | Data Type | Required | Description |
---|---|---|---|
url |
|
Required | The web address of the page you want to extract (e.g., "https://www.example.com") |
return_format |
|
Optional | How you want the content formatted. Options include
|
use_cache |
|
Optional | Whether to save and reuse previous results (default:
|
celladdress |
|
Optional | Where to place the output (e.g., "C1"). If not specified, results appear in the cell to the right of the function |
"text"
Returns just the readable text from the webpage with minimal formatting:
=TASK_URL_READER("https://www.example.com", "text")
This is ideal when you only need the raw text content without any HTML tags or formatting.
"html"
Returns the complete HTML code of the webpage:
=TASK_URL_READER("https://www.example.com", "html")
Use this when you need to analyze the page structure or extract specific HTML elements.
"markdown"
Converts the webpage content to markdown formatting:
=TASK_URL_READER("https://www.example.com", "markdown")
This preserves basic formatting like headings, lists, and links in a clean, readable format.
"screenshot"
Returns a URL to a screenshot image of the webpage:
=TASK_URL_READER("https://www.example.com", "screenshot")
The screenshot option is particularly useful when:
When combined with
TASK_LM
=TASK_LM("Describe what you see in this image: " & TASK_URL_READER("https://www.example.com", "screenshot"))
The cache system stores results for 6 hours to improve performance and reduce API calls:
=TASK_URL_READER("https://www.example.com", "default", true)
To always fetch fresh content, set
use_cache
false
=TASK_URL_READER("https://www.example.com", "default", false)
Use the Save icon in the sidebar for long-term storage of cached results across sessions.
By default, results appear in the cell to the right of the function. To specify a different output location:
=TASK_URL_READER("https://www.example.com", "default", true, "G10")
You can also specify just a column, and the function will use the current row:
=TASK_URL_READER("https://www.example.com", "default", true, "G")
=TASK_URL_READER("https://www.wikipedia.org")
This places Wikipedia's homepage content in the cell to the right of the formula.
=TASK_URL_READER("https://www.bbc.com/news/world-65360307", "text", true, "B5")
This extracts a BBC news article as plain text and places it in cell B5.
=TASK_URL_READER("https://www.amazon.com/dp/B08N5KWB9H", "markdown")
This fetches an Amazon product page in markdown format.
=TASK_URL_READER("https://developers.google.com/apps-script/reference/spreadsheet/sheet", "text", false)
This gets fresh (non-cached) Google Apps Script documentation as plain text.
=TASK_URL_READER("https://weather.com/weather/today/l/San+Francisco+CA", "screenshot")
This captures a screenshot of today's weather in San Francisco, which could then be analyzed by an AI model.
use_cache
true
false
"text"
"markdown"
"html"
"screenshot"
TASK_URL_READER
TASK_URL_READER
TASK_LM
celladdress
The cache lasts for 6 hours by default. After that time, the function will fetch fresh content the next time it runs.
Generally no. The function cannot access content behind login screens or paywalls unless the site allows public access.
It depends. Some JavaScript-rendered content may not be captured in text/HTML modes. Try the "screenshot" mode for sites with heavy JavaScript components.
The caching mechanism minimizes performance impact. However, extracting large amounts of content from multiple URLs may cause some slowdown.
Yes, the Jina reader service has rate limitations on their free plan. For high-volume needs, consider their paid plan.
No, the function can only access publicly available content that doesn't require authentication.
Use the output cell from
TASK_URL_READER
TASK_LM
=TASK_LM("Summarize this article: " & B5)
(Assuming B5 contains the
TASK_URL_READER
Not directly. However, you can extract the full page and then use other functions like
TASK_LM
5 months ago
1 month ago
1 month ago