How to Use the List Command to Navigate URLs and Capture Data Efficiently
Steps to create an agent that uses a List command to navigate the URLs in a list and capture data.
Sequentum Cloud enables users to create agents that use List commands to navigate URLs. In this guide, we’ll walk through the steps to create an agent in Sequentum Cloud that uses a List command to navigate URLs and capture data efficiently.
How to use the Url command to load the URL:
Step 1: Open the editor window and navigate to Agent Configuration → Browser → Url.
Step 2: In the Url section, enter the URL of the site you want to extract data from.
Step 3: After adding the Url command, select the elements in the list. To select multiple elements, click the first element, hold the Shift key, and then click the second element. Then click the Extract option available at the bottom.
This adds the command to the left panel.
Note:
Clicking the 'List' option inserts the List command alone, without adding any extra commands.
Similarly, clicking the 'Link' option for the list selected on the web page adds the List command together with an additional Link command.
Step 4: After adding the Link List as described in Step 3, add the Navigate Url command inside the List command so that the agent navigates to the URLs one by one and captures the data.
Step 5: Add the Capture command to extract content from the navigated page.
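The flow built in Steps 1 to 5 can be pictured as a simple loop: load the start URL, take one link from each list item, navigate to it, and capture content from the page that loads. The Python sketch below is only a conceptual model of that flow, not Sequentum Cloud's implementation. The function name run_list_agent, its parameters, and the example.test pages are all hypothetical and exist only for illustration.

```python
from urllib.parse import urljoin


def run_list_agent(start_url, fetch, extract_links, capture):
    """Load a start page, follow each link in its list, and capture data.

    fetch, extract_links and capture are caller-supplied functions
    (hypothetical stand-ins for the Url, Link and Capture commands).
    """
    start_page = fetch(start_url)            # Steps 1-2: load the start URL
    rows = []
    for href in extract_links(start_page):   # Step 3: one link per list item
        target = urljoin(start_url, href)    # resolve relative links
        page = fetch(target)                 # Step 4: navigate to the link
        rows.append({"url": target, "content": capture(page)})  # Step 5
    return rows


# Tiny in-memory "site" (made-up data) so the sketch runs without a network.
PAGES = {
    "https://example.test/": "/a /b",
    "https://example.test/a": "  Page A \n",
    "https://example.test/b": "  Page B \n",
}

rows = run_list_agent(
    "https://example.test/",
    fetch=PAGES.__getitem__,   # look the page up instead of downloading it
    extract_links=str.split,   # the start "page" is a space-separated link list
    capture=str.strip,         # "capture" just trims whitespace here
)
```

Passing the three behaviours in as functions mirrors how the agent is assembled from separate commands: each one can be swapped without touching the loop itself.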
Note:
You can also generate the agent in text format by clicking the 'Text' option in the right panel.
The text can then be saved locally using the option available at the bottom.
Agent:
  Proxies:
    Run Without Proxies: true
  Commands: URL
    Dynamic Load:
      Timeouts:
        Discover Activity: 1
    Input: https://training.sequentum.com/
    Commands: Page Area List
      Name: Link List
      Select: //div[@class='pageContent']/div/div[@class='card mb-1'][div[1][@class='card-body']/a[1][@class='media text-dark']]
      Commands:
      - Content: Link
        Extract: /div[1][@class='card-body']/a[1][@class='media text-dark']
        Attribute: Exact URL
      - URL: URL
        Dynamic Load:
          Timeouts:
            Discover Activity: 1
        Input:
          Command: Link
        Commands: Content
          Extract: //div[@class='pageContent']
  Export:
    Commands: CSV
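In the agent text above, the Select expression of the Link List picks out one node per list item, and the Extract expression of the Link command is then evaluated relative to each of those nodes. The sketch below illustrates that pairing with Python's standard xml.etree.ElementTree module. It is an illustration under assumptions, not the way Sequentum Cloud evaluates XPath: the sample markup is invented, and the expressions are simplified because ElementTree supports only a subset of XPath. The nested predicate in Select is replaced by a Python check for a missing link.

```python
import xml.etree.ElementTree as ET

# Invented, well-formed sample markup that mimics the structure the
# agent's XPath expressions expect. The third card has no link.
SAMPLE = """
<html><body>
<div class="pageContent"><div>
  <div class="card mb-1"><div class="card-body">
    <a class="media text-dark" href="/page-a">A</a></div></div>
  <div class="card mb-1"><div class="card-body">
    <a class="media text-dark" href="/page-b">B</a></div></div>
  <div class="card mb-1"><div class="card-body"><span>no link</span></div></div>
</div></div>
</body></html>
"""

root = ET.fromstring(SAMPLE.strip())

# Simplified counterpart of Select: one node per list item.
items = root.findall(".//div[@class='pageContent']/div/div[@class='card mb-1']")

hrefs = []
for item in items:
    # Simplified counterpart of Extract, evaluated relative to the item.
    link = item.find("div[@class='card-body']/a[@class='media text-dark']")
    if link is None:
        continue  # stands in for the nested predicate in the agent's Select
    hrefs.append(link.get("href"))
```

Each value collected in hrefs plays the role of the Link command's output, which the nested URL command in the agent text takes as its Input.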