How to store folders after crawler solves anti-crawling?
Octopus collector can help you solve the problem of crawler anti-crawling, and can save the collected data to the specified folder. The following are the general operation steps: 1. Open octopus collector and create a new collection task. 2. In the task settings, enter the URL to be collected as the starting URL for collection. 3. Configure collection rules. You can use the intelligent identification function to let Octopus automatically identify the data structure of the page, or you can set the collection rules manually. 4. If the collection rules are set manually, you can use the mouse to select the data elements on the page and set the corresponding collection rules to ensure the correct collection of the required data. 5. Set the file saving path. In the collection rule settings, you can choose to save the collected data to the specified folder. 6. Run the acquisition task. After confirming the correct settings, you can start the collection task and let Octopus start collecting data. 7. Wait for the collection to be completed. Octopus will automatically grab the data on the page according to the set rules and save it in the specified folder. Octopus collector supports a variety of data storage formats, including text files, Excel files, databases and so on. You can choose the appropriate storage method according to your own needs. Octopus has prepared a series of concise and easy-to-understand tutorials for users to help them quickly master the collection skills and easily cope with the data collection of various websites. Please go to official website to get a tutorial to help you learn more details.