More Premium Hugo Themes Premium Tailwind Themes

Nextcrawler

Next Crawler 是使用Playwright + Next.js + Prisma等主流技术搭建的网页数据采集器,通过可视化的UI进行配置,即可周期性的通过Playwright驱动浏览器爬取网页数据。

Nextcrawler

Next Crawler 是使用Playwright + Next.js + Prisma等主流技术搭建的网页数据采集器,通过可视化的UI进行配置,即可周期性的通过Playwright驱动浏览器爬取网页数据。

Author Avatar Theme by g089h515r806
Github Stars Github Stars: 40
Last Commit Last Commit: Dec 24, 2025 -
First Commit Created: Jan 17, 2026 -
Nextcrawler screenshot

Overview

Next Crawler is an innovative web data collector built using prominent technologies such as Playwright, Next.js, and Prisma. It allows users to seamlessly collect web data through a visual UI, making the entire process user-friendly and efficient. Whether you need to scrape articles, images, or even comments, Next Crawler provides a robust interface to help you automate regular data collection from various websites with ease.

With its powerful capabilities and customizable features, Next Crawler stands out as a versatile tool for anyone looking to gather data for research, analysis, or personal projects. The ability to configure repeated tasks and download different file formats adds to its appeal, catering to a wide range of data scraping needs.

Features

  • User-Friendly Interface: A visual UI simplifies the configuration process, making it accessible for users of all skill levels.
  • Smart Content Recognition: Built-in support for intelligent content detection using the mozilla/readability library ensures that you scrape only the relevant text.
  • Flexible File Downloads: You can download files in multiple formats like PDF, MP3, and MP4, enhancing the tool’s versatility for various content types.
  • Custom Multi-Field Parsing: The option to configure parsing settings for multiple fields allows for tailored data extraction to meet specific requirements.
  • Scheduled Tasks: Support for cron jobs lets users automate the scraping process at regular intervals, ensuring data is always up-to-date.
  • Import/Export Functionality: Easily manage scraping configurations with templates that support import and export, streamlining your setup.
  • Proxy Support & Error Logging: Features such as proxy support and detailed error logs enhance security and troubleshooting capabilities.
  • Persistent Browser Context: The capability to maintain a persistent browser session mimics real user behavior, providing more accurate data collection.