XHS-Spider is a desktop data collection tool for gathering content and metadata from the Xiaohongshu platform. Its graphical interface lets users explore posts, collect information, and download media such as images and videos from individual notes or from search results. Content can be located by keyword search, by user search, or by parsing an individual post link. Collected data and comments can be exported for local storage or analysis, and comment data can be turned into word clouds.

The project was developed primarily as a learning exercise, to demonstrate approaches to building web crawlers and to experiment with technologies such as WebView2 and WPF UI. It was originally released publicly, but the author later discontinued it because of concerns about misuse and the difficulty of maintaining it.
Features
- QR code login and automatic login support
- Keyword and user-based content search
- Parsing and extraction of individual post data
- Comment scraping and comment export
- Data export and comment word cloud generation
- Downloading images, videos, and live photo resources
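
The comment word cloud feature rests on counting how often each term appears in the collected comments. The sketch below illustrates that counting step only; it is not taken from XHS-Spider. The function name `word_frequencies`, the `STOP_WORDS` set, and the sample comments are all hypothetical, and the regex tokenizer is an assumption that suits space-delimited text. Chinese comments would need a proper word segmenter before counting.

```python
import re
from collections import Counter

# Hypothetical stop-word list for illustration; a real one would be much longer.
STOP_WORDS = {"the", "a", "is", "and", "to", "it"}


def word_frequencies(comments, top_n=5):
    """Return the top_n most frequent words across comments as (word, count) pairs."""
    counts = Counter()
    for text in comments:
        # \w+ matches runs of word characters; lowercasing merges "The" and "the".
        for token in re.findall(r"\w+", text.lower()):
            if token not in STOP_WORDS:
                counts[token] += 1
    return counts.most_common(top_n)


# Hypothetical sample data standing in for exported comments.
sample = ["Love the colors!", "The colors are great", "Great photo, love it"]
print(word_frequencies(sample, top_n=3))
```

The resulting (word, count) pairs are the kind of input a word cloud renderer typically scales font sizes from.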