- Branch selection for GitHub repositories
- Version number display in UI
- Dynamic UI controls based on URL type (GitHub vs regular websites)
- Improved GitHub repository detection and branch handling
- Enhanced UI feedback for GitHub repository crawling
- Better error handling for branch fetching
- Fixed branch selection persistence between different repositories
- Fixed UI state management for crawling method selection
- Improved GitHub repository crawling with automatic branch detection
- Enhanced error handling and user feedback for failed crawls
- Added detailed branch information logging during GitHub repository crawling
- Fixed button state not resetting after crawling errors
- Fixed incorrect branch detection in GitHub repository crawling
- Implemented exponential backoff with retries in the browser method to avoid being blocked by anti-detection measures.
- Added random user agent rotation from a list of common user agents.
- Added
Accept-Language
,Accept
, andCache-Control
headers to the browser method. - Added a delay with exponential backoff before navigating to the page.
- Enabled JavaScript in the browser method.
- Set viewport to a common desktop resolution in the browser method.
- Added common browser permissions to the browser method.
- Added common browser features to avoid detection.
- Added a wait for specific content to be loaded in the browser method.
- Updated Puppeteer navigation strategy to use
networkidle0
for more reliable page loading. - Improved browser-method link handling to ensure navigation menu links are included in crawling while keeping content clean.
- Enhanced URL processing to convert relative links to absolute URLs in browser-based crawling.
- Resolved issue where some pages were not fully parsed due to insufficient waiting conditions.
- Corrected link processing to ensure all links are properly formatted and included in the crawl.
- Removed unnecessary newlines at the start of generated content
- Improved file compatibility with Windsurf by eliminating extra spacing
- File selection button for appending to existing files
- Improved file settings UI with cleaner button layout
- Added headers to crawled content (File/Source information)
- Improved filepath handling for better folder detection
- Custom output file naming option in UI
- Added demo GIF to README for better feature visualization
- Major internal refactoring for improved code maintainability
- Enhanced error handling and reporting
- Dedicated service architecture for better separation of concerns
- Improved type safety throughout the codebase
- Better content cleaning and formatting:
- Enhanced SVG handling in browser method
- Improved navigation and footer removal
- Fixed JSON file formatting in GitHub repository crawling
- Added non-text file skipping in repository crawling
- GitHub repository documentation generation
- Support for both full repositories and specific directories
- Depth-based file filtering for repository crawling
- Improved webview UI text for better clarity and consistency
- Refactored webview code
- Alternative browser-based scraping method
- Method selection dropdown in UI
- Improved crawling logic to follow subdirectory structure from initial URL
- Updated documentation to reflect new crawling behavior
- Version compatibility issue with VS Code types
- Complete documentation with installation guides
- Marketplace publishing preparation
- Auto-open file setting with checkbox in UI
- Improved UI consistency with VS Code's native look
- Box-sizing fixes for better UI rendering
- Repository field to package.json
- Improved file naming to include full URL path instead of just domain
- Removed command palette entry in favor of sidebar UI
- Updated README to reflect new sidebar-based workflow
- Separated HTML, CSS, and JavaScript into distinct files
- Improved code organization and maintainability
- Enhanced webview UI with better styling
- Real-time progress tracking
- Stop crawling functionality
- Depth control slider with descriptions
- Improved error handling and status messages
- Enhanced UI with better visual feedback
- Crawling depth control
- Progress indicators
- Better error messages
- URL validation issues
- File saving reliability
- Support for recursive crawling
- Link extraction functionality
- Depth-based crawling limits
- Integration with Jina AI Reader API
- Markdown conversion functionality
- File saving capabilities
- Basic webview implementation
- URL input functionality
- Initial UI design
- Initial release
- Basic project structure
- Core extension setup