Releases: 3choff/docs-miner
v1.8.0 - GitHub Branch Support & UI Enhancements
This release introduces GitHub repository branch selection and improves the overall user interface.
🌳 GitHub Branch Features
- Added branch selection for GitHub repositories
- Automatic branch detection and listing
- Smart handling of branch URLs
- Support for default and custom branches
🎨 UI Improvements
- Dynamic UI controls based on URL type
- Added version number display
v1.7.0 - Anti-Detection & Crawling Resilience
This release enhances the browser-based crawling capabilities with anti-detection measures.
🛡️ Anti-Detection Improvements
- Implemented user agent rotation from modern browser profiles
- Added exponential backoff with retry logic
- Introduced realistic viewport and permissions simulation
🔄 Browser Method Enhancements
- Added content verification checks for dynamic pages
- Improved JavaScript handling and execution
- Enhanced browser simulation with common plugins and features
v1.6.0 - Enhanced Browser Crawling Reliability
This release improves the reliability and completeness of browser-based content extraction.
📊 Browser Method Improvements
- Updated page loading strategy for more reliable content capture
- Enhanced link processing to include navigation elements
- Improved handling of relative URLs to absolute conversion
🐛 Bug Fixes
- Resolved incomplete page parsing issues
- Fixed link processing type safety
- Enhanced content extraction reliability
v1.5.1 - Improved Windsurf Compatibility
This release improves file compatibility and enhances content readability.
📄 Content Generation
- Removed unnecessary newlines at content start
- Optimized spacing between content sections
- Enhanced initial file readability
🔄 Compatibility Improvements
- Better integration with Windsurf reader
v1.5.0 - Headers for crawled content and improved file handling
This release enhances content organization and improves file-handling capabilities.
📝 Content Organization
- Added informative headers to crawled content
- File path and Source URL for GitHub files
- Source URL for website content
- Enhanced content readability with clear file separators
🗂️ File Handling Improvements
- Added "Add to File" button for appending to existing files
- Implemented smart folder path detection
- Better handling of root vs subfolder files
- Improved workspace path recognition
- Enhanced file settings UI with cleaner button layout
v1.3.0: Major Architecture Improvements and Enhanced Content Processing
This release brings significant improvements to the internal architecture and content processing capabilities.
🏗️ Architecture Improvements
- Implemented dedicated service architecture for better code organization
- Enhanced error handling and reporting system
- Improved type safety throughout the codebase
🧹 Enhanced Content Processing
- Better SVG handling in browser-based scraping
- Improved navigation and footer removal for cleaner content
- Enhanced JSON file formatting in GitHub repository crawling
- Added intelligent non-text file skipping in repository processing
v1.2.0 - GitHub Repository Support & UI Improvements
This release adds support for GitHub repository documentation generation and enhances the user experience.
🚀 New Features
- Generate documentation from GitHub repositories
- Support for both full repositories and specific directories
- Smart depth-based file filtering for repository crawling
🔧 Improvements
- Enhanced webview UI text for better clarity and consistency
- Refactored webview code for improved maintainability
v1.1.0 - Dual Scraping Methods & Enhanced Reliability
🚀 New Features
- Dual Scraping Methods: Choose between:
- API Method (Faster but may fail on some sites)
- Browser Method (Slower but more reliable)
- Method Selection: New dropdown in UI to switch between methods
- Better Code Block Handling: Improved formatting for documentation code examples
🔧 Improvements
- More reliable content extraction for JavaScript-heavy sites
- Consistent metadata formatting between both methods
- Better URL handling and crawling
- Webpack bundling for optimized package size
🔍 Usage
Select your preferred method in the Docs Miner sidebar:
- Use API method for quick documentation extraction
- Switch to Browser method when API fails or for complex sites
Docs Miner v1.0.3 - Output Folder Control & Improved Crawling
Added
- Output folder field to specify save location for generated docs
Changed
- Improved crawling logic to follow subdirectory structure from initial URL
- Updated depth level descriptions to be more precise and clear
- Updated documentation to reflect new crawling behavior
Fixed
- Version compatibility issue with VS Code types
Docs Miner v1.0.0 - Initial Release
Description:
Docs Miner VSCode Extension v1.0.0
First stable release of Docs Miner, a VS Code extension for generating markdown documentation from web pages using Jina AI Reader API.
Features
- Generate markdown documentation from any web URL
- Control crawling depth (1-5 levels)
- Real-time progress tracking
- Stop crawling at any time
- Automatically saves markdown files in workspace
- Optional auto-open for generated files
Installation
- Download the
docs-miner-1.0.0.vsix
file - In VS Code, go to Extensions view (Ctrl+Shift+X)
- Click '...' menu (top-right)
- Select 'Install from VSIX...'
- Choose the downloaded file
Requirements
- VS Code 1.95.0 or higher
- Active internet connection
Notes
- Initial stable release
- Full documentation available in README
- Bug reports and feedback welcome in Issues section