Skip to content

Releases: 3choff/docs-miner

v1.8.0 - GitHub Branch Support & UI Enhancements

28 Jan 23:50
Compare
Choose a tag to compare

This release introduces GitHub repository branch selection and improves the overall user interface.

🌳 GitHub Branch Features

  • Added branch selection for GitHub repositories
  • Automatic branch detection and listing
  • Smart handling of branch URLs
  • Support for default and custom branches

🎨 UI Improvements

  • Dynamic UI controls based on URL type
  • Added version number display

v1.7.0 - Anti-Detection & Crawling Resilience

23 Jan 00:10
Compare
Choose a tag to compare

This release enhances the browser-based crawling capabilities with anti-detection measures.

🛡️ Anti-Detection Improvements

  • Implemented user agent rotation from modern browser profiles
  • Added exponential backoff with retry logic
  • Introduced realistic viewport and permissions simulation

🔄 Browser Method Enhancements

  • Added content verification checks for dynamic pages
  • Improved JavaScript handling and execution
  • Enhanced browser simulation with common plugins and features

v1.6.0 - Enhanced Browser Crawling Reliability

12 Dec 17:26
Compare
Choose a tag to compare

This release improves the reliability and completeness of browser-based content extraction.

📊 Browser Method Improvements

  • Updated page loading strategy for more reliable content capture
  • Enhanced link processing to include navigation elements
  • Improved handling of relative URLs to absolute conversion

🐛 Bug Fixes

  • Resolved incomplete page parsing issues
  • Fixed link processing type safety
  • Enhanced content extraction reliability

v1.5.1 - Improved Windsurf Compatibility

29 Nov 20:09
Compare
Choose a tag to compare

This release improves file compatibility and enhances content readability.

📄 Content Generation

  • Removed unnecessary newlines at content start
  • Optimized spacing between content sections
  • Enhanced initial file readability

🔄 Compatibility Improvements

  • Better integration with Windsurf reader

v1.5.0 - Headers for crawled content and improved file handling

28 Nov 18:26
Compare
Choose a tag to compare

This release enhances content organization and improves file-handling capabilities.

📝 Content Organization

  • Added informative headers to crawled content
    • File path and Source URL for GitHub files
    • Source URL for website content
  • Enhanced content readability with clear file separators

🗂️ File Handling Improvements

  • Added "Add to File" button for appending to existing files
  • Implemented smart folder path detection
    • Better handling of root vs subfolder files
    • Improved workspace path recognition
  • Enhanced file settings UI with cleaner button layout

v1.3.0: Major Architecture Improvements and Enhanced Content Processing

25 Nov 08:34
Compare
Choose a tag to compare

This release brings significant improvements to the internal architecture and content processing capabilities.

🏗️ Architecture Improvements

  • Implemented dedicated service architecture for better code organization
  • Enhanced error handling and reporting system
  • Improved type safety throughout the codebase

🧹 Enhanced Content Processing

  • Better SVG handling in browser-based scraping
  • Improved navigation and footer removal for cleaner content
  • Enhanced JSON file formatting in GitHub repository crawling
  • Added intelligent non-text file skipping in repository processing

v1.2.0 - GitHub Repository Support & UI Improvements

22 Nov 22:44
Compare
Choose a tag to compare

This release adds support for GitHub repository documentation generation and enhances the user experience.

🚀 New Features

  • Generate documentation from GitHub repositories
  • Support for both full repositories and specific directories
  • Smart depth-based file filtering for repository crawling

🔧 Improvements

  • Enhanced webview UI text for better clarity and consistency
  • Refactored webview code for improved maintainability

v1.1.0 - Dual Scraping Methods & Enhanced Reliability

21 Nov 19:03
Compare
Choose a tag to compare

🚀 New Features

  • Dual Scraping Methods: Choose between:
    • API Method (Faster but may fail on some sites)
    • Browser Method (Slower but more reliable)
  • Method Selection: New dropdown in UI to switch between methods
  • Better Code Block Handling: Improved formatting for documentation code examples

🔧 Improvements

  • More reliable content extraction for JavaScript-heavy sites
  • Consistent metadata formatting between both methods
  • Better URL handling and crawling
  • Webpack bundling for optimized package size

🔍 Usage

Select your preferred method in the Docs Miner sidebar:

  • Use API method for quick documentation extraction
  • Switch to Browser method when API fails or for complex sites

Docs Miner v1.0.3 - Output Folder Control & Improved Crawling

20 Nov 12:12
Compare
Choose a tag to compare

Added

  • Output folder field to specify save location for generated docs

Changed

  • Improved crawling logic to follow subdirectory structure from initial URL
  • Updated depth level descriptions to be more precise and clear
  • Updated documentation to reflect new crawling behavior

Fixed

  • Version compatibility issue with VS Code types

Docs Miner v1.0.0 - Initial Release

19 Nov 19:49
Compare
Choose a tag to compare

Description:

Docs Miner VSCode Extension v1.0.0

First stable release of Docs Miner, a VS Code extension for generating markdown documentation from web pages using Jina AI Reader API.

Features

  • Generate markdown documentation from any web URL
  • Control crawling depth (1-5 levels)
  • Real-time progress tracking
  • Stop crawling at any time
  • Automatically saves markdown files in workspace
  • Optional auto-open for generated files

Installation

  1. Download the docs-miner-1.0.0.vsix file
  2. In VS Code, go to Extensions view (Ctrl+Shift+X)
  3. Click '...' menu (top-right)
  4. Select 'Install from VSIX...'
  5. Choose the downloaded file

Requirements

  • VS Code 1.95.0 or higher
  • Active internet connection

Notes

  • Initial stable release
  • Full documentation available in README
  • Bug reports and feedback welcome in Issues section