Enhance README :: Add Cost-Effective PDF to Markdown Conversion Using AWS Bedrock Models #129
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR introduces another model option for converting PDFs to Markdown using vision models via the
zerox
library.snippet
What makes this so cool? The
zerox
library leverages thelitellm
library behind the scenes, giving us a wider range of options. Plus,litellm
lets us tap into vision models from the AWS Bedrock platform and a few other choices.I opened this PR for two main reasons:
If you're working with a PDF over 10 pages, you might hit Google's rate limit for the Gemini model under the free tier.
Here's a quick comparison of some models:
The Amazon model is super competitive in pricing, making it a breeze to use with AWS Lambda functions.
By adding more options right in the README, we're making it easier for newcomers to find what they need.
Thanks!