Skip to content

Commit

Permalink
drop short docs before ingestion (#268)
Browse files Browse the repository at this point in the history
  • Loading branch information
baskaryan authored Feb 13, 2024
1 parent 7356d00 commit bbe56e9
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions ingest.py
Original file line number Diff line number Diff line change
Expand Up @@ -130,6 +130,7 @@ def ingest_docs():
docs_transformed = text_splitter.split_documents(
docs_from_documentation + docs_from_api + docs_from_langsmith
)
docs_transformed = [doc for doc in docs_transformed if len(doc.page_content) > 10]

# We try to return 'source' and 'title' metadata when querying vector store and
# Weaviate will error at query time if one of the attributes is missing from a
Expand Down

0 comments on commit bbe56e9

Please sign in to comment.