update interface and version for fastembed #340

Merged on Oct 16, 2023 (8 commits)

generall
Member

  • Allow iterators in fastembed
  • Update to v0.1.1
  • Allow parallel inference
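
The first bullet, accepting iterators, can be illustrated with a minimal sketch. This is not the code from this PR: `fake_embed`, `encode_lazily`, and `read_docs` are hypothetical stand-ins. The only point is that documents are consumed one at a time instead of being copied into a list first.

```python
from typing import Iterable, Iterator, List, Tuple


def fake_embed(doc: str) -> List[float]:
    # Hypothetical stand-in for a real embedding model.
    return [float(len(doc)), float(doc.count(" ") + 1)]


def encode_lazily(documents: Iterable[str]) -> Iterator[Tuple[str, List[float]]]:
    # Yield (document, vector) pairs one at a time, so a generator
    # passed as `documents` is never materialized into a list.
    for doc in documents:
        yield doc, fake_embed(doc)


def read_docs() -> Iterator[str]:
    # A generator source; it could equally be lines streamed from a file.
    yield "hello world"
    yield "qdrant"


pairs = encode_lazily(read_docs())
first = next(pairs)  # only the first document has been read so far
```

Because `encode_lazily` is itself a generator, nothing is embedded until the caller asks for the next pair.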

@generall generall requested a review from joein October 16, 2023 14:55
vector_name = self.get_vector_field_name()
for idx, meta, (doc, vector) in zip(ids, metadata, encoded_docs):
    ids_accumulator.append(idx)
    payload = {"document": doc, **meta}
Member

if you want to be picky: when meta has a "document" field, it will overwrite "document" here

Member Author

yes, I also don't like this, but we can't change it anymore
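
The overwrite discussed in this thread follows from plain Python dict semantics and can be reproduced in isolation. The sample values below are made up for illustration, and the alternative ordering is only an assumption about what a fix could look like; the thread itself says the behaviour can no longer be changed.

```python
doc = "original text"
meta = {"source": "wiki", "document": "value from metadata"}

# Same construction as in the reviewed line: later keys win,
# so the "document" key from `meta` replaces `doc`.
payload = {"document": doc, **meta}

# Alternative ordering (an assumption, not what the PR does):
# unpack `meta` first so the document text takes precedence.
payload_doc_wins = {**meta, "document": doc}
```

With the first ordering, `payload["document"]` holds the metadata value; with the second, it holds the document text.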

Member Author

somehow missed it in last release

hnsw_config=hnsw_config,
)
}

def add(
Member

return type is not updated in the docstring

Member Author

check

collection_name=collection_name,
records=records,
wait=True,
parallel=parallel or 1,
Member

don't we want to propagate batch_size here?

Member Author

check
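
What "propagating" `batch_size` alongside `parallel or 1` means can be sketched with a stub. All names here (`iter_batches`, `upload_stub`, and this `add`) are hypothetical and do not reproduce the client's real methods; the stub only reports how it was called.

```python
from itertools import islice
from typing import Iterable, Iterator, List, Optional


def iter_batches(items: Iterable[str], batch_size: int) -> Iterator[List[str]]:
    # Split any iterable into lists of at most `batch_size` items.
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def upload_stub(records: Iterable[str], batch_size: int, parallel: int) -> dict:
    # Hypothetical uploader: only reports how it was called.
    sizes = [len(batch) for batch in iter_batches(records, batch_size)]
    return {"batch_sizes": sizes, "parallel": parallel}


def add(
    records: Iterable[str],
    batch_size: int = 32,
    parallel: Optional[int] = None,
) -> dict:
    # Forward both settings; `parallel or 1` turns None (or 0) into 1.
    return upload_stub(records, batch_size=batch_size, parallel=parallel or 1)


result = add((f"doc-{i}" for i in range(5)), batch_size=2)
```

If `batch_size` were not forwarded, the uploader would fall back to its own default regardless of what the caller passed to `add`.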

@joein joein self-requested a review October 16, 2023 16:08
@generall generall merged commit 0f32e57 into dev Oct 16, 2023
generall added a commit that referenced this pull request Oct 16, 2023
* update interface and version for fastembed

* fix types

* fix types

* regen async

* use python 3.11 to check compatibility

* fix docstring

* regen async

* propagate batch size
@generall generall deleted the update-fastembed-0-1-0 branch May 3, 2024 10:49