towhee-0.8.1
Pre-release
Pre-release
New Models:
- Vision backbone for video embedding
- 3 SOTA vision backbones
Operators:
- Add 2 video de-copy operators: select-video, temporal-network
- Add 1 image embedding operator specifically designed for image retrieval and video de-copy with SOTA performance on VCSL dataset: isc
- Add 1 audio embedding operator specified for audio fingerprint: audio_embedding.nnfp (with pretrained weights)
Notebooks:
- Add 1 tutorial for video de-copy: video_deduplication_at_segment_level.ipynb
- Add 1 beginner tutorial for audio fingerprint: Audio Fingerprint I: Build a Demo with Towhee & Milvus
DC
- dc.flatten now support multi column
- dc.group_by;
- kv storag mixin
- dc.insert_leveldb;
- dc.serach_leveldb;
- Operator output nothing, i.e. dc.operator'input',