Highlight
- Add Wenet model 35
- Add code generation and code translation from ZHIPU 33
- Hub support retry and continue-download after error
- Add five finetune tasks for ofa
- Add damoyolo-t and damoyolo-m
- Add dpm-solver for diffusion models
- Add image depth estimation pipeline
- Add en-zh en-es es-en base translation models
- Add GPT-3 tensor parallel finetuning
Feature
- Add five finetune tasks for ofa
- Add synonym for table question answering
- Add jupyter lab plugin in docker
- Add language_guided_video_summarization pipeline
- Add nlp/addr/structure and update token classification related method
- Add damoyolo-t and damoyolo-m
- Add Wenet model 35
- Add code generation and code translation from ZHIPU 33
- Add camouflaged-detection
- Support batch inference in pipeline for some models
- Add table recognition task
- Add dpm-solver for diffusion models
- Add asr task for ofa
- Add features for alimeeting competition dataset
- Add funasr based asr inference
- Add extractive-summarization and topic-segmentation
- Add image depth estimation pipeline
- Add en-zh en-es es-en base translation models
- Add gpt-moe model and pipeline
- Action-detection model predownload video before inference
- Add finetune for cv/language_guided_video_summarization
- Add plug finetune and pretrained model
- Support license plate detection
- Add nextvit-small_image-classification_Dailylife-labels model
- Add support for UniTE
- Add video human matting task
- Add LSTMCRFForWordSegmentation
- Add face mask model
- Support new asr paraformer and conformer models
- Add GPT-3 tensor parallel finetuning
- Update image-portrait-enhancement trainer
- Add FairFace face attribute model
- Add facial landmark confidence model
Improvements
- Hub support retry and continue-download after error
- Refactor NLP and fix some user feedbacks
- Speed up the ast indexing during editing
- Add tensorboard hook for visualization
- Reduce the GPU usage of dialog trainer
- Substitute the face detection model in skin_retouching_pipeline.py
- Update git-lfs install instructions
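To illustrate what "retry and continue-download after error" can look like in practice, here is a minimal sketch. It is not the Hub's actual implementation: `download_with_resume`, `make_flaky_fetch`, and the `fetch(offset)` callable are hypothetical names introduced for this example, and the network is simulated so the snippet runs with the standard library only. Over HTTP, `fetch(offset)` would typically correspond to a Range request starting at `offset`.

```python
import io
from typing import Callable, Iterator


def download_with_resume(
    fetch: Callable[[int], Iterator[bytes]],
    sink: io.BytesIO,
    max_retries: int = 3,
) -> int:
    """Stream chunks into sink, resuming from the current offset after an error.

    fetch(offset) is a hypothetical callable that yields the file's bytes
    starting at offset. Returns the number of retries that were needed.
    """
    retries = 0
    while True:
        offset = sink.tell()  # bytes already written survive a failure
        try:
            for chunk in fetch(offset):
                sink.write(chunk)
            return retries
        except ConnectionError:
            retries += 1
            if retries > max_retries:
                raise  # give up after too many consecutive failures


def make_flaky_fetch(payload: bytes, fail_at: int, chunk_size: int = 4):
    """Build a fake fetch that raises once when the stream reaches fail_at bytes."""
    state = {"failed": False}

    def fetch(offset: int) -> Iterator[bytes]:
        pos = offset
        while pos < len(payload):
            if not state["failed"] and pos >= fail_at:
                state["failed"] = True
                raise ConnectionError("simulated network drop")
            yield payload[pos:pos + chunk_size]
            pos += chunk_size

    return fetch


if __name__ == "__main__":
    data = b"0123456789abcdef"
    buffer = io.BytesIO()
    used = download_with_resume(make_flaky_fetch(data, fail_at=8), buffer)
    print(used, buffer.getvalue() == data)
```

The key design choice in this sketch is that the resume offset is read from the sink itself, so a retry never re-downloads bytes that were already written.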
BugFix
- Fix output video path when person detection fails for 3d_body_keypoints
- Fix lazy importing problem in text classification pipeline
- Fix bug for distributed inference of gpt3
- Fix bug for mplug evaluation
- Fix token preprocess bug
- Fix file encoding problem on Windows
- Fix deadlock when setting the thread number up to 90 for kws model
- Fix bug in token classification postprocessor
- Fix torch.concat compatibility with torch 1.8
- Fix log print and extensions issue for datasets==2.5.2
- Fix interpolate value error for vitadapter semantic segmentation
- Fix finetuning bug in nlp csanmt translation
- Fix a bug where the logging file saved the lr as zero instead of the correct value
- Fix bug of tableQA on gpu
- Fix bug for text generation task model
- Fix download file timeout being too short
1.0.2
Our first official version was released on November 1.
* Run inference with a single line of code using the pipeline interface
* Finetune with fewer than 10 lines of code using the trainer
* Provide models covering NLP, CV, MultiModal, Audio and Science
* Provide up to 300 off-the-shelf models for convenient use
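As a rough illustration of the pipeline interface mentioned above, the sketch below wraps a word segmentation pipeline in a helper. The helper name `segment` is hypothetical, and the task name and model id are assumptions based on the project's quick-start example; they may differ in your installed version. Calling the helper requires `modelscope` to be installed and downloads the model on first use.

```python
def segment(text: str):
    """Run word segmentation through the pipeline interface (sketch)."""
    # Imported lazily so that defining this helper does not require modelscope.
    from modelscope.pipelines import pipeline

    # Assumed task name and model id, taken from the quick-start example.
    word_segmentation = pipeline(
        "word-segmentation",
        model="damo/nlp_structbert_word-segmentation_chinese-base",
    )
    # The single call below is the "one line" of inference.
    return word_segmentation(text)
```

In real code the pipeline object would usually be created once and reused across calls, since constructing it loads the model.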