nlu Changelog

4.0.0

OCR Visual Tables into Pandas DataFrames from PDF/DOC(X)/PPT files, 1000+ new state-of-the-art transformer models for Question Answering (QA) for over 30 languages, up to 700% speedup on GPU, 20 Biomedical models for over 8 languages, 50+ Terminology Code Mappers between RXNORM, NDC, UMLS,ICD10, ICDO, UMLS, SNOMED and MESH, Deidentification in Romanian, various Spark NLP helper methods and much more in 1 line of code with John Snow Labs NLU 4.0.0

______________________

3.4.4

We are very excited to announce NLU 3.4.4 has been released with over 600 new models, over 75 new languages, and 155 languages covered in total,
400% speedup for tokenizers and 18x speedup of UniversalSentenceEncoder on GPU.

On the general NLP side, we have transformer-based Embeddings and Token Classifiers powered by state of the art [CamemBertEmbeddings](https://camembert-model.fr/) and [DeBertaForTokenClassification](https://arxiv.org/abs/2006.03654) based
architectures as well as various new models for
`Historical`, `Ancient`,`Dead`, `Extinct`, `Genetic`, and `Constructed` languages like
`Old Church Slavonic`, `Latin`, `Sanskrit`, `Esperanto`, `Volapük`, `Coptic`, `Nahuatl`, `Ancient Greek (to 1453)`, `Old Russian`.
On the healthcare side, we have `Portuguese De-identification Models`, have `NER` models for Gene detection and finally RxNorm Sentence resolution model for mapping and extracting pharmaceutical actions (e.g. analgesic, hypoglycemic)
as well as treatments (e.g. backache, diabetes).

For full release notes with all models see
[here](https://github.com/JohnSnowLabs/nlu/pull/119)
or [here](https://nlu.johnsnowlabs.com/docs/en/release_notes) ,

First-time language models covered
The languages for these models are covered for the very first time ever by NLU.

| Number | Language Name(s) | NLU Reference | Spark NLP Reference | Task | Annotator Class | Scope | Language Type |
|---------:|:----------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------|:---------------------|:--------------|:----------------|
|0 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sa_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sa_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Ancient|
|1 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.lemma](https://nlp.johnsnowlabs.com/2022/05/01/lemma_vedic_sa_3_0.html) | [lemma_vedic](https://nlp.johnsnowlabs.com/2022/05/01/lemma_vedic_sa_3_0.html)| Lemmatization | LemmatizerModel| Individual| Ancient|
|2 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.pos](https://nlp.johnsnowlabs.com/2022/05/01/pos_vedic_sa_3_0.html) | [pos_vedic](https://nlp.johnsnowlabs.com/2022/05/01/pos_vedic_sa_3_0.html)| Part of Speech Tagging | PerceptronModel| Individual| Ancient|
|3 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sa_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sa_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Ancient|
|4 | [Volapük](https://iso639-3.sil.org/code/vol)| [vo.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vo_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vo_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Constructed|
|5 | [Nahuatl languages](https://iso639-3.sil.org/code/nah)| [nah.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nah_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nah_3_0.html)| Embeddings | WordEmbeddingsModel| Collective| Genetic|
|6 | [Aragonese](https://iso639-3.sil.org/code/arg)| [an.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_an_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_an_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|7 | [Assamese](https://iso639-3.sil.org/code/asm)| [as.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_as_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_as_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|8 | [Asturian, Asturleonese, Bable, Leonese](https://iso639-3.sil.org/code/ast)| [ast.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ast_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ast_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|9 | [Bashkir](https://iso639-3.sil.org/code/bak)| [ba.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ba_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ba_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|10 | [Bavarian](https://iso639-3.sil.org/code/bar) | [bar.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bar_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bar_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|11 | [Bishnupriya](https://iso639-3.sil.org/code/bpy) | [bpy.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bpy_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bpy_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|12 | [Burmese](https://iso639-3.sil.org/code/mya) | [my.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_my_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_my_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|13 | [Cebuano](https://iso639-3.sil.org/code/ceb) | [ceb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ceb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ceb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|14 | [Central Bikol](https://iso639-3.sil.org/code/bcl) | [bcl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bcl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bcl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|15 | [Chechen](https://iso639-3.sil.org/code/che) | [ce.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ce_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ce_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|16 | [Chuvash](https://iso639-3.sil.org/code/chv) | [cv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_cv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_cv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|17 | [Corsican](https://iso639-3.sil.org/code/cos) | [co.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_co_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_co_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|18 | [Dhivehi, Divehi, Maldivian](https://iso639-3.sil.org/code/div) | [dv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_dv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_dv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|19 | [Egyptian Arabic](https://iso639-3.sil.org/code/arz) | [arz.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_arz_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_arz_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|20 | [Emiliano-Romagnolo](https://iso639-3.sil.org/code/eml) | [eml.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_eml_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_eml_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|21 | [Erzya](https://iso639-3.sil.org/code/myv) | [myv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_myv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_myv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|22 | [Georgian](https://iso639-3.sil.org/code/kat) | [ka.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_ka_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_ka_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|23 | [Goan Konkani](https://iso639-3.sil.org/code/gom) | [gom.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_gom_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_gom_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|24 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.distilbert](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_jv_3_0.html) | [distilbert_embeddings_javanese_distilbert_small](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_jv_3_0.html) | Embeddings | DistilBertEmbeddings| Individual| Living|
|25 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_distilbert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_imdb_jv_3_0.html) | [distilbert_embeddings_javanese_distilbert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_imdb_jv_3_0.html) | Embeddings | DistilBertEmbeddings| Individual| Living|
|26 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_roberta_small](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_jv_3_0.html) | [roberta_embeddings_javanese_roberta_small](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_jv_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|27 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_roberta_small_imdb](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_imdb_jv_3_0.html) | [roberta_embeddings_javanese_roberta_small_imdb](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_imdb_jv_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|28 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_bert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_imdb_jv_3_0.html) | [bert_embeddings_javanese_bert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_imdb_jv_3_0.html)| Embeddings | BertEmbeddings| Individual| Living|
|29 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_bert_small](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_jv_3_0.html) | [bert_embeddings_javanese_bert_small](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_jv_3_0.html) | Embeddings | BertEmbeddings| Individual| Living|
|30 | [Kirghiz, Kyrgyz](https://iso639-3.sil.org/code/kir) | [ky.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ky_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ky_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|31 | [Letzeburgesch, Luxembourgish](https://iso639-3.sil.org/code/ltz) | [lb.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lb_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lb_3_0.html) | Stop Words Removal | StopWordsCleaner| Individual| Living|
|32 | [Letzeburgesch, Luxembourgish](https://iso639-3.sil.org/code/ltz) | [lb.lemma](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_lb_3_0.html) | [lemma_spacylookup](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_lb_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|33 | [Letzeburgesch, Luxembourgish](https://iso639-3.sil.org/code/ltz) | [lb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|34 | [Ligurian](https://iso639-3.sil.org/code/lij) | [lij.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lij_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lij_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|35 | [Lombard](https://iso639-3.sil.org/code/lmo) | [lmo.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lmo_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lmo_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|36 | [Low German, Low Saxon](https://iso639-3.sil.org/code/nds) | [nds.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nds_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nds_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|37 | [Macedonian](https://iso639-3.sil.org/code/mkd) | [mk.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_mk_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_mk_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|38 | [Macedonian](https://iso639-3.sil.org/code/mkd) | [mk.lemma](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_mk_3_0.html) | [lemma_spacylookup](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_mk_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|39 | [Macedonian](https://iso639-3.sil.org/code/mkd) | [mk.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mk_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mk_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|40 | [Maithili](https://iso639-3.sil.org/code/mai) | [mai.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mai_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mai_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|41 | [Manx](https://iso639-3.sil.org/code/glv) | [gv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_gv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_gv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|42 | [Mazanderani](https://iso639-3.sil.org/code/mzn) | [mzn.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mzn_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mzn_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|43 | [Minangkabau](https://iso639-3.sil.org/code/min) | [min.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_min_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_min_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|44 | [Mingrelian](https://iso639-3.sil.org/code/xmf) | [xmf.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_xmf_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_xmf_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|45 | [Mirandese](https://iso639-3.sil.org/code/mwl) | [mwl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mwl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mwl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|46 | [Neapolitan](https://iso639-3.sil.org/code/nap) | [nap.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nap_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nap_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|47 | [Nepal Bhasa, Newari](https://iso639-3.sil.org/code/new) | [new.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_new_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_new_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|48 | [Northern Frisian](https://iso639-3.sil.org/code/frr) | [frr.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_frr_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_frr_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|49 | [Northern Sami](https://iso639-3.sil.org/code/sme) | [sme.lemma](https://nlp.johnsnowlabs.com/2022/05/01/lemma_giella_sme_3_0.html) | [lemma_giella](https://nlp.johnsnowlabs.com/2022/05/01/lemma_giella_sme_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|50 | [Northern Sami](https://iso639-3.sil.org/code/sme) | [sme.pos](https://nlp.johnsnowlabs.com/2022/05/01/pos_giella_sme_3_0.html) | [pos_giella](https://nlp.johnsnowlabs.com/2022/05/01/pos_giella_sme_3_0.html)| Part of Speech Tagging | PerceptronModel| Individual| Living|
|51 | [Northern Sotho, Pedi, Sepedi](https://iso639-3.sil.org/code/nso) | [nso.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nso_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nso_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|52 | [Occitan (post 1500)](https://iso639-3.sil.org/code/oci) | [oc.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_oc_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_oc_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|53 | [Ossetian, Ossetic](https://iso639-3.sil.org/code/oss) | [os.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_os_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_os_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|54 | [Pfaelzisch](https://iso639-3.sil.org/code/pfl) | [pfl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pfl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pfl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|55 | [Piemontese](https://iso639-3.sil.org/code/pms) | [pms.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pms_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pms_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|56 | [Romansh](https://iso639-3.sil.org/code/roh) | [rm.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_rm_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_rm_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|57 | [Scots](https://iso639-3.sil.org/code/sco) | [sco.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sco_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sco_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|58 | [Sicilian](https://iso639-3.sil.org/code/scn) | [scn.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_scn_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_scn_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|59 | [Sinhala, Sinhalese](https://iso639-3.sil.org/code/sin) | [si.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_si_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_si_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|60 | [Sinhala, Sinhalese](https://iso639-3.sil.org/code/sin) | [si.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_si_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_si_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|61 | [Sundanese](https://iso639-3.sil.org/code/sun) | [su.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_su_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_su_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|62 | [Sundanese](https://iso639-3.sil.org/code/sun) | [su.embed.sundanese_roberta_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_sundanese_roberta_base_su_3_0.html) | [roberta_embeddings_sundanese_roberta_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_sundanese_roberta_base_su_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|63 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.lemma](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_tl_3_0.html) | [lemma_spacylookup](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_tl_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|64 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|65 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tl_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tl_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|66 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.embed.roberta_tagalog_large](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_large_tl_3_0.html) | [roberta_embeddings_roberta_tagalog_large](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_large_tl_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|67 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.embed.roberta_tagalog_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_base_tl_3_0.html) | [roberta_embeddings_roberta_tagalog_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_base_tl_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|68 | [Tajik](https://iso639-3.sil.org/code/tgk) | [tg.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tg_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tg_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|69 | [Tatar](https://iso639-3.sil.org/code/tat) | [tt.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tt_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tt_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|70 | [Tatar](https://iso639-3.sil.org/code/tat) | [tt.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tt_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tt_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|71 | [Tigrinya](https://iso639-3.sil.org/code/tir) | [ti.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ti_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ti_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|72 | [Tosk Albanian](https://iso639-3.sil.org/code/als) | [als.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_als_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_als_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|73 | [Tswana](https://iso639-3.sil.org/code/tsn) | [tn.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tn_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tn_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|74 | [Turkmen](https://iso639-3.sil.org/code/tuk) | [tk.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tk_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tk_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|75 | [Upper Sorbian](https://iso639-3.sil.org/code/hsb) | [hsb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_hsb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_hsb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|76 | [Venetian](https://iso639-3.sil.org/code/vec) | [vec.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vec_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vec_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|77 | [Vlaams](https://iso639-3.sil.org/code/vls) | [vls.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vls_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vls_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|78 | [Walloon](https://iso639-3.sil.org/code/wln) | [wa.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_wa_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_wa_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|79 | [Waray (Philippines)](https://iso639-3.sil.org/code/war) | [war.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_war_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_war_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|80 | [Western Armenian](https://iso639-3.sil.org/code/hyw) | [hyw.pos](https://nlp.johnsnowlabs.com/2022/05/01/pos_armtdp_hyw_3_0.html) | [pos_armtdp](https://nlp.johnsnowlabs.com/2022/05/01/pos_armtdp_hyw_3_0.html)| Part of Speech Tagging | PerceptronModel| Individual| Living|
|81 | [Western Armenian](https://iso639-3.sil.org/code/hyw) | [hyw.lemma](https://nlp.johnsnowlabs.com/2022/05/01/lemma_armtdp_hyw_3_0.html) | [lemma_armtdp](https://nlp.johnsnowlabs.com/2022/05/01/lemma_armtdp_hyw_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|82 | [Western Frisian](https://iso639-3.sil.org/code/fry) | [fy.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_fy_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_fy_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|83 | [Western Panjabi](https://iso639-3.sil.org/code/pnb) | [pnb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pnb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pnb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|84 | [Yakut](https://iso639-3.sil.org/code/sah) | [sah.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sah_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sah_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|85 | [Zeeuws](https://iso639-3.sil.org/code/zea) | [zea.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_zea_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_zea_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|86 | [Albanian](https://iso639-3.sil.org/code/sqi) | [sq.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sq_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sq_3_0.html)| Stop Words Removal | StopWordsCleaner| Macrolanguage| Living|
|87 | [Albanian](https://iso639-3.sil.org/code/sqi) | [sq.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_sq_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_sq_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|88 | [Azerbaijani](https://iso639-3.sil.org/code/aze) | [az.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_az_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_az_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|89 | [Azerbaijani](https://iso639-3.sil.org/code/aze) | [az.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_az_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_az_3_0.html)| Stop Words Removal | StopWordsCleaner| Macrolanguage| Living|
|90 | [Malagasy](https://iso639-3.sil.org/code/mlg) | [mg.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mg_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mg_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|91 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.albert](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_large_bahasa_cased_ms_3_0.html) | [albert_embeddings_albert_large_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_large_bahasa_cased_ms_3_0.html) | Embeddings | AlbertEmbeddings| Macrolanguage| Living|
|92 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.distilbert](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_malaysian_distilbert_small_ms_3_0.html) | [distilbert_embeddings_malaysian_distilbert_small](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_malaysian_distilbert_small_ms_3_0.html)| Embeddings | DistilBertEmbeddings| Macrolanguage| Living|
|93 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.albert_tiny_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_tiny_bahasa_cased_ms_3_0.htm)l| [albert_embeddings_albert_tiny_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_tiny_bahasa_cased_ms_3_0.html)| Embeddings | AlbertEmbeddings| Macrolanguage| Living|
|94 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.albert_base_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_base_bahasa_cased_ms_3_0.html)| [albert_embeddings_albert_base_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_base_bahasa_cased_ms_3_0.html)| Embeddings | AlbertEmbeddings| Macrolanguage| Living|
|95 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ms_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ms_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|96 | [Mongolian](https://iso639-3.sil.org/code/mon) | [mn.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mn_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mn_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|97 | [Oriya (macrolanguage)](https://iso639-3.sil.org/code/ori) | [or.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_or_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_or_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|98 | [Pashto, Pushto](https://iso639-3.sil.org/code/pus) | [ps.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ps_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ps_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|99 | [Quechua](https://iso639-3.sil.org/code/que) | [qu.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_qu_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_qu_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|100 | [Sardinian](https://iso639-3.sil.org/code/srd) | [sc.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sc_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sc_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|101 | [Serbo-Croatian](https://iso639-3.sil.org/code/sh) | [sh.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sh_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sh_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|102 | [Uzbek](https://iso639-3.sil.org/code/uzb) | [uz.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_uz_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_uz_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|

All Healthcare models
Powered by the amazing
[Spark NLP for Healthcare 3.5.2](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#352) and
[Spark NLP for Healthcare 3.5.1](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#351) releases.

| Number | NLU Reference | Spark NLP Reference | Task | Language Name(s) | Annotator Class | Language Type | Scope |
|---------:|:-----------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:--------------------------------------------------|:-------------------------------|:--------------|:-----------|
|0 | [en.med_ner.biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/10/ner_biomedical_bc2gm_en_3_0.html) | [ner_biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/10/ner_biomedical_bc2gm_en_3_0.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalNerModel | Living | Individual |
|1 | [en.med_ner.biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/11/ner_biomedical_bc2gm_en_2_4.html) | [ner_biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/11/ner_biomedical_bc2gm_en_2_4.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalNerModel | Living | Individual |
|2 | [en.resolve.rxnorm_action_treatment](https://nlp.johnsnowlabs.com/2022/04/25/sbiobertresolve_rxnorm_action_treatment_en_2_4.html) | [sbiobertresolve_rxnorm_action_treatment](https://nlp.johnsnowlabs.com/2022/04/25/sbiobertresolve_rxnorm_action_treatment_en_2_4.html) | Entity Resolution | [English](https://iso639-3.sil.org/code/eng) | SentenceEntityResolverModel | Living | Individual |
|3 | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalBertForTokenClassifier | Living | Individual |
|4 | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalBertForTokenClassifier | Living | Individual |
|5 | [pt.med_ner.deid.subentity](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_subentity_pt_3_0.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_subentity_pt_3_0.html) | De-identification | [Portuguese](https://iso639-3.sil.org/code/por) | MedicalNerModel | Living | Individual |
|6 | [pt.med_ner.deid.generic](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | De-identification | [Portuguese](https://iso639-3.sil.org/code/por) | MedicalNerModel | Living | Individual |
|7 | [pt.med_ner.deid](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | De-identification | [Portuguese](https://iso639-3.sil.org/code/por) | MedicalNerModel | Living | Individual |

See next comment for more details

3.4.3

We are very excited to announce NLU 3.4.3 has been released!

This release features new models for `Zero-Shot-Relation-Extraction`, [DeBERTa for Sequence Classification](https://arxiv.org/abs/2006.03654),
`Deidentification` in `French` and `Italian` and
Lemmatizers, Parts of Speech Taggers, and Word2Vec Embeddings for over `66 languages`, with 20 languages being covered
for the first time by NLU, including ancient and exotic languages like `Ancient Greek`, `Old Russian`,
`Old French` and much more. Once again we would like to thank our community to make this release possible.

NLU for Healthcare

On the healthcare NLP side, a new `ZeroShotRelationExtractionModel` is available, which can extract relations between
clinical entities in an unsupervised fashion, no training required!
Additionally, New French and Italian Deidentification models are available for clinical and healthcare domains.
Powerd by the fantastic [ Spark NLP for helathcare 3.5.0 release](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes)

Zero-Shot Relation Extraction

Zero-shot Relation Extraction to extract relations between clinical entities with no training dataset

python
import nlu

pipe = nlu.load('med_ner.clinical relation.zeroshot_biobert')
Configure relations to extract
pipe['zero_shot_relation_extraction'].setRelationalCategories({
"CURE": ["{{TREATMENT}} cures {{PROBLEM}}."],
"IMPROVE": ["{{TREATMENT}} improves {{PROBLEM}}.", "{{TREATMENT}} cures {{PROBLEM}}."],
"REVEAL": ["{{TEST}} reveals {{PROBLEM}}."]})
.setMultiLabel(False)
df = pipe.predict("Paracetamol can alleviate headache or sickness. An MRI test can be used to find cancer.")
df[
'relation', 'relation_confidence', 'relation_entity1', 'relation_entity1_class', 'relation_entity2', 'relation_entity2_class',]
Results in following table :

| relation | relation_confidence | relation_entity1 | relation_entity1_class | relation_entity2 | relation_entity2_class |
|:-----------|----------------------:|:-------------------|:-------------------------|:-------------------|:-------------------------|
| REVEAL | 0.976004 | An MRI test | TEST | cancer | PROBLEM |
| IMPROVE | 0.988195 | Paracetamol | TREATMENT | sickness | PROBLEM |
| IMPROVE | 0.992962 | Paracetamol | TREATMENT | headache | PROBLEM |

New Healthcare Models overview

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:--------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------|:-------------------------|:--------------------------------|
| en | [en.relation.zeroshot_biobert](https://nlp.johnsnowlabs.com/2022/04/05/re_zeroshot_biobert_en_3_0.html) | [re_zeroshot_biobert](https://nlp.johnsnowlabs.com/2022/04/05/re_zeroshot_biobert_en_3_0.html) | Relation Extraction | ZeroShotRelationExtractionModel |
| fr | [fr.med_ner.deid_generic](https://nlp.johnsnowlabs.com/2022/02/11/ner_deid_generic_fr.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/02/11/ner_deid_generic_fr.html) | De-identification | MedicalNerModel |
| fr | [fr.med_ner.deid_subentity](https://nlp.johnsnowlabs.com/2022/02/14/ner_deid_subentity_fr.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/02/14/ner_deid_subentity_fr.html) | De-identification | MedicalNerModel |
| it | [it.med_ner.deid_generic](https://nlp.johnsnowlabs.com/2022/03/25/ner_deid_generic_it_3_0.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/03/25/ner_deid_generic_it_3_0.html) | Named Entity Recognition | MedicalNerModel |
| it | [it.med_ner.deid_subentity](https://nlp.johnsnowlabs.com/2022/03/22/ner_deid_subentity_it_3_0.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/03/22/ner_deid_subentity_it_3_0.html) | Named Entity Recognition | MedicalNerModel |

NLU general

On the general NLP side we have new transformer based `DeBERTa v3 sequence classifiers` models fine-tuned in Urdu, French and English for
Sentiment and News classification. Additionally, 100+ Part Of Speech Taggers and Lemmatizers for 66 Languages and for 7
languages new word2vec embeddings, including `hi`,`azb`,`bo`,`diq`,`cy`,`es`,`it`,
powered by the amazing [Spark NLP 3.4.3 release](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.4.3)

New Languages covered:
First time languages covered by NLU are :
`South Azerbaijani`, `Tibetan`, `Dimli`, `Central Kurdish`, `Southern Altai`,
`Scottish Gaelic`,`Faroese`,`Literary Chinese`,`Ancient Greek`,
`Gothic`, `Old Russian`, `Church Slavic`,
`Old French`,`Uighur`,`Coptic`,`Croatian`, `Belarusian`, `Serbian`

and their respective ISO-639-3 and ISO 630-2 codes are :
`azb`,`bo`,`diq`,`ckb`, `lt` `gd`, `fo`,`lzh`,`grc`,`got`,`orv`,`cu`,`fro`,`qtd`,`ug`,`cop`,`hr`,`be`,`qhe`,`sr`

New NLP Models Overview

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------|:---------------------------------|
| en | [en.classify.sentiment.imdb.deberta](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_xsmall_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.sentiment.imdb.deberta.small](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_small_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.sentiment.imdb.deberta.base](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_base_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_base_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.sentiment.imdb.deberta.large](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_large_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_large_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_large_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.news.deberta](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_ag_news_en_3_0.html) | [deberta_v3_xsmall_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_ag_news_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.news.deberta.small](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_ag_news_en_3_0.html) | [deberta_v3_small_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_ag_news_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| ur | [ur.classify.sentiment.imdb](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_imdb_ur_3_0.html) | [mdeberta_v3_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_imdb_ur_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| fr | [fr.classify.allocine](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_allocine_fr_3_0.html) | [mdeberta_v3_base_sequence_classifier_allocine](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_allocine_fr_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| ur | [ur.embed.bert_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ur_cased_ur_3_0.html) | [bert_embeddings_bert_base_ur_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ur_cased_ur_3_0.html) | Embeddings | BertEmbeddings |
| fr | [fr.embed.bert_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_fr_3_0.html) | [bert_embeddings_bert_base_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_fr_3_0.html) | Embeddings | BertEmbeddings |
| de | [de.embed.medbert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_German_MedBERT_de_3_0.html) | [bert_embeddings_German_MedBERT](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_German_MedBERT_de_3_0.html) | Embeddings | BertEmbeddings |
| ar | [ar.embed.arbert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_ARBERT_ar_3_0.html) | [bert_embeddings_ARBERT](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_ARBERT_ar_3_0.html) | Embeddings | BertEmbeddings |
| bn | [bn.embed.bangala_bert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bangla_bert_base_bn_3_0.html) | [bert_embeddings_bangla_bert_base](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bangla_bert_base_bn_3_0.html) | Embeddings | BertEmbeddings |
| zh | [zh.embed.bert_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_zh_3_0.html) | [bert_embeddings_bert_base_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_zh_3_0.html) | Embeddings | BertEmbeddings |
| hi | [hi.embed.bert_hi_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_hi_cased_hi_3_0.html) | [bert_embeddings_bert_base_hi_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_hi_cased_hi_3_0.html) | Embeddings | BertEmbeddings |
| it | [it.embed.bert_it_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_it_cased_it_3_0.html) | [bert_embeddings_bert_base_it_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_it_cased_it_3_0.html) | Embeddings | BertEmbeddings |
| ko | [ko.embed.bert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ko_3_0.html) | [bert_embeddings_bert_base](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ko_3_0.html) | Embeddings | BertEmbeddings |
| tr | [tr.embed.bert_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_tr_cased_tr_3_0.html) | [bert_embeddings_bert_base_tr_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_tr_cased_tr_3_0.html) | Embeddings | BertEmbeddings |
| vi | [vi.embed.bert_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_vi_cased_vi_3_0.html) | [bert_embeddings_bert_base_vi_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_vi_cased_vi_3_0.html) | Embeddings | BertEmbeddings |
| hif | [hif.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_hif_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_hif_3_0.html) | Embeddings | WordEmbeddingsModel |
| azb | [azb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_azb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_azb_3_0.html) | Embeddings | WordEmbeddingsModel |
| bo | [bo.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_bo_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_bo_3_0.html) | Embeddings | WordEmbeddingsModel |
| diq | [diq.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_diq_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_diq_3_0.html) | Embeddings | WordEmbeddingsModel |
| cy | [cy.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_cy_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_cy_3_0.html) | Embeddings | WordEmbeddingsModel |
| es | [es.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_es_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_es_3_0.html) | Embeddings | WordEmbeddingsModel |
| it | [it.embed.word2vec](https://nlp.johnsnowlabs.com/2022/03/07/w2v_cc_300d_it_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/07/w2v_cc_300d_it_3_0.html) | Embeddings | WordEmbeddingsModel |
| af | [af.lemma](https://nlp.johnsnowlabs.com/2021/04/02/lemma_af.html) | [lemma](https://nlp.johnsnowlabs.com/2021/04/02/lemma_af.html) | Lemmatization | LemmatizerModel |
| lt | [lt.lemma](https://nlp.johnsnowlabs.com/2022/03/31/lemma_alksnis_lt_3_0.html) | [lemma_alksnis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_alksnis_lt_3_0.html) | Lemmatization | LemmatizerModel |
| nl | [nl.lemma](https://nlp.johnsnowlabs.com/2020/05/03/lemma_nl.html) | [lemma](https://nlp.johnsnowlabs.com/2020/05/03/lemma_nl.html) | Lemmatization | LemmatizerModel |
| gd | [gd.lemma](https://nlp.johnsnowlabs.com/2022/03/31/lemma_arcosg_gd_3_0.html) | [lemma_arcosg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_arcosg_gd_3_0.html) | Lemmatization | LemmatizerModel |
| es | [es.lemma](https://nlp.johnsnowlabs.com/2020/02/16/lemma_es.html) | [lemma](https://nlp.johnsnowlabs.com/2020/02/16/lemma_es.html) | Lemmatization | LemmatizerModel |
| ca | [ca.lemma](https://nlp.johnsnowlabs.com/2020/07/29/lemma_ca.html) | [lemma](https://nlp.johnsnowlabs.com/2020/07/29/lemma_ca.html) | Lemmatization | LemmatizerModel |
| el | [el.lemma.gdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gdt_el_3_0.html) | [lemma_gdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gdt_el_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_en_3_0.html) | [lemma_atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_en_3_0.html) | Lemmatization | LemmatizerModel |
| tr | [tr.lemma.boun](https://nlp.johnsnowlabs.com/2022/03/31/lemma_boun_tr_3_0.html) | [lemma_boun](https://nlp.johnsnowlabs.com/2022/03/31/lemma_boun_tr_3_0.html) | Lemmatization | LemmatizerModel |
| da | [da.lemma.ddt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ddt_da_3_0.html) | [lemma_ddt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ddt_da_3_0.html) | Lemmatization | LemmatizerModel |
| cs | [cs.lemma.cac](https://nlp.johnsnowlabs.com/2022/03/31/lemma_cac_cs_3_0.html) | [lemma_cac](https://nlp.johnsnowlabs.com/2022/03/31/lemma_cac_cs_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.esl](https://nlp.johnsnowlabs.com/2022/03/31/lemma_esl_en_3_0.html) | [lemma_esl](https://nlp.johnsnowlabs.com/2022/03/31/lemma_esl_en_3_0.html) | Lemmatization | LemmatizerModel |
| bg | [bg.lemma.btb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_btb_bg_3_0.html) | [lemma_btb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_btb_bg_3_0.html) | Lemmatization | LemmatizerModel |
| id | [id.lemma.csui](https://nlp.johnsnowlabs.com/2022/03/31/lemma_csui_id_3_0.html) | [lemma_csui](https://nlp.johnsnowlabs.com/2022/03/31/lemma_csui_id_3_0.html) | Lemmatization | LemmatizerModel |
| gl | [gl.lemma.ctg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ctg_gl_3_0.html) | [lemma_ctg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ctg_gl_3_0.html) | Lemmatization | LemmatizerModel |
| cy | [cy.lemma.ccg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ccg_cy_3_0.html) | [lemma_ccg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ccg_cy_3_0.html) | Lemmatization | LemmatizerModel |
| fo | [fo.lemma.farpahc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_farpahc_fo_3_0.html) | [lemma_farpahc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_farpahc_fo_3_0.html) | Lemmatization | LemmatizerModel |
| tr | [tr.lemma.atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_tr_3_0.html) | [lemma_atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_tr_3_0.html) | Lemmatization | LemmatizerModel |
| ga | [ga.lemma.idt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_idt_ga_3_0.html) | [lemma_idt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_idt_ga_3_0.html) | Lemmatization | LemmatizerModel |
| ja | [ja.lemma.gsdluw](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdluw_ja_3_0.html) | [lemma_gsdluw](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdluw_ja_3_0.html) | Lemmatization | LemmatizerModel |
| es | [es.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_es_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_es_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.gum](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gum_en_3_0.html) | [lemma_gum](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gum_en_3_0.html) | Lemmatization | LemmatizerModel |
| zh | [zh.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_zh_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_zh_3_0.html) | Lemmatization | LemmatizerModel |
| lv | [lv.lemma.lvtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lvtb_lv_3_0.html) | [lemma_lvtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lvtb_lv_3_0.html) | Lemmatization | LemmatizerModel |
| hi | [hi.lemma.hdtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdtb_hi_3_0.html) | [lemma_hdtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdtb_hi_3_0.html) | Lemmatization | LemmatizerModel |
| pt | [pt.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_pt_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_pt_3_0.html) | Lemmatization | LemmatizerModel |
| de | [de.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_de_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_de_3_0.html) | Lemmatization | LemmatizerModel |
| nl | [nl.lemma.lassysmall](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lassysmall_nl_3_0.html) | [lemma_lassysmall](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lassysmall_nl_3_0.html) | Lemmatization | LemmatizerModel |
| lzh | [lzh.lemma.kyoto](https://nlp.johnsnowlabs.com/2022/03/31/lemma_kyoto_lzh_3_0.html) | [lemma_kyoto](https://nlp.johnsnowlabs.com/2022/03/31/lemma_kyoto_lzh_3_0.html) | Lemmatization | LemmatizerModel |
| zh | [zh.lemma.gsdsimp](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdsimp_zh_3_0.html) | [lemma_gsdsimp](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdsimp_zh_3_0.html) | Lemmatization | LemmatizerModel |
| he | [he.lemma.htb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_htb_he_3_0.html) | [lemma_htb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_htb_he_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_fr_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_fr_3_0.html) | Lemmatization | LemmatizerModel |
| ro | [ro.lemma.nonstandard](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nonstandard_ro_3_0.html) | [lemma_nonstandard](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nonstandard_ro_3_0.html) | Lemmatization | LemmatizerModel |
| ja | [ja.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_ja_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_ja_3_0.html) | Lemmatization | LemmatizerModel |
| it | [it.lemma.isdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_isdt_it_3_0.html) | [lemma_isdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_isdt_it_3_0.html) | Lemmatization | LemmatizerModel |
| de | [de.lemma.hdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdt_de_3_0.html) | [lemma_hdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdt_de_3_0.html) | Lemmatization | LemmatizerModel |
| is | [is.lemma.modern](https://nlp.johnsnowlabs.com/2022/03/31/lemma_modern_is_3_0.html) | [lemma_modern](https://nlp.johnsnowlabs.com/2022/03/31/lemma_modern_is_3_0.html) | Lemmatization | LemmatizerModel |
| la | [la.lemma.ittb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ittb_la_3_0.html) | [lemma_ittb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ittb_la_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_fr_3_0.html) | [lemma_partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_fr_3_0.html) | Lemmatization | LemmatizerModel |
| pcm | [pcm.lemma.nsc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nsc_pcm_3_0.html) | [lemma_nsc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nsc_pcm_3_0.html) | Lemmatization | LemmatizerModel |
| pl | [pl.lemma.pdb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdb_pl_3_0.html) | [lemma_pdb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdb_pl_3_0.html) | Lemmatization | LemmatizerModel |
| grc | [grc.lemma.perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_grc_3_0.html) | [lemma_perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_grc_3_0.html) | Lemmatization | LemmatizerModel |
| cs | [cs.lemma.pdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdt_cs_3_0.html) | [lemma_pdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdt_cs_3_0.html) | Lemmatization | LemmatizerModel |
| fa | [fa.lemma.perdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perdt_fa_3_0.html) | [lemma_perdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perdt_fa_3_0.html) | Lemmatization | LemmatizerModel |
| got | [got.lemma.proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_got_3_0.html) | [lemma_proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_got_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.rhapsodie](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rhapsodie_fr_3_0.html) | [lemma_rhapsodie](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rhapsodie_fr_3_0.html) | Lemmatization | LemmatizerModel |
| it | [it.lemma.partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_it_3_0.html) | [lemma_partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_it_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_en_3_0.html) | [lemma_partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_en_3_0.html) | Lemmatization | LemmatizerModel |
| no | [no.lemma.nynorsklia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nynorsklia_no_3_0.html) | [lemma_nynorsklia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nynorsklia_no_3_0.html) | Lemmatization | LemmatizerModel |
| orv | [orv.lemma.rnc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rnc_orv_3_0.html) | [lemma_rnc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rnc_orv_3_0.html) | Lemmatization | LemmatizerModel |
| cu | [cu.lemma.proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_cu_3_0.html) | [lemma_proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_cu_3_0.html) | Lemmatization | LemmatizerModel |
| la | [la.lemma.perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_la_3_0.html) | [lemma_perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_la_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.parisstories](https://nlp.johnsnowlabs.com/2022/03/31/lemma_parisstories_fr_3_0.html) | [lemma_parisstories](https://nlp.johnsnowlabs.com/2022/03/31/lemma_parisstories_fr_3_0.html) | Lemmatization | LemmatizerModel |
| fro | [fro.lemma.srcmf](https://nlp.johnsnowlabs.com/2022/03/31/lemma_srcmf_fro_3_0.html) | [lemma_srcmf](https://nlp.johnsnowlabs.com/2022/03/31/lemma_srcmf_fro_3_0.html) | Lemmatization | LemmatizerModel |
| vi | [vi.lemma.vtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vtb_vi_3_0.html) | [lemma_vtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vtb_vi_3_0.html) | Lemmatization | LemmatizerModel |
| qtd | [qtd.lemma.sagt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sagt_qtd_3_0.html) | [lemma_sagt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sagt_qtd_3_0.html) | Lemmatization | LemmatizerModel |
| ro | [ro.lemma.rrt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rrt_ro_3_0.html) | [lemma_rrt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rrt_ro_3_0.html) | Lemmatization | LemmatizerModel |
| hu | [hu.lemma.szeged](https://nlp.johnsnowlabs.com/2022/03/31/lemma_szeged_hu_3_0.html) | [lemma_szeged](https://nlp.johnsnowlabs.com/2022/03/31/lemma_szeged_hu_3_0.html) | Lemmatization | LemmatizerModel |
| ug | [ug.lemma.udt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udt_ug_3_0.html) | [lemma_udt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udt_ug_3_0.html) | Lemmatization | LemmatizerModel |
| wo | [wo.lemma.wtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_wtb_wo_3_0.html) | [lemma_wtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_wtb_wo_3_0.html) | Lemmatization | LemmatizerModel |
| cop | [cop.lemma.scriptorium](https://nlp.johnsnowlabs.com/2022/03/31/lemma_scriptorium_cop_3_0.html) | [lemma_scriptorium](https://nlp.johnsnowlabs.com/2022/03/31/lemma_scriptorium_cop_3_0.html) | Lemmatization | LemmatizerModel |
| ru | [ru.lemma.syntagrus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_syntagrus_ru_3_0.html) | [lemma_syntagrus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_syntagrus_ru_3_0.html) | Lemmatization | LemmatizerModel |
| ru | [ru.lemma.taiga](https://nlp.johnsnowlabs.com/2022/03/31/lemma_taiga_ru_3_0.html) | [lemma_taiga](https://nlp.johnsnowlabs.com/2022/03/31/lemma_taiga_ru_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.sequoia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sequoia_fr_3_0.html) | [lemma_sequoia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sequoia_fr_3_0.html) | Lemmatization | LemmatizerModel |
| la | [la.lemma.udante](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udante_la_3_0.html) | [lemma_udante](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udante_la_3_0.html) | Lemmatization | LemmatizerModel |
| ro | [ro.lemma.simonero](https://nlp.johnsnowlabs.com/2022/03/31/lemma_simonero_ro_3_0.html) | [lemma_simonero](https://nlp.johnsnowlabs.com/2022/03/31/lemma_simonero_ro_3_0.html) | Lemmatization | LemmatizerModel |
| it | [it.lemma.vit](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vit_it_3_0.html) | [lemma_vit](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vit_it_3_0.html) | Lemmatization | LemmatizerModel |
| hr | [hr.lemma.set](https://nlp.johnsnowlabs.com/2022/03/31/lemma_set_hr_3_0.html) | [lemma_set](https://nlp.johnsnowlabs.com/2022/03/31/lemma_set_hr_3_0.html) | Lemmatization | LemmatizerModel |
| fa | [fa.lemma.seraji](https://nlp.johnsnowlabs.com/2022/03/31/lemma_seraji_fa_3_0.html) | [lemma_seraji](https://nlp.johnsnowlabs.com/2022/03/31/lemma_seraji_fa_3_0.html) | Lemmatization | LemmatizerModel |
| tr | [tr.lemma.tourism](https://nlp.johnsnowlabs.com/2022/03/31/lemma_tourism_tr_3_0.html) | [lemma_tourism](https://nlp.johnsnowlabs.com/2022/03/31/lemma_tourism_tr_3_0.html) | Lemmatization | LemmatizerModel |
| ta | [ta.lemma.ttb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ttb_ta_3_0.html) | [lemma_ttb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ttb_ta_3_0.html) | Lemmatization | LemmatizerModel |
| sl | [sl.lemma.ssj](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ssj_sl_3_0.html) | [lemma_ssj](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ssj_sl_3_0.html) | Lemmatization | LemmatizerModel |
| sv | [sv.lemma.talbanken](https://nlp.johnsnowlabs.com/2022/03/31/lemma_talbanken_sv_3_0.html) | [lemma_talbanken](https://nlp.johnsnowlabs.com/2022/03/31/lemma_talbanken_sv_3_0.html) | Lemmatization | LemmatizerModel |
| uk | [uk.lemma.iu](https://nlp.johnsnowlabs.com/2022/03/31/lemma_iu_uk_3_0.html) | [lemma_iu](https://nlp.johnsnowlabs.com/2022/03/31/lemma_iu_uk_3_0.html) | Lemmatization | LemmatizerModel |
| te | [te.pos](https://nlp.johnsnowlabs.com/2021/03/10/pos_mtg_te.html) | [pos_mtg](https://nlp.johnsnowlabs.com/2021/03/10/pos_mtg_te.html) | Part of Speech Tagging | PerceptronModel |
| te | [te.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_mtg_te_3_0.html) | [pos_mtg](https://nlp.johnsnowlabs.com/2022/04/01/pos_mtg_te_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ta | [ta.pos](https://nlp.johnsnowlabs.com/2021/03/10/pos_ttb_ta.html) | [pos_ttb](https://nlp.johnsnowlabs.com/2021/03/10/pos_ttb_ta.html) | Part of Speech Tagging | PerceptronModel |
| ta | [ta.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_ttb_ta_3_0.html) | [pos_ttb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ttb_ta_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos](https://nlp.johnsnowlabs.com/2020/05/04/pos_ud_pdt_cs.html) | [pos_ud_pdt](https://nlp.johnsnowlabs.com/2020/05/04/pos_ud_pdt_cs.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos](https://nlp.johnsnowlabs.com/2021/03/08/pos_ud_pdt_cs.html) | [pos_ud_pdt](https://nlp.johnsnowlabs.com/2021/03/08/pos_ud_pdt_cs.html) | Part of Speech Tagging | PerceptronModel |
| bg | [bg.pos](https://nlp.johnsnowlabs.com/2021/03/23/pos_btb_bg.html) | [pos_btb](https://nlp.johnsnowlabs.com/2021/03/23/pos_btb_bg.html) | Part of Speech Tagging | PerceptronModel |
| bg | [bg.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_btb_bg_3_0.html) | [pos_btb](https://nlp.johnsnowlabs.com/2022/04/01/pos_btb_bg_3_0.html) | Part of Speech Tagging | PerceptronModel |
| af | [af.pos](https://nlp.johnsnowlabs.com/2021/03/16/pos_afribooms_af.html) | [pos_afribooms](https://nlp.johnsnowlabs.com/2021/03/16/pos_afribooms_af.html) | Part of Speech Tagging | PerceptronModel |
| af | [af.pos](https://nlp.johnsnowlabs.com/2021/04/06/pos_afribooms_af.html) | [pos_afribooms](https://nlp.johnsnowlabs.com/2021/04/06/pos_afribooms_af.html) | Part of Speech Tagging | PerceptronModel |
| af | [af.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_afribooms_af_3_0.html) | [pos_afribooms](https://nlp.johnsnowlabs.com/2022/04/01/pos_afribooms_af_3_0.html) | Part of Speech Tagging | PerceptronModel |
| es | [es.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_es_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_es_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_en_3_0.html) | [pos_ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| gd | [gd.pos.arcosg](https://nlp.johnsnowlabs.com/2022/04/01/pos_arcosg_gd_3_0.html) | [pos_arcosg](https://nlp.johnsnowlabs.com/2022/04/01/pos_arcosg_gd_3_0.html) | Part of Speech Tagging | PerceptronModel |
| el | [el.pos.gdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_gdt_el_3_0.html) | [pos_gdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_gdt_el_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hy | [hy.pos.armtdp](https://nlp.johnsnowlabs.com/2022/04/01/pos_armtdp_hy_3_0.html) | [pos_armtdp](https://nlp.johnsnowlabs.com/2022/04/01/pos_armtdp_hy_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pt | [pt.pos.bosque](https://nlp.johnsnowlabs.com/2022/04/01/pos_bosque_pt_3_0.html) | [pos_bosque](https://nlp.johnsnowlabs.com/2022/04/01/pos_bosque_pt_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.framenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_framenet_tr_3_0.html) | [pos_framenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_framenet_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos.cltt](https://nlp.johnsnowlabs.com/2022/04/01/pos_cltt_cs_3_0.html) | [pos_cltt](https://nlp.johnsnowlabs.com/2022/04/01/pos_cltt_cs_3_0.html) | Part of Speech Tagging | PerceptronModel |
| eu | [eu.pos.bdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_bdt_eu_3_0.html) | [pos_bdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_bdt_eu_3_0.html) | Part of Speech Tagging | PerceptronModel |
| et | [et.pos.ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_et_3_0.html) | [pos_ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_et_3_0.html) | Part of Speech Tagging | PerceptronModel |
| da | [da.pos.ddt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ddt_da_3_0.html) | [pos_ddt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ddt_da_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cy | [cy.pos.ccg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ccg_cy_3_0.html) | [pos_ccg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ccg_cy_3_0.html) | Part of Speech Tagging | PerceptronModel |
| lt | [lt.pos.alksnis](https://nlp.johnsnowlabs.com/2022/04/01/pos_alksnis_lt_3_0.html) | [pos_alksnis](https://nlp.johnsnowlabs.com/2022/04/01/pos_alksnis_lt_3_0.html) | Part of Speech Tagging | PerceptronModel |
| nl | [nl.pos.alpino](https://nlp.johnsnowlabs.com/2022/04/01/pos_alpino_nl_3_0.html) | [pos_alpino](https://nlp.johnsnowlabs.com/2022/04/01/pos_alpino_nl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fi | [fi.pos.ftb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ftb_fi_3_0.html) | [pos_ftb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ftb_fi_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.atis](https://nlp.johnsnowlabs.com/2022/04/01/pos_atis_tr_3_0.html) | [pos_atis](https://nlp.johnsnowlabs.com/2022/04/01/pos_atis_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ca | [ca.pos.ancora](https://nlp.johnsnowlabs.com/2022/04/01/pos_ancora_ca_3_0.html) | [pos_ancora](https://nlp.johnsnowlabs.com/2022/04/01/pos_ancora_ca_3_0.html) | Part of Speech Tagging | PerceptronModel |
| gl | [gl.pos.ctg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ctg_gl_3_0.html) | [pos_ctg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ctg_gl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| de | [de.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_de_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_de_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fr | [fr.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_fr_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_fr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ja | [ja.pos.gsdluw](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsdluw_ja_3_0.html) | [pos_gsdluw](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsdluw_ja_3_0.html) | Part of Speech Tagging | PerceptronModel |
| it | [it.pos.isdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_isdt_it_3_0.html) | [pos_isdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_isdt_it_3_0.html) | Part of Speech Tagging | PerceptronModel |
| be | [be.pos.hse](https://nlp.johnsnowlabs.com/2022/04/01/pos_hse_be_3_0.html) | [pos_hse](https://nlp.johnsnowlabs.com/2022/04/01/pos_hse_be_3_0.html) | Part of Speech Tagging | PerceptronModel |
| nl | [nl.pos.lassysmall](https://nlp.johnsnowlabs.com/2022/04/01/pos_lassysmall_nl_3_0.html) | [pos_lassysmall](https://nlp.johnsnowlabs.com/2022/04/01/pos_lassysmall_nl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sv | [sv.pos.lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_sv_3_0.html) | [pos_lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_sv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| uk | [uk.pos.iu](https://nlp.johnsnowlabs.com/2022/04/01/pos_iu_uk_3_0.html) | [pos_iu](https://nlp.johnsnowlabs.com/2022/04/01/pos_iu_uk_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fr | [fr.pos.parisstories](https://nlp.johnsnowlabs.com/2022/04/01/pos_parisstories_fr_3_0.html) | [pos_parisstories](https://nlp.johnsnowlabs.com/2022/04/01/pos_parisstories_fr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_en_3_0.html) | [pos_partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| la | [la.pos.ittb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ittb_la_3_0.html) | [pos_ittb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ittb_la_3_0.html) | Part of Speech Tagging | PerceptronModel |
| lzh | [lzh.pos.kyoto](https://nlp.johnsnowlabs.com/2022/04/01/pos_kyoto_lzh_3_0.html) | [pos_kyoto](https://nlp.johnsnowlabs.com/2022/04/01/pos_kyoto_lzh_3_0.html) | Part of Speech Tagging | PerceptronModel |
| id | [id.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_id_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_id_3_0.html) | Part of Speech Tagging | PerceptronModel |
| he | [he.pos.htb](https://nlp.johnsnowlabs.com/2022/04/01/pos_htb_he_3_0.html) | [pos_htb](https://nlp.johnsnowlabs.com/2022/04/01/pos_htb_he_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.kenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_kenet_tr_3_0.html) | [pos_kenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_kenet_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| de | [de.pos.hdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdt_de_3_0.html) | [pos_hdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdt_de_3_0.html) | Part of Speech Tagging | PerceptronModel |
| qhe | [qhe.pos.hiencs](https://nlp.johnsnowlabs.com/2022/04/01/pos_hiencs_qhe_3_0.html) | [pos_hiencs](https://nlp.johnsnowlabs.com/2022/04/01/pos_hiencs_qhe_3_0.html) | Part of Speech Tagging | PerceptronModel |
| la | [la.pos.llct](https://nlp.johnsnowlabs.com/2022/04/01/pos_llct_la_3_0.html) | [pos_llct](https://nlp.johnsnowlabs.com/2022/04/01/pos_llct_la_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_en_3_0.html) | [pos_lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pcm | [pcm.pos.nsc](https://nlp.johnsnowlabs.com/2022/04/01/pos_nsc_pcm_3_0.html) | [pos_nsc](https://nlp.johnsnowlabs.com/2022/04/01/pos_nsc_pcm_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ko | [ko.pos.kaist](https://nlp.johnsnowlabs.com/2022/04/01/pos_kaist_ko_3_0.html) | [pos_kaist](https://nlp.johnsnowlabs.com/2022/04/01/pos_kaist_ko_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pt | [pt.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_pt_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_pt_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hi | [hi.pos.hdtb](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdtb_hi_3_0.html) | [pos_hdtb](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdtb_hi_3_0.html) | Part of Speech Tagging | PerceptronModel |
| is | [is.pos.modern](https://nlp.johnsnowlabs.com/2022/04/01/pos_modern_is_3_0.html) | [pos_modern](https://nlp.johnsnowlabs.com/2022/04/01/pos_modern_is_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.gum](https://nlp.johnsnowlabs.com/2022/04/01/pos_gum_en_3_0.html) | [pos_gum](https://nlp.johnsnowlabs.com/2022/04/01/pos_gum_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fro | [fro.pos.srcmf](https://nlp.johnsnowlabs.com/2022/04/01/pos_srcmf_fro_3_0.html) | [pos_srcmf](https://nlp.johnsnowlabs.com/2022/04/01/pos_srcmf_fro_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sl | [sl.pos.ssj](https://nlp.johnsnowlabs.com/2022/04/01/pos_ssj_sl_3_0.html) | [pos_ssj](https://nlp.johnsnowlabs.com/2022/04/01/pos_ssj_sl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ru | [ru.pos.taiga](https://nlp.johnsnowlabs.com/2022/04/01/pos_taiga_ru_3_0.html) | [pos_taiga](https://nlp.johnsnowlabs.com/2022/04/01/pos_taiga_ru_3_0.html) | Part of Speech Tagging | PerceptronModel |
| grc | [grc.pos.perseus](https://nlp.johnsnowlabs.com/2022/04/01/pos_perseus_grc_3_0.html) | [pos_perseus](https://nlp.johnsnowlabs.com/2022/04/01/pos_perseus_grc_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sr | [sr.pos.set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_sr_3_0.html) | [pos_set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_sr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| orv | [orv.pos.rnc](https://nlp.johnsnowlabs.com/2022/04/01/pos_rnc_orv_3_0.html) | [pos_rnc](https://nlp.johnsnowlabs.com/2022/04/01/pos_rnc_orv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ug | [ug.pos.udt](https://nlp.johnsnowlabs.com/2022/04/01/pos_udt_ug_3_0.html) | [pos_udt](https://nlp.johnsnowlabs.com/2022/04/01/pos_udt_ug_3_0.html) | Part of Speech Tagging | PerceptronModel |
| got | [got.pos.proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_got_3_0.html) | [pos_proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_got_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sv | [sv.pos.talbanken](https://nlp.johnsnowlabs.com/2021/03/23/pos_talbanken_sv.html) | [pos_talbanken](https://nlp.johnsnowlabs.com/2021/03/23/pos_talbanken_sv.html) | Part of Speech Tagging | PerceptronModel |
| sv | [sv.pos.talbanken](https://nlp.johnsnowlabs.com/2022/04/01/pos_talbanken_sv_3_0.html) | [pos_talbanken](https://nlp.johnsnowlabs.com/2022/04/01/pos_talbanken_sv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pl | [pl.pos.pdb](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdb_pl_3_0.html) | [pos_pdb](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdb_pl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fa | [fa.pos.seraji](https://nlp.johnsnowlabs.com/2022/04/01/pos_seraji_fa_3_0.html) | [pos_seraji](https://nlp.johnsnowlabs.com/2022/04/01/pos_seraji_fa_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.penn](https://nlp.johnsnowlabs.com/2022/04/01/pos_penn_tr_3_0.html) | [pos_penn](https://nlp.johnsnowlabs.com/2022/04/01/pos_penn_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hu | [hu.pos.szeged](https://nlp.johnsnowlabs.com/2022/04/01/pos_szeged_hu_3_0.html) | [pos_szeged](https://nlp.johnsnowlabs.com/2022/04/01/pos_szeged_hu_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sk | [sk.pos.snk](https://nlp.johnsnowlabs.com/2021/03/23/pos_snk_sk.html) | [pos_snk](https://nlp.johnsnowlabs.com/2021/03/23/pos_snk_sk.html) | Part of Speech Tagging | PerceptronModel |
| sk | [sk.pos.snk](https://nlp.johnsnowlabs.com/2022/04/01/pos_snk_sk_3_0.html) | [pos_snk](https://nlp.johnsnowlabs.com/2022/04/01/pos_snk_sk_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ro | [ro.pos.simonero](https://nlp.johnsnowlabs.com/2022/04/01/pos_simonero_ro_3_0.html) | [pos_simonero](https://nlp.johnsnowlabs.com/2022/04/01/pos_simonero_ro_3_0.html) | Part of Speech Tagging | PerceptronModel |
| it | [it.pos.postwita](https://nlp.johnsnowlabs.com/2022/04/01/pos_postwita_it_3_0.html) | [pos_postwita](https://nlp.johnsnowlabs.com/2022/04/01/pos_postwita_it_3_0.html) | Part of Speech Tagging | PerceptronModel |
| gl | [gl.pos.treegal](https://nlp.johnsnowlabs.com/2022/04/01/pos_treegal_gl_3_0.html) | [pos_treegal](https://nlp.johnsnowlabs.com/2022/04/01/pos_treegal_gl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos.pdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdt_cs_3_0.html) | [pos_pdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdt_cs_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ro | [ro.pos.rrt](https://nlp.johnsnowlabs.com/2022/04/01/pos_rrt_ro_3_0.html) | [pos_rrt](https://nlp.johnsnowlabs.com/2022/04/01/pos_rrt_ro_3_0.html) | Part of Speech Tagging | PerceptronModel |
| orv | [orv.pos.torot](https://nlp.johnsnowlabs.com/2022/04/01/pos_torot_orv_3_0.html) | [pos_torot](https://nlp.johnsnowlabs.com/2022/04/01/pos_torot_orv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hr | [hr.pos.set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_hr_3_0.html) | [pos_set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_hr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| la | [la.pos.proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_la_3_0.html) | [pos_proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_la_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fr | [fr.pos.partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_fr_3_0.html) | [pos_partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_fr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| it | [it.pos.vit](https://nlp.johnsnowlabs.com/2022/04/01/pos_vit_it_3_0.html) | [pos_vit](https://nlp.johnsnowlabs.com/2022/04/01/pos_vit_it_3_0.html) | Part of Speech Tagging | PerceptronModel |

Bugfixes

- Improved Error Messages and integrated detection and stopping of endless loops which could occur during construction
of nlu pipelines

Additional NLU resources
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!

1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.4.2

Multilingual DeBERTa Transformer Embeddings for 100+ Languages, Spanish Deidentification and NER for Randomized Clinical Trials - John Snow Labs NLU 3.4.2

We are very excited NLU 3.4.2 has been released.
On the open source side we have 5 new DeBERTa Transformer models for English and Multi-Lingual for 100+ languages.
DeBERTa improves over BERT and RoBERTa by introducing two novel techniques.

For the healthcare side we have new NER models for randomized clinical trials (RCT) which can detect entities of type
`BACKGROUND`, `CONCLUSIONS`, `METHODS`, `OBJECTIVE`, `RESULTS` from clinical text.
Additionally, new Spanish Deidentification NER models for entities like `STATE`, `PATIENT`, `DEVICE`, `COUNTRY`, `ZIP`, `PHONE`, `HOSPITAL` and many more.

New Open Source Models

Integrates models from [Spark NLP 3.4.2](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.4.2) release

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:----------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------|:-----------|:------------------|
| en | [en.embed.deberta_v3_xsmall](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_xsmall_en_3_0.html) | [deberta_v3_xsmall](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_xsmall_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| en | [en.embed.deberta_v3_small](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_small_en_3_0.html) | [deberta_v3_small](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_small_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| en | [en.embed.deberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_base_en_3_0.html) | [deberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_base_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| en | [en.embed.deberta_v3_large](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_large_en_3_0.html) | [deberta_v3_large](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_large_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| xx | [xx.embed.mdeberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/mdeberta_v3_base_xx_3_0.html) | [mdeberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/mdeberta_v3_base_xx_3_0.html) | Embeddings | DeBertaEmbeddings |

New Healthcare Models

Integrates models from [Spark NLP For Healthcare 3.4.2](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#342) release

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:--------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------|:-------------------------------------|
| en | [en.med_ner.clinical_trials](https://nlp.johnsnowlabs.com/2022/03/01/bert_sequence_classifier_rct_biobert_en_2_4.html) | [bert_sequence_classifier_rct_biobert](https://nlp.johnsnowlabs.com/2022/03/01/bert_sequence_classifier_rct_biobert_en_2_4.html) | Text Classification | MedicalBertForSequenceClassification |
| es | [es.med_ner.deid.generic.roberta](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_generic_roberta_augmented_es.html) | [ner_deid_generic_roberta_augmented](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_generic_roberta_augmented_es.html) | De-identification | MedicalNerModel |
| es | [es.med_ner.deid.subentity.roberta](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_subentity_roberta_augmented_es.html) | [ner_deid_subentity_roberta_augmented](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_subentity_roberta_augmented_es.html) | De-identification | MedicalNerModel |
| en | [en.med_ner.deid.generic_augmented](https://nlp.johnsnowlabs.com/2021/06/30/ner_deid_generic_augmented_en.html) | [ner_deid_generic_augmented](https://nlp.johnsnowlabs.com/2021/06/30/ner_deid_generic_augmented_en.html) | ['Named Entity Recognition', 'De-identification'] | MedicalNerModel |
| en | [en.med_ner.deid.subentity_augmented](https://nlp.johnsnowlabs.com/2021/06/01/ner_deid_subentity_augmented_en.html) | [ner_deid_subentity_augmented](https://nlp.johnsnowlabs.com/2021/06/01/ner_deid_subentity_augmented_en.html) | ['Named Entity Recognition', 'De-identification'] | MedicalNerModel |

Additional NLU resources
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!

1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.4.1

We are very excited to announce the release of NLU 3.4.1
which features 22 new models for 23 languages where the
The open-source side covers new Embeddings for Vietnamese and English Clinical domains and Multilingual Embeddings for 12 Indian and 9 African Languages.
Additionally, there are new Sequence classifiers for Multilingual NER for 9 African languages,
German Sentiment Classifiers and English Emotion and Typo Classifiers.
The healthcare side covers Medical Spanish models, Classifiers for Drugs, Gender, the Pico Framework, and Relation Extractors for Adverse Drug events and Temporality.
Finally, Spark 3.2.X is now supported and bugs related to Databricks environments have been fixed.

General NLU Improvements
- Support for Spark 3.2.x

New Open Source Models
Based on the amazing [3.4.1 Spark NLP Release](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.4.1)
integrates new Multilingual embeddings for 12 Major Indian languages,
embeddings for Vietnamese, French, and English Clinical domains.
Additionally new Multilingual NER model for 9 African languages, English 6 Class Emotion classifier and Typo detectors.

New Embeddings
- **Multilingual ALBERT - IndicBert** model pretrained exclusively on 12 major Indian languages with size smaller and performance on par or better than competing models. Languages covered are Assamese, Bengali, English, Gujarati, Hindi, Kannada, Malayalam, Marathi, Oriya, Punjabi, Tamil, Telugu.
Available with [xx.embed.albert.indic](https://nlp.johnsnowlabs.com/2022/01/26/albert_indic_xx.html)
- **Fine tuned Vietnamese DistilBERT** Base cased embeddings. Available with [vi.embed.distilbert.cased](https://nlp.johnsnowlabs.com/2022/01/13/distilbert_base_cased_vi.html)
- **Clinical Longformer Embeddings** which consistently out-performs ClinicalBERT for various downstream
tasks and on datasets. Available with [en.embed.longformer.clinical](https://nlp.johnsnowlabs.com/2022/02/08/clinical_longformer_en.html)
- **Fine tuned Static French Word2Vec Embeddings** in 3 sizes, 200d, 300d and 100d. Available with [fr.embed.word2vec_wiki_1000](https://nlp.johnsnowlabs.com/2022/01/26/word2vec_wiki_1000_fr.html), [fr.embed.word2vec_wac_200](https://nlp.johnsnowlabs.com/2022/02/01/word2vec_wac_200_fr.html) and [fr.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/02/03/w2v_cc_300d_fr.html)

New Transformer based Token and Sequence Classifiers
- **Multilingual NER Distilbert** model which detects entities `DATE`, `LOC`, `ORG`, `PER` for the languages 9 African languages (Hausa, Igbo, Kinyarwanda, Luganda, Nigerian, Pidgin, Swahili, Wolof, and Yorùbá).
Available with [xx.ner.masakhaner.distilbert](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html)
- **German News Sentiment Classifier** available with [de.classify.news_sentiment.bert](https://nlp.johnsnowlabs.com/2022/01/18/bert_sequence_classifier_news_sentiment_de.html)
- **English Emotion Classifier for 6 Classes** available with [en.classify.emotion.bert](https://nlp.johnsnowlabs.com/2022/01/14/bert_sequence_classifier_emotion_en.html)
- **English Typo Detector **: available with [en.classify.typos.distilbert](https://nlp.johnsnowlabs.com/2022/01/19/distilbert_token_classifier_typo_detector_en.html)

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:---------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:---------------------------------|
| xx | [xx.embed.albert.indic](https://nlp.johnsnowlabs.com/2022/01/26/albert_indic_xx.html) | [albert_indic](https://nlp.johnsnowlabs.com/2022/01/26/albert_indic_xx.html) | Embeddings | AlbertEmbeddings |
| xx | [xx.ner.masakhaner.distilbert](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html) | [xlm_roberta_large_token_classifier_masakhaner](https://nlp.johnsnowlabs.com/2022/01/18/distilbert_base_token_classifier_masakhaner_xx.html) | Named Entity Recognition | DistilBertForTokenClassification |
| en | [en.embed.longformer.clinical](https://nlp.johnsnowlabs.com/2022/02/08/clinical_longformer_en.html) | [clinical_longformer](https://nlp.johnsnowlabs.com/2022/02/08/clinical_longformer_en.html) | Embeddings | LongformerEmbeddings |
| en | [en.classify.emotion.bert](https://nlp.johnsnowlabs.com/2022/01/14/bert_sequence_classifier_emotion_en.html) | [bert_sequence_classifier_emotion](https://nlp.johnsnowlabs.com/2022/01/14/bert_sequence_classifier_emotion_en.html) | Text Classification | BertForSequenceClassification |
| de | [de.classify.news_sentiment.bert](https://nlp.johnsnowlabs.com/2022/01/18/bert_sequence_classifier_news_sentiment_de.html) | [bert_sequence_classifier_news_sentiment](https://nlp.johnsnowlabs.com/2022/01/18/bert_sequence_classifier_news_sentiment_de.html) | Sentiment Analysis | BertForSequenceClassification |
| en | [en.classify.typos.distilbert](https://nlp.johnsnowlabs.com/2022/01/19/distilbert_token_classifier_typo_detector_en.html) | [distilbert_token_classifier_typo_detector](https://nlp.johnsnowlabs.com/2022/01/19/distilbert_token_classifier_typo_detector_en.html) | Named Entity Recognition | DistilBertForTokenClassification |
| fr | [fr.embed.word2vec_wiki_1000](https://nlp.johnsnowlabs.com/2022/01/26/word2vec_wiki_1000_fr.html) | [word2vec_wiki_1000](https://nlp.johnsnowlabs.com/2022/01/26/word2vec_wiki_1000_fr.html) | Embeddings | WordEmbeddingsModel |
| fr | [fr.embed.word2vec_wac_200](https://nlp.johnsnowlabs.com/2022/02/01/word2vec_wac_200_fr.html) | [word2vec_wac_200](https://nlp.johnsnowlabs.com/2022/02/01/word2vec_wac_200_fr.html) | Embeddings | WordEmbeddingsModel |
| fr | [fr.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/02/03/w2v_cc_300d_fr.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/02/03/w2v_cc_300d_fr.html) | Embeddings | WordEmbeddingsModel |
| vi | [vi.embed.distilbert.cased](https://nlp.johnsnowlabs.com/2022/01/13/distilbert_base_cased_vi.html) | [distilbert_base_cased](https://nlp.johnsnowlabs.com/2022/01/13/distilbert_base_cased_vi.html) | Embeddings | DistilBertEmbeddings |

New Healthcare Models
Integrated from the amazing [3.4.1 Spark NLP For Healthcare Release](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#341).
which makes 2 new Annotator Classes available, `MedicalBertForSequenceClassification` and `MedicalDistilBertForSequenceClassification`,
various medical Spanish models, RxNorm Resolvers,
Transformer based sequence classifiers for Drugs, Gender and the PICO framework,
and Relation extractors for Temporality and Causality of Drugs and Adverse Events.

New Medical Spanish Models
- **Spanish Word2Vec Embeddings** available with [es.embed.sciwiki_300d](https://nlp.johnsnowlabs.com/2020/05/27/embeddings_sciwiki_300d_es.html)
- **Spanish PHI Deidentification NER models** with two different subsets of entities extracted, available with [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_generic_es.html) and [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_subentity_es.html)

New Resolvers
- **RxNorm resolvers** with augmented concept data available with [en.med_ner.supplement_clinical](https://nlp.johnsnowlabs.com/2022/02/01/ner_supplement_clinical_en.html)

New Transformer based Sequence Classifiers
- **Adverse Drug Event Classifier Biobert based** available with [en.classify.ade.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_ade_en.html)
- **Patient Gender Classifier Biobert and Distilbert based** available with [en.classify.gender.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_gender_biobert_en.html)
and available with [en.classify.ade.seq_distilbert](https://nlp.johnsnowlabs.com/2022/02/08/distilbert_sequence_classifier_ade_en.html)
- **PiCO Framework Classifier** available with [en.classify.pico.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/07/bert_sequence_classifier_pico_biobert_en.html)

New Relation Extractors
- **Temporal Relation Extractor** available with [en.relation.temporal_events_clinical](https://nlp.johnsnowlabs.com/2020/09/28/re_temporal_events_clinical_en.html)
- **Adverse Drug Event Relation Extractors** one version Biobert Embeddings and one non-DL version available with [en.relation.adverse_drug_events.clinical](https://nlp.johnsnowlabs.com/2021/07/12/re_ade_clinical_en.html) available with [en.relation.adverse_drug_events.clinical.biobert](https://nlp.johnsnowlabs.com/2021/07/12/redl_ade_biobert_en.html)

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:--------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:-------------------------------------------|
| es | [es.embed.sciwiki_300d](https://nlp.johnsnowlabs.com/2020/05/27/embeddings_sciwiki_300d_es.html) | [embeddings_sciwiki_300d](https://nlp.johnsnowlabs.com/2020/05/27/embeddings_sciwiki_300d_es.html) | Embeddings | WordEmbeddingsModel |
| es | [es.med_ner.deid.generic](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_generic_es.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_generic_es.html) | De-identification | MedicalNerModel |
| es | [es.med_ner.deid.subentity](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_subentity_es.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_subentity_es.html) | De-identification | MedicalNerModel |
| en | [en.med_ner.supplement_clinical](https://nlp.johnsnowlabs.com/2022/02/01/ner_supplement_clinical_en.html) | [ner_supplement_clinical](https://nlp.johnsnowlabs.com/2022/02/01/ner_supplement_clinical_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.resolve.rxnorm.augmented_re](https://nlp.johnsnowlabs.com/2022/02/09/sbiobertresolve_rxnorm_augmented_re_en.html) | [sbiobertresolve_rxnorm_augmented_re](https://nlp.johnsnowlabs.com/2022/02/09/sbiobertresolve_rxnorm_augmented_re_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.classify.ade.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_ade_en.html) | [bert_sequence_classifier_ade](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_ade_en.html) | Text Classification | MedicalBertForSequenceClassification |
| en | [en.classify.gender.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_gender_biobert_en.html) | [bert_sequence_classifier_gender_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_gender_biobert_en.html) | Text Classification | MedicalBertForSequenceClassification |
| en | [en.classify.pico.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/07/bert_sequence_classifier_pico_biobert_en.html) | [bert_sequence_classifier_pico_biobert](https://nlp.johnsnowlabs.com/2022/02/07/bert_sequence_classifier_pico_biobert_en.html) | Text Classification | MedicalBertForSequenceClassification |
| en | [en.classify.ade.seq_distilbert](https://nlp.johnsnowlabs.com/2022/02/08/distilbert_sequence_classifier_ade_en.html) | [distilbert_sequence_classifier_ade](https://nlp.johnsnowlabs.com/2022/02/08/distilbert_sequence_classifier_ade_en.html) | Text Classification | MedicalDistilBertForSequenceClassification |
| en | [en.relation.temporal_events_clinical](https://nlp.johnsnowlabs.com/2020/09/28/re_temporal_events_clinical_en.html) | [re_temporal_events_clinical](https://nlp.johnsnowlabs.com/2020/09/28/re_temporal_events_clinical_en.html) | Relation Extraction | RelationExtractionModel |
| en | [en.relation.adverse_drug_events.clinical](https://nlp.johnsnowlabs.com/2021/07/12/re_ade_clinical_en.html) | [re_ade_clinical](https://nlp.johnsnowlabs.com/2021/07/12/re_ade_clinical_en.html) | Relation Extraction | RelationExtractionModel |
| en | [en.relation.adverse_drug_events.clinical.biobert](https://nlp.johnsnowlabs.com/2021/07/12/redl_ade_biobert_en.html) | [redl_ade_biobert](https://nlp.johnsnowlabs.com/2021/07/12/redl_ade_biobert_en.html) | Relation Extraction | RelationExtractionDLModel |

Bugfixes
- Fixed bug that caused non-default output level of components to be sentence
- Fixed a bug that caused nlu references pointing to pretrained pipelines in spark nlp to crash in Databricks environments

Additional NLU resources
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!

1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.4.0

We are incredibly excited to announce John Snow Labs NLU 3.4.0 has been released!
This release features `11 new annotator classes` and `80` new models, including 3 `OCR Transformers` which enable you to extract text
from various file types, support for `GPT2` and new pretrained `T5` models for **Text Generation** and dozens more of new transformer based models
for **Token and Sequence Classification**.
This includes `8 new Sequence classifier models` which can be pretrained in Huggingface and imported into Spark NLP and NLU.
Finally, the NLU tutorial page of the [140+ notebooks has been updated](https://nlu.johnsnowlabs.com/docs/en/notebooks)

**New** NLU OCR Features
3 new OCR based spells are supported, which enable extracting `text` from files of type
`JPEG`, `PNG`, `BMP`, `WBMP`, `GIF`, `JPG`, `TIFF`, `DOCX`, `PDF` in just 1 line of code.
You need a Spark OCR license for using these, which is available for [free here](https://www.johnsnowlabs.com/spark-nlp-try-free/) and refer to the new
[OCR tutorial notebook](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/ocr/ocr_for_img_pdf_docx_files.ipynb)
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/ocr/ocr_for_img_pdf_docx_files.ipynb)
Find more details on the [NLU OCR documentation page](https://nlu.johnsnowlabs.com/docs/en/nlu_for_ocr)

**New** NLU Healthcare Features
The healthcare side features a new `MedicalBertForTokenClassifier` annotator which is a Bert based model for token classification problems like `Named Entity Recognition`,
`Parts of Speech` and much more. Overall there are `28` new models which include German De-Identification models, English NER models for extracting `Drug Development Trials`,
`Clinical Abbreviations and Acronyms`, NER models for chemical compounds/drugs and genes/proteins, updated `MedicalBertForTokenClassifier` NER models for the medical domains `Adverse drug Events`,
`Anatomy`, `Chemicals`, `Genes`,`Proteins`, `Cellular/Molecular Biology`, `Drugs`, `Bacteria`, `De-Identification` and general Medical and Clinical Named Entities.
For **Entity Relation Extraction** between entity pairs new models for interaction between `Drugs and Proteins`.
For **Entity Resolution** new models for resolving `Clinical Abbreviations and Acronyms` to their full length names and also a model for resolving `Drug Substance Entities` to the categories
`Clinical Drug`, `Pharmacologic Substance`, `Antibiotic`, `Hazardous` or `Poisonous Substance` and new resolvers for `LOINC` and `SNOMED` terminologies.

**New** NLU Open source Features
On the open source side we have new support for [Open Ai's `GPT2`](https://openai.com/blog/tags/gpt-2/) for various text sequence to sequence problems and
additionally the following new Transformer models are supported :
`RoBertaForSequenceClassification`, `XlmRoBertaForSequenceClassification`, `LongformerForSequenceClassification`,
`AlbertForSequenceClassification`, `XlnetForSequenceClassification`, `Word2Vec` with various pre-trained weights for various problems!

New **GPT2** models for generating text conditioned on some input,
New **T5 style transfer models** for `active to passive`, `formal to informal`, `informal to formal`, `passive to active` sequence to sequence generation.
Additionally, a new T5 model for generating SQL code from natural language input is provided.

On top of this dozens new Transformer based Sequence Classifiers and Token Classifiers have been released, this is includes for `Token Classifier` the following models :
Multi-Lingual general NER models for **10 African Languages** (`Amharic`, `Hausa`, `Igbo`, `Kinyarwanda`, `Luganda`, `Nigerian`, `Pidgin`, `Swahilu`, `Wolof`, and `Yorùbá`),
**10 high resourced languages** (10 high resourced languages (`Arabic`, `German`, `English`, `Spanish`, `French`, `Italian`, `Latvian`, `Dutch`, `Portuguese` and `Chinese`),
**6 Scandinavian languages** (`Danish`, `Norwegian-Bokmål`, `Norwegian-Nynorsk`, `Swedish`, `Icelandic`, `Faroese`) ,
Uni-Lingual NER models for general entites in the language `Chinese`, `Hindi`, `Islandic`, `Indonesian`
and finally English NER models for extracting entities related to `Stocks Ticker Symbols`, `Restaurants`, `Time`.

For `Sequence Classification` new models for classifying `Toxicity in Russian text` and English models for
`Movie Reviews`, `News Categorization`, `Sentimental Tone` and `General Sentiment`

New NLU OCR Models
The following Transformers have been integrated from [Spark OCR](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components)

| NLU Spell | Transformer Class |
|----------------------|-----------------------------------------------------------------------------------------|
| nlu.load(`img2text`) | [ImageToText](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components#imagetotext) |
| nlu.load(`pdf2text`) | [PdfToText](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components#pdftotext) |
| nlu.load(`doc2text`) | [DocToText](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components#doctotext) |

New Open Source Models

Integration for the 49 new models from the colossal [Spark NLP 3.4.0 release](https://nlp.johnsnowlabs.com/docs/en/release_notes#340)

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:----------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:------------------------------------|
| en | [en.gpt2.distilled](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_distilled_en.html) | [gpt2_distilled](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_distilled_en.html) | Text Generation | GPT2Transformer |
| en | [en.gpt2](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_en.html) | [gpt2](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_en.html) | Text Generation | GPT2Transformer |
| en | [en.gpt2.medium](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_medium_en.html) | [gpt2_medium](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_medium_en.html) | Text Generation | GPT2Transformer |
| en | [en.gpt2.large](https://nlp.johnsnowlabs.com/2021/12/03/gpt_large_en.html) | [gpt_large](https://nlp.johnsnowlabs.com/2021/12/03/gpt_large_en.html) | Text Generation | GPT2Transformer |
| en | [en.t5.active_to_passive_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_active_to_passive_styletransfer_en.html) | [t5_active_to_passive_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_active_to_passive_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.formal_to_informal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_formal_to_informal_styletransfer_en.html) | [t5_formal_to_informal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_formal_to_informal_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.grammar_error_corrector](https://nlp.johnsnowlabs.com/2022/01/12/t5_grammar_error_corrector_en.html) | [t5_grammar_error_corrector](https://nlp.johnsnowlabs.com/2022/01/12/t5_grammar_error_corrector_en.html) | Text Generation | T5Transformer |
| en | [en.t5.informal_to_formal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_informal_to_formal_styletransfer_en.html) | [t5_informal_to_formal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_informal_to_formal_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.passive_to_active_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_passive_to_active_styletransfer_en.html) | [t5_passive_to_active_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_passive_to_active_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.wikiSQL](https://nlp.johnsnowlabs.com/2022/01/12/t5_small_wikiSQL_en.html) | [t5_small_wikiSQL](https://nlp.johnsnowlabs.com/2022/01/12/t5_small_wikiSQL_en.html) | Text Generation | T5Transformer |
| xx | [xx.ner.masakhaner](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html) | [xlm_roberta_large_token_classifier_masakhaner](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| xx | [xx.ner.high_resourced_lang](https://nlp.johnsnowlabs.com/2021/12/26/xlm_roberta_large_token_classifier_hrl_xx.html) | [xlm_roberta_large_token_classifier_hrl](https://nlp.johnsnowlabs.com/2021/12/26/xlm_roberta_large_token_classifier_hrl_xx.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| xx | [xx.ner.scandinavian](https://nlp.johnsnowlabs.com/2021/12/09/bert_token_classifier_scandi_ner_xx.html) | [bert_token_classifier_scandi_ner](https://nlp.johnsnowlabs.com/2021/12/09/bert_token_classifier_scandi_ner_xx.html) | Named Entity Recognition | BertForTokenClassification |
| en | [en.embed.electra.medical](https://nlp.johnsnowlabs.com/2022/01/04/electra_medal_acronym_en.html) | [electra_medal_acronym](https://nlp.johnsnowlabs.com/2022/01/04/electra_medal_acronym_en.html) | Embeddings | BertEmbeddings |
| en | [en.ner.restaurant](https://nlp.johnsnowlabs.com/2021/12/31/nerdl_restaurant_100d_en.html) | [nerdl_restaurant_100d](https://nlp.johnsnowlabs.com/2021/12/31/nerdl_restaurant_100d_en.html) | Named Entity Recognition | NerDLModel |
| en | [en.embed.word2vec.gigaword_wiki](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_wiki_300_en.html) | [word2vec_gigaword_wiki_300](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_wiki_300_en.html) | Embeddings | Word2VecModel |
| en | [en.embed.word2vec.gigaword](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_300_en.html) | [word2vec_gigaword_300](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_300_en.html) | Embeddings | Word2VecModel |
| en | [en.classify.xlm_roberta.imdb](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_imdb_en.html) | [xlm_roberta_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_imdb_en.html) | Text Classification | XlmRoBertaForSequenceClassification |
| en | [en.classify.xlm_roberta.ag_news](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_ag_news_en.html) | [xlm_roberta_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_ag_news_en.html) | Text Classification | XlmRoBertaForSequenceClassification |
| en | [en.classify.roberta.imdb](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_imdb_en.html) | [roberta_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_imdb_en.html) | Text Classification | RoBertaForSequenceClassification |
| en | [en.classify.roberta.ag_news](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_ag_news_en.html) | [roberta_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_ag_news_en.html) | Text Classification | RoBertaForSequenceClassification |
| en | [en.classify.albert.ag_news](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_ag_news_en.html) | [albert_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_ag_news_en.html) | Text Classification | AlbertForSequenceClassification |
| en | [en.classify.albert.imdb](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_imdb_en.html) | [albert_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_imdb_en.html) | Text Classification | AlbertForSequenceClassification |
| en | [en.classify.ag_news.longformer](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | [longformer_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | Text Classification | LongformerForSequenceClassification |
| en | [en.classify.imdb.xlnet](https://nlp.johnsnowlabs.com/2021/12/23/xlnet_base_sequence_classifier_imdb_en.html) | [xlnet_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/23/xlnet_base_sequence_classifier_imdb_en.html) | Text Classification | XlnetForSequenceClassification |
| en | [en.classify.finance_sentiment](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_finbert_tone_en.html) | [bert_sequence_classifier_finbert_tone](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_finbert_tone_en.html) | Sentiment Analysis | BertForSequenceClassification |
| en | [en.classify.imdb.longformer](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_imdb_en.html) | [longformer_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_imdb_en.html) | Text Classification | LongformerForSequenceClassification |
| en | [en.classify.ag_news.longformer](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | [longformer_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | Text Classification | LongformerForSequenceClassification |
| en | [en.ner.time](https://nlp.johnsnowlabs.com/2021/12/28/roberta_token_classifier_timex_semeval_en.html) | [roberta_token_classifier_timex_semeval](https://nlp.johnsnowlabs.com/2021/12/28/roberta_token_classifier_timex_semeval_en.html) | Named Entity Recognition | RoBertaForTokenClassification |
| en | [en.ner.stocks_ticker](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_ticker_en.html) | [roberta_token_classifier_ticker](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_ticker_en.html) | Named Entity Recognition | RoBertaForTokenClassification |
| ru | [ru.classify.toxic](https://nlp.johnsnowlabs.com/2021/12/22/bert_sequence_classifier_toxicity_ru.html) | [bert_sequence_classifier_toxicity](https://nlp.johnsnowlabs.com/2021/12/22/bert_sequence_classifier_toxicity_ru.html) | Text Classification | BertForSequenceClassification |
| it | [it.classify.sentiment](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_sentiment_it.html) | [bert_sequence_classifier_sentiment](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_sentiment_it.html) | Sentiment Analysis | BertForSequenceClassification |
| es | [es.ner](https://nlp.johnsnowlabs.com/2020/02/03/wikiner_6B_100_es.html) | [wikiner_6B_100](https://nlp.johnsnowlabs.com/2020/02/03/wikiner_6B_100_es.html) | Named Entity Recognition | NerDLModel |
| is | [is.ner](https://nlp.johnsnowlabs.com/2021/12/06/roberta_token_classifier_icelandic_ner_is.html) | [roberta_token_classifier_icelandic_ner](https://nlp.johnsnowlabs.com/2021/12/06/roberta_token_classifier_icelandic_ner_is.html) | Named Entity Recognition | RoBertaForTokenClassification |
| id | [id.pos](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_pos_tagger_id.html) | [roberta_token_classifier_pos_tagger](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_pos_tagger_id.html) | Part of Speech Tagging | RoBertaForTokenClassification |
| tr | [tr.ner](https://nlp.johnsnowlabs.com/2020/11/10/turkish_ner_840B_300_tr.html) | [turkish_ner_840B_300](https://nlp.johnsnowlabs.com/2020/11/10/turkish_ner_840B_300_tr.html) | Named Entity Recognition | NerDLModel |
| id | [id.ner](https://nlp.johnsnowlabs.com/2021/12/03/xlm_roberta_large_token_classification_ner_id.html) | [xlm_roberta_large_token_classification_ner](https://nlp.johnsnowlabs.com/2021/12/03/xlm_roberta_large_token_classification_ner_id.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| de | [de.ner](https://nlp.johnsnowlabs.com/2021/12/25/xlm_roberta_large_token_classifier_conll03_de.html) | [xlm_roberta_large_token_classifier_conll03](https://nlp.johnsnowlabs.com/2021/12/25/xlm_roberta_large_token_classifier_conll03_de.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| hi | [hi.ner](https://nlp.johnsnowlabs.com/2021/12/27/bert_token_classifier_hi_en_ner_hi.html) | [bert_token_classifier_hi_en_ner](https://nlp.johnsnowlabs.com/2021/12/27/bert_token_classifier_hi_en_ner_hi.html) | Named Entity Recognition | BertForTokenClassification |
| nl | [nl.ner](https://nlp.johnsnowlabs.com/2020/05/10/wikiner_6B_100_nl.html) | [wikiner_6B_100](https://nlp.johnsnowlabs.com/2020/05/10/wikiner_6B_100_nl.html) | Named Entity Recognition | NerDLModel |
| zh | [zh.ner](https://nlp.johnsnowlabs.com/2021/12/07/bert_token_classifier_chinese_ner_zh.html) | [bert_token_classifier_chinese_ner](https://nlp.johnsnowlabs.com/2021/12/07/bert_token_classifier_chinese_ner_zh.html) | Named Entity Recognition | BertForTokenClassification |
| fr | [fr.classify.xlm_roberta.allocine](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_allocine_fr.html) | [xlm_roberta_base_sequence_classifier_allocine](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_allocine_fr.html) | Text Classification | XlmRoBertaForSequenceClassification |
| ur | [ur.classify.fakenews](https://nlp.johnsnowlabs.com/2021/12/29/classifierdl_urduvec_fakenews_ur.html) | [classifierdl_urduvec_fakenews](https://nlp.johnsnowlabs.com/2021/12/29/classifierdl_urduvec_fakenews_ur.html) | Text Classification | ClassifierDLModel |
| ur | [ur.classify.news](https://nlp.johnsnowlabs.com/2021/12/10/classifierdl_bert_news_ur.html) | [classifierdl_bert_news](https://nlp.johnsnowlabs.com/2021/12/10/classifierdl_bert_news_ur.html) | Text Classification | ClassifierDLModel |
| fi | [fi.embed_sentence.bert.uncased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | [bert_base_finnish_uncased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | Embeddings | BertSentenceEmbeddings |
| fi | [fi.embed_sentence.bert](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | [bert_base_finnish_uncased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | Embeddings | BertSentenceEmbeddings |
| fi | [fi.embed_sentence.bert.cased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_cased_fi.html) | [bert_base_finnish_cased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_cased_fi.html) | Embeddings | BertSentenceEmbeddings |
| te | [te.embed.distilbert](https://nlp.johnsnowlabs.com/2021/12/14/distilbert_uncased_te.html) | [distilbert_uncased](https://nlp.johnsnowlabs.com/2021/12/14/distilbert_uncased_te.html) | Embeddings | DistilBertEmbeddings |
| sw | [sw.embed.xlm_roberta](https://nlp.johnsnowlabs.com/2021/10/16/xlm_roberta_base_finetuned_swahili_sw.html) | [xlm_roberta_base_finetuned_swahili](https://nlp.johnsnowlabs.com/2021/10/16/xlm_roberta_base_finetuned_swahili_sw.html) | Embeddings | XlmRoBertaEmbeddings |

New Healthcare Models
Integration for the 28 new models from the amazing [Spark NLP for healthcare 3.4.0 release](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#340)

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:----------------------------|
| en | [en.med_ner.chemprot.bert](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemprot_en.html) | [bert_token_classifier_ner_chemprot](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemprot_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.med_ner.chemprot.bert](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemprot_en.html) | [bert_token_classifier_ner_chemprot](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemprot_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_bacteria](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_bacteria_en.html) | [bert_token_classifier_ner_bacteria](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_bacteria_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_bacteria](https://nlp.johnsnowlabs.com/2022/01/07/bert_token_classifier_ner_bacteria_en.html) | [bert_token_classifier_ner_bacteria](https://nlp.johnsnowlabs.com/2022/01/07/bert_token_classifier_ner_bacteria_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_anatomy](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_anatomy_en.html) | [bert_token_classifier_ner_anatomy](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_anatomy_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_anatomy](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_anatomy_en.html) | [bert_token_classifier_ner_anatomy](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_anatomy_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_drugs](https://nlp.johnsnowlabs.com/2021/09/20/bert_token_classifier_ner_drugs_en.html) | [bert_token_classifier_ner_drugs](https://nlp.johnsnowlabs.com/2021/09/20/bert_token_classifier_ner_drugs_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_drugs](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_drugs_en.html) | [bert_token_classifier_ner_drugs](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_drugs_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl_slim](https://nlp.johnsnowlabs.com/2021/09/24/bert_token_classifier_ner_jsl_slim_en.html) | [bert_token_classifier_ner_jsl_slim](https://nlp.johnsnowlabs.com/2021/09/24/bert_token_classifier_ner_jsl_slim_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl_slim](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_slim_en.html) | [bert_token_classifier_ner_jsl_slim](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_slim_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_deid](https://nlp.johnsnowlabs.com/2021/09/13/bert_token_classifier_ner_deid_en.html) | [bert_token_classifier_ner_deid](https://nlp.johnsnowlabs.com/2021/09/13/bert_token_classifier_ner_deid_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_deid](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_deid_en.html) | [bert_token_classifier_ner_deid](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_deid_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_clinical](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_clinical_en.html) | [bert_token_classifier_ner_clinical](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_clinical_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_clinical](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_clinical_en.html) | [bert_token_classifier_ner_clinical](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_clinical_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_jsl_en.html) | [bert_token_classifier_ner_jsl](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_jsl_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl](https://nlp.johnsnowlabs.com/2021/09/16/bert_token_classifier_ner_jsl_en.html) | [bert_token_classifier_ner_jsl](https://nlp.johnsnowlabs.com/2021/09/16/bert_token_classifier_ner_jsl_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_en.html) | [bert_token_classifier_ner_jsl](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_chemical](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemicals_en.html) | [bert_token_classifier_ner_chemicals](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemicals_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_chemical](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemicals_en.html) | [bert_token_classifier_ner_chemicals](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemicals_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.bionlp](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_bionlp_en.html) | [bert_token_classifier_ner_bionlp](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_bionlp_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.bionlp](https://nlp.johnsnowlabs.com/2022/01/03/bert_token_classifier_ner_bionlp_en.html) | [bert_token_classifier_ner_bionlp](https://nlp.johnsnowlabs.com/2022/01/03/bert_token_classifier_ner_bionlp_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.cellular](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_cellular_en.html) | [bert_token_classifier_ner_cellular](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_cellular_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.cellular](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_cellular_en.html) | [bert_token_classifier_ner_cellular](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_cellular_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.med_ner.abbreviation_clinical](https://nlp.johnsnowlabs.com/2021/12/30/ner_abbreviation_clinical_en.html) | [ner_abbreviation_clinical](https://nlp.johnsnowlabs.com/2021/12/30/ner_abbreviation_clinical_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.med_ner.drugprot_clinical](https://nlp.johnsnowlabs.com/2021/12/20/ner_drugprot_clinical_en.html) | [ner_drugprot_clinical](https://nlp.johnsnowlabs.com/2021/12/20/ner_drugprot_clinical_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.ner.drug_development_trials](https://nlp.johnsnowlabs.com/2021/12/17/bert_token_classifier_drug_development_trials_en.html) | [bert_token_classifier_drug_development_trials](https://nlp.johnsnowlabs.com/2021/12/17/bert_token_classifier_drug_development_trials_en.html) | Named Entity Recognition | BertForTokenClassification |
| en | [en.med_ner.chemprot](https://nlp.johnsnowlabs.com/2021/04/01/ner_chemprot_biobert_en.html) | [ner_chemprot_biobert](https://nlp.johnsnowlabs.com/2021/04/01/ner_chemprot_biobert_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.relation.drugprot](https://nlp.johnsnowlabs.com/2022/01/05/redl_drugprot_biobert_en.html) | [redl_drugprot_biobert](https://nlp.johnsnowlabs.com/2022/01/05/redl_drugprot_biobert_en.html) | Relation Extraction | RelationExtractionDLModel |
| en | [en.relation.drugprot.clinical](https://nlp.johnsnowlabs.com/2022/01/05/re_drugprot_clinical_en.html) | [re_drugprot_clinical](https://nlp.johnsnowlabs.com/2022/01/05/re_drugprot_clinical_en.html) | Relation Extraction | RelationExtractionModel |
| en | [en.resolve.clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2021/12/11/sbiobertresolve_clinical_abbreviation_acronym_en.html) | [sbiobertresolve_clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2021/12/11/sbiobertresolve_clinical_abbreviation_acronym_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2022/01/03/sbiobertresolve_clinical_abbreviation_acronym_en.html) | [sbiobertresolve_clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2022/01/03/sbiobertresolve_clinical_abbreviation_acronym_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.umls_drug_substance](https://nlp.johnsnowlabs.com/2021/12/06/sbiobertresolve_umls_drug_substance_en.html) | [sbiobertresolve_umls_drug_substance](https://nlp.johnsnowlabs.com/2021/12/06/sbiobertresolve_umls_drug_substance_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.loinc_cased](https://nlp.johnsnowlabs.com/2021/12/24/sbiobertresolve_loinc_cased_en.html) | [sbiobertresolve_loinc_cased](https://nlp.johnsnowlabs.com/2021/12/24/sbiobertresolve_loinc_cased_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.loinc_uncased](https://nlp.johnsnowlabs.com/2021/12/31/sbluebertresolve_loinc_uncased_en.html) | [sbluebertresolve_loinc_uncased](https://nlp.johnsnowlabs.com/2021/12/31/sbluebertresolve_loinc_uncased_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.embed_sentence.biobert.rxnorm](https://nlp.johnsnowlabs.com/2021/12/23/sbiobert_jsl_rxnorm_cased_en.html) | [sbiobert_jsl_rxnorm_cased](https://nlp.johnsnowlabs.com/2021/12/23/sbiobert_jsl_rxnorm_cased_en.html) | Entity Resolution | BertSentenceEmbeddings |
| en | [en.embed_sentence.bert_uncased.rxnorm](https://nlp.johnsnowlabs.com/2021/12/23/sbert_jsl_medium_rxnorm_uncased_en.html) | [sbert_jsl_medium_rxnorm_uncased](https://nlp.johnsnowlabs.com/2021/12/23/sbert_jsl_medium_rxnorm_uncased_en.html) | Embeddings | BertSentenceEmbeddings |
| en | [en.embed_sentence.bert_uncased.rxnorm](https://nlp.johnsnowlabs.com/2022/01/03/sbert_jsl_medium_rxnorm_uncased_en.html) | [sbert_jsl_medium_rxnorm_uncased](https://nlp.johnsnowlabs.com/2022/01/03/sbert_jsl_medium_rxnorm_uncased_en.html) | Embeddings | BertSentenceEmbeddings |
| en | [en.resolve.snomed_drug](https://nlp.johnsnowlabs.com/2022/01/01/sbiobertresolve_snomed_drug_en.html) | [sbiobertresolve_snomed_drug](https://nlp.johnsnowlabs.com/2022/01/01/sbiobertresolve_snomed_drug_en.html) | Entity Resolution | SentenceEntityResolverModel |
| de | [de.med_ner.deid_subentity](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_subentity_de.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_subentity_de.html) | Named Entity Recognition | MedicalNerModel |
| de | [de.med_ner.deid_generic](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_generic_de.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_generic_de.html) | Named Entity Recognition | MedicalNerModel |
| de | [de.embed.w2v](https://nlp.johnsnowlabs.com/2020/09/06/w2v_cc_300d_de.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2020/09/06/w2v_cc_300d_de.html) | Embeddings | WordEmbeddingsModel |

Additional NLU resources

* [NLU OCR tutorial notebook](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/ocr/ocr_for_img_pdf_docx_files.ipynb)
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!

1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

Nlu

Page 5 of 12