Nlu

Latest version: v5.3.2

Safety actively analyzes 631877 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 5 of 12

3.4.4

We are very excited to announce NLU 3.4.4 has been released with over 600 new models, over 75 new languages, and 155 languages covered in total,
400% speedup for tokenizers and 18x speedup of UniversalSentenceEncoder on GPU.

On the general NLP side, we have transformer-based Embeddings and Token Classifiers powered by state of the art [CamemBertEmbeddings](https://camembert-model.fr/) and [DeBertaForTokenClassification](https://arxiv.org/abs/2006.03654) based
architectures as well as various new models for
`Historical`, `Ancient`,`Dead`, `Extinct`, `Genetic`, and `Constructed` languages like
`Old Church Slavonic`, `Latin`, `Sanskrit`, `Esperanto`, `Volapük`, `Coptic`, `Nahuatl`, `Ancient Greek (to 1453)`, `Old Russian`.
On the healthcare side, we have `Portuguese De-identification Models`, have `NER` models for Gene detection and finally RxNorm Sentence resolution model for mapping and extracting pharmaceutical actions (e.g. analgesic, hypoglycemic)
as well as treatments (e.g. backache, diabetes).

For full release notes with all models see
[here](https://github.com/JohnSnowLabs/nlu/pull/119)
or [here](https://nlu.johnsnowlabs.com/docs/en/release_notes) ,



First-time language models covered
The languages for these models are covered for the very first time ever by NLU.



| Number | Language Name(s) | NLU Reference | Spark NLP Reference | Task | Annotator Class | Scope | Language Type |
|---------:|:----------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------|:---------------------|:--------------|:----------------|
|0 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sa_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sa_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Ancient|
|1 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.lemma](https://nlp.johnsnowlabs.com/2022/05/01/lemma_vedic_sa_3_0.html) | [lemma_vedic](https://nlp.johnsnowlabs.com/2022/05/01/lemma_vedic_sa_3_0.html)| Lemmatization | LemmatizerModel| Individual| Ancient|
|2 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.pos](https://nlp.johnsnowlabs.com/2022/05/01/pos_vedic_sa_3_0.html) | [pos_vedic](https://nlp.johnsnowlabs.com/2022/05/01/pos_vedic_sa_3_0.html)| Part of Speech Tagging | PerceptronModel| Individual| Ancient|
|3 | [Sanskrit](https://iso639-3.sil.org/code/san)| [sa.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sa_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sa_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Ancient|
|4 | [Volapük](https://iso639-3.sil.org/code/vol)| [vo.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vo_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vo_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Constructed|
|5 | [Nahuatl languages](https://iso639-3.sil.org/code/nah)| [nah.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nah_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nah_3_0.html)| Embeddings | WordEmbeddingsModel| Collective| Genetic|
|6 | [Aragonese](https://iso639-3.sil.org/code/arg)| [an.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_an_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_an_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|7 | [Assamese](https://iso639-3.sil.org/code/asm)| [as.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_as_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_as_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|8 | [Asturian, Asturleonese, Bable, Leonese](https://iso639-3.sil.org/code/ast)| [ast.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ast_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ast_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|9 | [Bashkir](https://iso639-3.sil.org/code/bak)| [ba.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ba_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ba_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|10 | [Bavarian](https://iso639-3.sil.org/code/bar) | [bar.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bar_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bar_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|11 | [Bishnupriya](https://iso639-3.sil.org/code/bpy) | [bpy.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bpy_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bpy_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|12 | [Burmese](https://iso639-3.sil.org/code/mya) | [my.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_my_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_my_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|13 | [Cebuano](https://iso639-3.sil.org/code/ceb) | [ceb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ceb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ceb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|14 | [Central Bikol](https://iso639-3.sil.org/code/bcl) | [bcl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bcl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_bcl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|15 | [Chechen](https://iso639-3.sil.org/code/che) | [ce.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ce_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_ce_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|16 | [Chuvash](https://iso639-3.sil.org/code/chv) | [cv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_cv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_cv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|17 | [Corsican](https://iso639-3.sil.org/code/cos) | [co.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_co_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_co_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|18 | [Dhivehi, Divehi, Maldivian](https://iso639-3.sil.org/code/div) | [dv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_dv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_dv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|19 | [Egyptian Arabic](https://iso639-3.sil.org/code/arz) | [arz.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_arz_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_arz_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|20 | [Emiliano-Romagnolo](https://iso639-3.sil.org/code/eml) | [eml.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_eml_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_eml_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|21 | [Erzya](https://iso639-3.sil.org/code/myv) | [myv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_myv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_myv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|22 | [Georgian](https://iso639-3.sil.org/code/kat) | [ka.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_ka_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_ka_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|23 | [Goan Konkani](https://iso639-3.sil.org/code/gom) | [gom.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_gom_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_gom_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|24 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.distilbert](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_jv_3_0.html) | [distilbert_embeddings_javanese_distilbert_small](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_jv_3_0.html) | Embeddings | DistilBertEmbeddings| Individual| Living|
|25 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_distilbert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_imdb_jv_3_0.html) | [distilbert_embeddings_javanese_distilbert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_javanese_distilbert_small_imdb_jv_3_0.html) | Embeddings | DistilBertEmbeddings| Individual| Living|
|26 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_roberta_small](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_jv_3_0.html) | [roberta_embeddings_javanese_roberta_small](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_jv_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|27 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_roberta_small_imdb](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_imdb_jv_3_0.html) | [roberta_embeddings_javanese_roberta_small_imdb](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_javanese_roberta_small_imdb_jv_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|28 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_bert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_imdb_jv_3_0.html) | [bert_embeddings_javanese_bert_small_imdb](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_imdb_jv_3_0.html)| Embeddings | BertEmbeddings| Individual| Living|
|29 | [Javanese](https://iso639-3.sil.org/code/jav) | [jv.embed.javanese_bert_small](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_jv_3_0.html) | [bert_embeddings_javanese_bert_small](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_javanese_bert_small_jv_3_0.html) | Embeddings | BertEmbeddings| Individual| Living|
|30 | [Kirghiz, Kyrgyz](https://iso639-3.sil.org/code/kir) | [ky.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ky_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ky_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|31 | [Letzeburgesch, Luxembourgish](https://iso639-3.sil.org/code/ltz) | [lb.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lb_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lb_3_0.html) | Stop Words Removal | StopWordsCleaner| Individual| Living|
|32 | [Letzeburgesch, Luxembourgish](https://iso639-3.sil.org/code/ltz) | [lb.lemma](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_lb_3_0.html) | [lemma_spacylookup](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_lb_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|33 | [Letzeburgesch, Luxembourgish](https://iso639-3.sil.org/code/ltz) | [lb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|34 | [Ligurian](https://iso639-3.sil.org/code/lij) | [lij.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lij_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_lij_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|35 | [Lombard](https://iso639-3.sil.org/code/lmo) | [lmo.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lmo_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_lmo_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|36 | [Low German, Low Saxon](https://iso639-3.sil.org/code/nds) | [nds.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nds_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nds_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|37 | [Macedonian](https://iso639-3.sil.org/code/mkd) | [mk.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_mk_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_mk_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|38 | [Macedonian](https://iso639-3.sil.org/code/mkd) | [mk.lemma](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_mk_3_0.html) | [lemma_spacylookup](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_mk_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|39 | [Macedonian](https://iso639-3.sil.org/code/mkd) | [mk.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mk_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mk_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|40 | [Maithili](https://iso639-3.sil.org/code/mai) | [mai.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mai_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mai_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|41 | [Manx](https://iso639-3.sil.org/code/glv) | [gv.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_gv_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_gv_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|42 | [Mazanderani](https://iso639-3.sil.org/code/mzn) | [mzn.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mzn_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mzn_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|43 | [Minangkabau](https://iso639-3.sil.org/code/min) | [min.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_min_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_min_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|44 | [Mingrelian](https://iso639-3.sil.org/code/xmf) | [xmf.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_xmf_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_xmf_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|45 | [Mirandese](https://iso639-3.sil.org/code/mwl) | [mwl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mwl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mwl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|46 | [Neapolitan](https://iso639-3.sil.org/code/nap) | [nap.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nap_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nap_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|47 | [Nepal Bhasa, Newari](https://iso639-3.sil.org/code/new) | [new.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_new_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_new_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|48 | [Northern Frisian](https://iso639-3.sil.org/code/frr) | [frr.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_frr_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_frr_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|49 | [Northern Sami](https://iso639-3.sil.org/code/sme) | [sme.lemma](https://nlp.johnsnowlabs.com/2022/05/01/lemma_giella_sme_3_0.html) | [lemma_giella](https://nlp.johnsnowlabs.com/2022/05/01/lemma_giella_sme_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|50 | [Northern Sami](https://iso639-3.sil.org/code/sme) | [sme.pos](https://nlp.johnsnowlabs.com/2022/05/01/pos_giella_sme_3_0.html) | [pos_giella](https://nlp.johnsnowlabs.com/2022/05/01/pos_giella_sme_3_0.html)| Part of Speech Tagging | PerceptronModel| Individual| Living|
|51 | [Northern Sotho, Pedi, Sepedi](https://iso639-3.sil.org/code/nso) | [nso.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nso_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_nso_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|52 | [Occitan (post 1500)](https://iso639-3.sil.org/code/oci) | [oc.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_oc_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_oc_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|53 | [Ossetian, Ossetic](https://iso639-3.sil.org/code/oss) | [os.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_os_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_os_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|54 | [Pfaelzisch](https://iso639-3.sil.org/code/pfl) | [pfl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pfl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pfl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|55 | [Piemontese](https://iso639-3.sil.org/code/pms) | [pms.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pms_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pms_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|56 | [Romansh](https://iso639-3.sil.org/code/roh) | [rm.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_rm_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_rm_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|57 | [Scots](https://iso639-3.sil.org/code/sco) | [sco.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sco_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sco_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|58 | [Sicilian](https://iso639-3.sil.org/code/scn) | [scn.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_scn_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_scn_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|59 | [Sinhala, Sinhalese](https://iso639-3.sil.org/code/sin) | [si.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_si_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_si_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|60 | [Sinhala, Sinhalese](https://iso639-3.sil.org/code/sin) | [si.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_si_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_si_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|61 | [Sundanese](https://iso639-3.sil.org/code/sun) | [su.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_su_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_su_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|62 | [Sundanese](https://iso639-3.sil.org/code/sun) | [su.embed.sundanese_roberta_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_sundanese_roberta_base_su_3_0.html) | [roberta_embeddings_sundanese_roberta_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_sundanese_roberta_base_su_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|63 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.lemma](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_tl_3_0.html) | [lemma_spacylookup](https://nlp.johnsnowlabs.com/2022/03/03/lemma_spacylookup_tl_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|64 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tl_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tl_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|65 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tl_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tl_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|66 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.embed.roberta_tagalog_large](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_large_tl_3_0.html) | [roberta_embeddings_roberta_tagalog_large](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_large_tl_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|67 | [Tagalog](https://iso639-3.sil.org/code/tgl) | [tl.embed.roberta_tagalog_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_base_tl_3_0.html) | [roberta_embeddings_roberta_tagalog_base](https://nlp.johnsnowlabs.com/2022/04/14/roberta_embeddings_roberta_tagalog_base_tl_3_0.html)| Embeddings | RoBertaEmbeddings| Individual| Living|
|68 | [Tajik](https://iso639-3.sil.org/code/tgk) | [tg.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tg_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tg_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|69 | [Tatar](https://iso639-3.sil.org/code/tat) | [tt.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tt_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tt_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|70 | [Tatar](https://iso639-3.sil.org/code/tat) | [tt.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tt_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tt_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|71 | [Tigrinya](https://iso639-3.sil.org/code/tir) | [ti.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ti_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_ti_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|72 | [Tosk Albanian](https://iso639-3.sil.org/code/als) | [als.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_als_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_als_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|73 | [Tswana](https://iso639-3.sil.org/code/tsn) | [tn.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tn_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_tn_3_0.html)| Stop Words Removal | StopWordsCleaner| Individual| Living|
|74 | [Turkmen](https://iso639-3.sil.org/code/tuk) | [tk.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tk_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_tk_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|75 | [Upper Sorbian](https://iso639-3.sil.org/code/hsb) | [hsb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_hsb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_hsb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|76 | [Venetian](https://iso639-3.sil.org/code/vec) | [vec.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vec_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vec_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|77 | [Vlaams](https://iso639-3.sil.org/code/vls) | [vls.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vls_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_vls_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|78 | [Walloon](https://iso639-3.sil.org/code/wln) | [wa.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_wa_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_wa_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|79 | [Waray (Philippines)](https://iso639-3.sil.org/code/war) | [war.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_war_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_war_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|80 | [Western Armenian](https://iso639-3.sil.org/code/hyw) | [hyw.pos](https://nlp.johnsnowlabs.com/2022/05/01/pos_armtdp_hyw_3_0.html) | [pos_armtdp](https://nlp.johnsnowlabs.com/2022/05/01/pos_armtdp_hyw_3_0.html)| Part of Speech Tagging | PerceptronModel| Individual| Living|
|81 | [Western Armenian](https://iso639-3.sil.org/code/hyw) | [hyw.lemma](https://nlp.johnsnowlabs.com/2022/05/01/lemma_armtdp_hyw_3_0.html) | [lemma_armtdp](https://nlp.johnsnowlabs.com/2022/05/01/lemma_armtdp_hyw_3_0.html)| Lemmatization | LemmatizerModel| Individual| Living|
|82 | [Western Frisian](https://iso639-3.sil.org/code/fry) | [fy.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_fy_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_fy_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|83 | [Western Panjabi](https://iso639-3.sil.org/code/pnb) | [pnb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pnb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_pnb_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|84 | [Yakut](https://iso639-3.sil.org/code/sah) | [sah.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sah_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sah_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|85 | [Zeeuws](https://iso639-3.sil.org/code/zea) | [zea.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_zea_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_zea_3_0.html)| Embeddings | WordEmbeddingsModel| Individual| Living|
|86 | [Albanian](https://iso639-3.sil.org/code/sqi) | [sq.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sq_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_sq_3_0.html)| Stop Words Removal | StopWordsCleaner| Macrolanguage| Living|
|87 | [Albanian](https://iso639-3.sil.org/code/sqi) | [sq.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_sq_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_sq_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|88 | [Azerbaijani](https://iso639-3.sil.org/code/aze) | [az.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_az_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/14/w2v_cc_300d_az_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|89 | [Azerbaijani](https://iso639-3.sil.org/code/aze) | [az.stopwords](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_az_3_0.html) | [stopwords_iso](https://nlp.johnsnowlabs.com/2022/03/07/stopwords_iso_az_3_0.html)| Stop Words Removal | StopWordsCleaner| Macrolanguage| Living|
|90 | [Malagasy](https://iso639-3.sil.org/code/mlg) | [mg.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mg_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mg_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|91 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.albert](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_large_bahasa_cased_ms_3_0.html) | [albert_embeddings_albert_large_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_large_bahasa_cased_ms_3_0.html) | Embeddings | AlbertEmbeddings| Macrolanguage| Living|
|92 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.distilbert](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_malaysian_distilbert_small_ms_3_0.html) | [distilbert_embeddings_malaysian_distilbert_small](https://nlp.johnsnowlabs.com/2022/04/12/distilbert_embeddings_malaysian_distilbert_small_ms_3_0.html)| Embeddings | DistilBertEmbeddings| Macrolanguage| Living|
|93 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.albert_tiny_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_tiny_bahasa_cased_ms_3_0.htm)l| [albert_embeddings_albert_tiny_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_tiny_bahasa_cased_ms_3_0.html)| Embeddings | AlbertEmbeddings| Macrolanguage| Living|
|94 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.albert_base_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_base_bahasa_cased_ms_3_0.html)| [albert_embeddings_albert_base_bahasa_cased](https://nlp.johnsnowlabs.com/2022/04/14/albert_embeddings_albert_base_bahasa_cased_ms_3_0.html)| Embeddings | AlbertEmbeddings| Macrolanguage| Living|
|95 | [Malay (macrolanguage)](https://iso639-3.sil.org/code/msa) | [ms.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ms_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ms_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|96 | [Mongolian](https://iso639-3.sil.org/code/mon) | [mn.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mn_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_mn_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|97 | [Oriya (macrolanguage)](https://iso639-3.sil.org/code/ori) | [or.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_or_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_or_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|98 | [Pashto, Pushto](https://iso639-3.sil.org/code/pus) | [ps.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ps_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_ps_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|99 | [Quechua](https://iso639-3.sil.org/code/que) | [qu.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_qu_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_qu_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|100 | [Sardinian](https://iso639-3.sil.org/code/srd) | [sc.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sc_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sc_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|101 | [Serbo-Croatian](https://iso639-3.sil.org/code/sh) | [sh.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sh_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_sh_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|
|102 | [Uzbek](https://iso639-3.sil.org/code/uzb) | [uz.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_uz_3_0.html)| [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_uz_3_0.html)| Embeddings | WordEmbeddingsModel| Macrolanguage| Living|





All Healthcare models
Powered by the amazing
[Spark NLP for Healthcare 3.5.2](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#352) and
[Spark NLP for Healthcare 3.5.1](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#351) releases.

| Number | NLU Reference | Spark NLP Reference | Task | Language Name(s) | Annotator Class | Language Type | Scope |
|---------:|:-----------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:--------------------------------------------------|:-------------------------------|:--------------|:-----------|
|0 | [en.med_ner.biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/10/ner_biomedical_bc2gm_en_3_0.html) | [ner_biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/10/ner_biomedical_bc2gm_en_3_0.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalNerModel | Living | Individual |
|1 | [en.med_ner.biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/11/ner_biomedical_bc2gm_en_2_4.html) | [ner_biomedical_bc2gm](https://nlp.johnsnowlabs.com/2022/05/11/ner_biomedical_bc2gm_en_2_4.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalNerModel | Living | Individual |
|2 | [en.resolve.rxnorm_action_treatment](https://nlp.johnsnowlabs.com/2022/04/25/sbiobertresolve_rxnorm_action_treatment_en_2_4.html) | [sbiobertresolve_rxnorm_action_treatment](https://nlp.johnsnowlabs.com/2022/04/25/sbiobertresolve_rxnorm_action_treatment_en_2_4.html) | Entity Resolution | [English](https://iso639-3.sil.org/code/eng) | SentenceEntityResolverModel | Living | Individual |
|3 | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalBertForTokenClassifier | Living | Individual |
|4 | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | [English](https://iso639-3.sil.org/code/eng) | MedicalBertForTokenClassifier | Living | Individual |
|5 | [pt.med_ner.deid.subentity](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_subentity_pt_3_0.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_subentity_pt_3_0.html) | De-identification | [Portuguese](https://iso639-3.sil.org/code/por) | MedicalNerModel | Living | Individual |
|6 | [pt.med_ner.deid.generic](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | De-identification | [Portuguese](https://iso639-3.sil.org/code/por) | MedicalNerModel | Living | Individual |
|7 | [pt.med_ner.deid](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/04/13/ner_deid_generic_pt_3_0.html) | De-identification | [Portuguese](https://iso639-3.sil.org/code/por) | MedicalNerModel | Living | Individual |


See next comment for more details

3.4.3

We are very excited to announce NLU 3.4.3 has been released!

This release features new models for `Zero-Shot-Relation-Extraction`, [DeBERTa for Sequence Classification](https://arxiv.org/abs/2006.03654),
`Deidentification` in `French` and `Italian` and
Lemmatizers, Parts of Speech Taggers, and Word2Vec Embeddings for over `66 languages`, with 20 languages being covered
for the first time by NLU, including ancient and exotic languages like `Ancient Greek`, `Old Russian`,
`Old French` and much more. Once again we would like to thank our community to make this release possible.



NLU for Healthcare


On the healthcare NLP side, a new `ZeroShotRelationExtractionModel` is available, which can extract relations between
clinical entities in an unsupervised fashion, no training required!
Additionally, New French and Italian Deidentification models are available for clinical and healthcare domains.
Powerd by the fantastic [ Spark NLP for helathcare 3.5.0 release](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes)

Zero-Shot Relation Extraction

Zero-shot Relation Extraction to extract relations between clinical entities with no training dataset

python
import nlu

pipe = nlu.load('med_ner.clinical relation.zeroshot_biobert')
Configure relations to extract
pipe['zero_shot_relation_extraction'].setRelationalCategories({
"CURE": ["{{TREATMENT}} cures {{PROBLEM}}."],
"IMPROVE": ["{{TREATMENT}} improves {{PROBLEM}}.", "{{TREATMENT}} cures {{PROBLEM}}."],
"REVEAL": ["{{TEST}} reveals {{PROBLEM}}."]})
.setMultiLabel(False)
df = pipe.predict("Paracetamol can alleviate headache or sickness. An MRI test can be used to find cancer.")
df[
'relation', 'relation_confidence', 'relation_entity1', 'relation_entity1_class', 'relation_entity2', 'relation_entity2_class',]
Results in following table :


| relation | relation_confidence | relation_entity1 | relation_entity1_class | relation_entity2 | relation_entity2_class |
|:-----------|----------------------:|:-------------------|:-------------------------|:-------------------|:-------------------------|
| REVEAL | 0.976004 | An MRI test | TEST | cancer | PROBLEM |
| IMPROVE | 0.988195 | Paracetamol | TREATMENT | sickness | PROBLEM |
| IMPROVE | 0.992962 | Paracetamol | TREATMENT | headache | PROBLEM |

New Healthcare Models overview

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:--------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------|:-------------------------|:--------------------------------|
| en | [en.relation.zeroshot_biobert](https://nlp.johnsnowlabs.com/2022/04/05/re_zeroshot_biobert_en_3_0.html) | [re_zeroshot_biobert](https://nlp.johnsnowlabs.com/2022/04/05/re_zeroshot_biobert_en_3_0.html) | Relation Extraction | ZeroShotRelationExtractionModel |
| fr | [fr.med_ner.deid_generic](https://nlp.johnsnowlabs.com/2022/02/11/ner_deid_generic_fr.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/02/11/ner_deid_generic_fr.html) | De-identification | MedicalNerModel |
| fr | [fr.med_ner.deid_subentity](https://nlp.johnsnowlabs.com/2022/02/14/ner_deid_subentity_fr.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/02/14/ner_deid_subentity_fr.html) | De-identification | MedicalNerModel |
| it | [it.med_ner.deid_generic](https://nlp.johnsnowlabs.com/2022/03/25/ner_deid_generic_it_3_0.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/03/25/ner_deid_generic_it_3_0.html) | Named Entity Recognition | MedicalNerModel |
| it | [it.med_ner.deid_subentity](https://nlp.johnsnowlabs.com/2022/03/22/ner_deid_subentity_it_3_0.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/03/22/ner_deid_subentity_it_3_0.html) | Named Entity Recognition | MedicalNerModel |

NLU general

On the general NLP side we have new transformer based `DeBERTa v3 sequence classifiers` models fine-tuned in Urdu, French and English for
Sentiment and News classification. Additionally, 100+ Part Of Speech Taggers and Lemmatizers for 66 Languages and for 7
languages new word2vec embeddings, including `hi`,`azb`,`bo`,`diq`,`cy`,`es`,`it`,
powered by the amazing [Spark NLP 3.4.3 release](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.4.3)


New Languages covered:
First time languages covered by NLU are :
`South Azerbaijani`, `Tibetan`, `Dimli`, `Central Kurdish`, `Southern Altai`,
`Scottish Gaelic`,`Faroese`,`Literary Chinese`,`Ancient Greek`,
`Gothic`, `Old Russian`, `Church Slavic`,
`Old French`,`Uighur`,`Coptic`,`Croatian`, `Belarusian`, `Serbian`

and their respective ISO-639-3 and ISO 630-2 codes are :
`azb`,`bo`,`diq`,`ckb`, `lt` `gd`, `fo`,`lzh`,`grc`,`got`,`orv`,`cu`,`fro`,`qtd`,`ug`,`cop`,`hr`,`be`,`qhe`,`sr`

New NLP Models Overview


| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------|:---------------------------------|
| en | [en.classify.sentiment.imdb.deberta](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_xsmall_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.sentiment.imdb.deberta.small](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_small_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.sentiment.imdb.deberta.base](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_base_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_base_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.sentiment.imdb.deberta.large](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_large_sequence_classifier_imdb_en_3_0.html) | [deberta_v3_large_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_large_sequence_classifier_imdb_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.news.deberta](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_ag_news_en_3_0.html) | [deberta_v3_xsmall_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_xsmall_sequence_classifier_ag_news_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| en | [en.classify.news.deberta.small](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_ag_news_en_3_0.html) | [deberta_v3_small_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2022/04/09/deberta_v3_small_sequence_classifier_ag_news_en_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| ur | [ur.classify.sentiment.imdb](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_imdb_ur_3_0.html) | [mdeberta_v3_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_imdb_ur_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| fr | [fr.classify.allocine](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_allocine_fr_3_0.html) | [mdeberta_v3_base_sequence_classifier_allocine](https://nlp.johnsnowlabs.com/2022/04/09/mdeberta_v3_base_sequence_classifier_allocine_fr_3_0.html) | Text Classification | DeBertaForSequenceClassification |
| ur | [ur.embed.bert_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ur_cased_ur_3_0.html) | [bert_embeddings_bert_base_ur_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ur_cased_ur_3_0.html) | Embeddings | BertEmbeddings |
| fr | [fr.embed.bert_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_fr_3_0.html) | [bert_embeddings_bert_base_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_fr_3_0.html) | Embeddings | BertEmbeddings |
| de | [de.embed.medbert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_German_MedBERT_de_3_0.html) | [bert_embeddings_German_MedBERT](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_German_MedBERT_de_3_0.html) | Embeddings | BertEmbeddings |
| ar | [ar.embed.arbert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_ARBERT_ar_3_0.html) | [bert_embeddings_ARBERT](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_ARBERT_ar_3_0.html) | Embeddings | BertEmbeddings |
| bn | [bn.embed.bangala_bert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bangla_bert_base_bn_3_0.html) | [bert_embeddings_bangla_bert_base](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bangla_bert_base_bn_3_0.html) | Embeddings | BertEmbeddings |
| zh | [zh.embed.bert_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_zh_3_0.html) | [bert_embeddings_bert_base_5lang_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_5lang_cased_zh_3_0.html) | Embeddings | BertEmbeddings |
| hi | [hi.embed.bert_hi_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_hi_cased_hi_3_0.html) | [bert_embeddings_bert_base_hi_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_hi_cased_hi_3_0.html) | Embeddings | BertEmbeddings |
| it | [it.embed.bert_it_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_it_cased_it_3_0.html) | [bert_embeddings_bert_base_it_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_it_cased_it_3_0.html) | Embeddings | BertEmbeddings |
| ko | [ko.embed.bert](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ko_3_0.html) | [bert_embeddings_bert_base](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_ko_3_0.html) | Embeddings | BertEmbeddings |
| tr | [tr.embed.bert_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_tr_cased_tr_3_0.html) | [bert_embeddings_bert_base_tr_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_tr_cased_tr_3_0.html) | Embeddings | BertEmbeddings |
| vi | [vi.embed.bert_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_vi_cased_vi_3_0.html) | [bert_embeddings_bert_base_vi_cased](https://nlp.johnsnowlabs.com/2022/04/11/bert_embeddings_bert_base_vi_cased_vi_3_0.html) | Embeddings | BertEmbeddings |
| hif | [hif.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_hif_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/15/w2v_cc_300d_hif_3_0.html) | Embeddings | WordEmbeddingsModel |
| azb | [azb.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_azb_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_azb_3_0.html) | Embeddings | WordEmbeddingsModel |
| bo | [bo.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_bo_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_bo_3_0.html) | Embeddings | WordEmbeddingsModel |
| diq | [diq.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_diq_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_diq_3_0.html) | Embeddings | WordEmbeddingsModel |
| cy | [cy.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_cy_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_cy_3_0.html) | Embeddings | WordEmbeddingsModel |
| es | [es.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_es_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/16/w2v_cc_300d_es_3_0.html) | Embeddings | WordEmbeddingsModel |
| it | [it.embed.word2vec](https://nlp.johnsnowlabs.com/2022/03/07/w2v_cc_300d_it_3_0.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/03/07/w2v_cc_300d_it_3_0.html) | Embeddings | WordEmbeddingsModel |
| af | [af.lemma](https://nlp.johnsnowlabs.com/2021/04/02/lemma_af.html) | [lemma](https://nlp.johnsnowlabs.com/2021/04/02/lemma_af.html) | Lemmatization | LemmatizerModel |
| lt | [lt.lemma](https://nlp.johnsnowlabs.com/2022/03/31/lemma_alksnis_lt_3_0.html) | [lemma_alksnis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_alksnis_lt_3_0.html) | Lemmatization | LemmatizerModel |
| nl | [nl.lemma](https://nlp.johnsnowlabs.com/2020/05/03/lemma_nl.html) | [lemma](https://nlp.johnsnowlabs.com/2020/05/03/lemma_nl.html) | Lemmatization | LemmatizerModel |
| gd | [gd.lemma](https://nlp.johnsnowlabs.com/2022/03/31/lemma_arcosg_gd_3_0.html) | [lemma_arcosg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_arcosg_gd_3_0.html) | Lemmatization | LemmatizerModel |
| es | [es.lemma](https://nlp.johnsnowlabs.com/2020/02/16/lemma_es.html) | [lemma](https://nlp.johnsnowlabs.com/2020/02/16/lemma_es.html) | Lemmatization | LemmatizerModel |
| ca | [ca.lemma](https://nlp.johnsnowlabs.com/2020/07/29/lemma_ca.html) | [lemma](https://nlp.johnsnowlabs.com/2020/07/29/lemma_ca.html) | Lemmatization | LemmatizerModel |
| el | [el.lemma.gdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gdt_el_3_0.html) | [lemma_gdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gdt_el_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_en_3_0.html) | [lemma_atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_en_3_0.html) | Lemmatization | LemmatizerModel |
| tr | [tr.lemma.boun](https://nlp.johnsnowlabs.com/2022/03/31/lemma_boun_tr_3_0.html) | [lemma_boun](https://nlp.johnsnowlabs.com/2022/03/31/lemma_boun_tr_3_0.html) | Lemmatization | LemmatizerModel |
| da | [da.lemma.ddt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ddt_da_3_0.html) | [lemma_ddt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ddt_da_3_0.html) | Lemmatization | LemmatizerModel |
| cs | [cs.lemma.cac](https://nlp.johnsnowlabs.com/2022/03/31/lemma_cac_cs_3_0.html) | [lemma_cac](https://nlp.johnsnowlabs.com/2022/03/31/lemma_cac_cs_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.esl](https://nlp.johnsnowlabs.com/2022/03/31/lemma_esl_en_3_0.html) | [lemma_esl](https://nlp.johnsnowlabs.com/2022/03/31/lemma_esl_en_3_0.html) | Lemmatization | LemmatizerModel |
| bg | [bg.lemma.btb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_btb_bg_3_0.html) | [lemma_btb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_btb_bg_3_0.html) | Lemmatization | LemmatizerModel |
| id | [id.lemma.csui](https://nlp.johnsnowlabs.com/2022/03/31/lemma_csui_id_3_0.html) | [lemma_csui](https://nlp.johnsnowlabs.com/2022/03/31/lemma_csui_id_3_0.html) | Lemmatization | LemmatizerModel |
| gl | [gl.lemma.ctg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ctg_gl_3_0.html) | [lemma_ctg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ctg_gl_3_0.html) | Lemmatization | LemmatizerModel |
| cy | [cy.lemma.ccg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ccg_cy_3_0.html) | [lemma_ccg](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ccg_cy_3_0.html) | Lemmatization | LemmatizerModel |
| fo | [fo.lemma.farpahc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_farpahc_fo_3_0.html) | [lemma_farpahc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_farpahc_fo_3_0.html) | Lemmatization | LemmatizerModel |
| tr | [tr.lemma.atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_tr_3_0.html) | [lemma_atis](https://nlp.johnsnowlabs.com/2022/03/31/lemma_atis_tr_3_0.html) | Lemmatization | LemmatizerModel |
| ga | [ga.lemma.idt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_idt_ga_3_0.html) | [lemma_idt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_idt_ga_3_0.html) | Lemmatization | LemmatizerModel |
| ja | [ja.lemma.gsdluw](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdluw_ja_3_0.html) | [lemma_gsdluw](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdluw_ja_3_0.html) | Lemmatization | LemmatizerModel |
| es | [es.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_es_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_es_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.gum](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gum_en_3_0.html) | [lemma_gum](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gum_en_3_0.html) | Lemmatization | LemmatizerModel |
| zh | [zh.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_zh_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_zh_3_0.html) | Lemmatization | LemmatizerModel |
| lv | [lv.lemma.lvtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lvtb_lv_3_0.html) | [lemma_lvtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lvtb_lv_3_0.html) | Lemmatization | LemmatizerModel |
| hi | [hi.lemma.hdtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdtb_hi_3_0.html) | [lemma_hdtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdtb_hi_3_0.html) | Lemmatization | LemmatizerModel |
| pt | [pt.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_pt_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_pt_3_0.html) | Lemmatization | LemmatizerModel |
| de | [de.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_de_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_de_3_0.html) | Lemmatization | LemmatizerModel |
| nl | [nl.lemma.lassysmall](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lassysmall_nl_3_0.html) | [lemma_lassysmall](https://nlp.johnsnowlabs.com/2022/03/31/lemma_lassysmall_nl_3_0.html) | Lemmatization | LemmatizerModel |
| lzh | [lzh.lemma.kyoto](https://nlp.johnsnowlabs.com/2022/03/31/lemma_kyoto_lzh_3_0.html) | [lemma_kyoto](https://nlp.johnsnowlabs.com/2022/03/31/lemma_kyoto_lzh_3_0.html) | Lemmatization | LemmatizerModel |
| zh | [zh.lemma.gsdsimp](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdsimp_zh_3_0.html) | [lemma_gsdsimp](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsdsimp_zh_3_0.html) | Lemmatization | LemmatizerModel |
| he | [he.lemma.htb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_htb_he_3_0.html) | [lemma_htb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_htb_he_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_fr_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_fr_3_0.html) | Lemmatization | LemmatizerModel |
| ro | [ro.lemma.nonstandard](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nonstandard_ro_3_0.html) | [lemma_nonstandard](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nonstandard_ro_3_0.html) | Lemmatization | LemmatizerModel |
| ja | [ja.lemma.gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_ja_3_0.html) | [lemma_gsd](https://nlp.johnsnowlabs.com/2022/03/31/lemma_gsd_ja_3_0.html) | Lemmatization | LemmatizerModel |
| it | [it.lemma.isdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_isdt_it_3_0.html) | [lemma_isdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_isdt_it_3_0.html) | Lemmatization | LemmatizerModel |
| de | [de.lemma.hdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdt_de_3_0.html) | [lemma_hdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_hdt_de_3_0.html) | Lemmatization | LemmatizerModel |
| is | [is.lemma.modern](https://nlp.johnsnowlabs.com/2022/03/31/lemma_modern_is_3_0.html) | [lemma_modern](https://nlp.johnsnowlabs.com/2022/03/31/lemma_modern_is_3_0.html) | Lemmatization | LemmatizerModel |
| la | [la.lemma.ittb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ittb_la_3_0.html) | [lemma_ittb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ittb_la_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_fr_3_0.html) | [lemma_partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_fr_3_0.html) | Lemmatization | LemmatizerModel |
| pcm | [pcm.lemma.nsc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nsc_pcm_3_0.html) | [lemma_nsc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nsc_pcm_3_0.html) | Lemmatization | LemmatizerModel |
| pl | [pl.lemma.pdb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdb_pl_3_0.html) | [lemma_pdb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdb_pl_3_0.html) | Lemmatization | LemmatizerModel |
| grc | [grc.lemma.perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_grc_3_0.html) | [lemma_perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_grc_3_0.html) | Lemmatization | LemmatizerModel |
| cs | [cs.lemma.pdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdt_cs_3_0.html) | [lemma_pdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_pdt_cs_3_0.html) | Lemmatization | LemmatizerModel |
| fa | [fa.lemma.perdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perdt_fa_3_0.html) | [lemma_perdt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perdt_fa_3_0.html) | Lemmatization | LemmatizerModel |
| got | [got.lemma.proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_got_3_0.html) | [lemma_proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_got_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.rhapsodie](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rhapsodie_fr_3_0.html) | [lemma_rhapsodie](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rhapsodie_fr_3_0.html) | Lemmatization | LemmatizerModel |
| it | [it.lemma.partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_it_3_0.html) | [lemma_partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_it_3_0.html) | Lemmatization | LemmatizerModel |
| en | [en.lemma.partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_en_3_0.html) | [lemma_partut](https://nlp.johnsnowlabs.com/2022/03/31/lemma_partut_en_3_0.html) | Lemmatization | LemmatizerModel |
| no | [no.lemma.nynorsklia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nynorsklia_no_3_0.html) | [lemma_nynorsklia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_nynorsklia_no_3_0.html) | Lemmatization | LemmatizerModel |
| orv | [orv.lemma.rnc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rnc_orv_3_0.html) | [lemma_rnc](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rnc_orv_3_0.html) | Lemmatization | LemmatizerModel |
| cu | [cu.lemma.proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_cu_3_0.html) | [lemma_proiel](https://nlp.johnsnowlabs.com/2022/03/31/lemma_proiel_cu_3_0.html) | Lemmatization | LemmatizerModel |
| la | [la.lemma.perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_la_3_0.html) | [lemma_perseus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_perseus_la_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.parisstories](https://nlp.johnsnowlabs.com/2022/03/31/lemma_parisstories_fr_3_0.html) | [lemma_parisstories](https://nlp.johnsnowlabs.com/2022/03/31/lemma_parisstories_fr_3_0.html) | Lemmatization | LemmatizerModel |
| fro | [fro.lemma.srcmf](https://nlp.johnsnowlabs.com/2022/03/31/lemma_srcmf_fro_3_0.html) | [lemma_srcmf](https://nlp.johnsnowlabs.com/2022/03/31/lemma_srcmf_fro_3_0.html) | Lemmatization | LemmatizerModel |
| vi | [vi.lemma.vtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vtb_vi_3_0.html) | [lemma_vtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vtb_vi_3_0.html) | Lemmatization | LemmatizerModel |
| qtd | [qtd.lemma.sagt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sagt_qtd_3_0.html) | [lemma_sagt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sagt_qtd_3_0.html) | Lemmatization | LemmatizerModel |
| ro | [ro.lemma.rrt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rrt_ro_3_0.html) | [lemma_rrt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_rrt_ro_3_0.html) | Lemmatization | LemmatizerModel |
| hu | [hu.lemma.szeged](https://nlp.johnsnowlabs.com/2022/03/31/lemma_szeged_hu_3_0.html) | [lemma_szeged](https://nlp.johnsnowlabs.com/2022/03/31/lemma_szeged_hu_3_0.html) | Lemmatization | LemmatizerModel |
| ug | [ug.lemma.udt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udt_ug_3_0.html) | [lemma_udt](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udt_ug_3_0.html) | Lemmatization | LemmatizerModel |
| wo | [wo.lemma.wtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_wtb_wo_3_0.html) | [lemma_wtb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_wtb_wo_3_0.html) | Lemmatization | LemmatizerModel |
| cop | [cop.lemma.scriptorium](https://nlp.johnsnowlabs.com/2022/03/31/lemma_scriptorium_cop_3_0.html) | [lemma_scriptorium](https://nlp.johnsnowlabs.com/2022/03/31/lemma_scriptorium_cop_3_0.html) | Lemmatization | LemmatizerModel |
| ru | [ru.lemma.syntagrus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_syntagrus_ru_3_0.html) | [lemma_syntagrus](https://nlp.johnsnowlabs.com/2022/03/31/lemma_syntagrus_ru_3_0.html) | Lemmatization | LemmatizerModel |
| ru | [ru.lemma.taiga](https://nlp.johnsnowlabs.com/2022/03/31/lemma_taiga_ru_3_0.html) | [lemma_taiga](https://nlp.johnsnowlabs.com/2022/03/31/lemma_taiga_ru_3_0.html) | Lemmatization | LemmatizerModel |
| fr | [fr.lemma.sequoia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sequoia_fr_3_0.html) | [lemma_sequoia](https://nlp.johnsnowlabs.com/2022/03/31/lemma_sequoia_fr_3_0.html) | Lemmatization | LemmatizerModel |
| la | [la.lemma.udante](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udante_la_3_0.html) | [lemma_udante](https://nlp.johnsnowlabs.com/2022/03/31/lemma_udante_la_3_0.html) | Lemmatization | LemmatizerModel |
| ro | [ro.lemma.simonero](https://nlp.johnsnowlabs.com/2022/03/31/lemma_simonero_ro_3_0.html) | [lemma_simonero](https://nlp.johnsnowlabs.com/2022/03/31/lemma_simonero_ro_3_0.html) | Lemmatization | LemmatizerModel |
| it | [it.lemma.vit](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vit_it_3_0.html) | [lemma_vit](https://nlp.johnsnowlabs.com/2022/03/31/lemma_vit_it_3_0.html) | Lemmatization | LemmatizerModel |
| hr | [hr.lemma.set](https://nlp.johnsnowlabs.com/2022/03/31/lemma_set_hr_3_0.html) | [lemma_set](https://nlp.johnsnowlabs.com/2022/03/31/lemma_set_hr_3_0.html) | Lemmatization | LemmatizerModel |
| fa | [fa.lemma.seraji](https://nlp.johnsnowlabs.com/2022/03/31/lemma_seraji_fa_3_0.html) | [lemma_seraji](https://nlp.johnsnowlabs.com/2022/03/31/lemma_seraji_fa_3_0.html) | Lemmatization | LemmatizerModel |
| tr | [tr.lemma.tourism](https://nlp.johnsnowlabs.com/2022/03/31/lemma_tourism_tr_3_0.html) | [lemma_tourism](https://nlp.johnsnowlabs.com/2022/03/31/lemma_tourism_tr_3_0.html) | Lemmatization | LemmatizerModel |
| ta | [ta.lemma.ttb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ttb_ta_3_0.html) | [lemma_ttb](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ttb_ta_3_0.html) | Lemmatization | LemmatizerModel |
| sl | [sl.lemma.ssj](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ssj_sl_3_0.html) | [lemma_ssj](https://nlp.johnsnowlabs.com/2022/03/31/lemma_ssj_sl_3_0.html) | Lemmatization | LemmatizerModel |
| sv | [sv.lemma.talbanken](https://nlp.johnsnowlabs.com/2022/03/31/lemma_talbanken_sv_3_0.html) | [lemma_talbanken](https://nlp.johnsnowlabs.com/2022/03/31/lemma_talbanken_sv_3_0.html) | Lemmatization | LemmatizerModel |
| uk | [uk.lemma.iu](https://nlp.johnsnowlabs.com/2022/03/31/lemma_iu_uk_3_0.html) | [lemma_iu](https://nlp.johnsnowlabs.com/2022/03/31/lemma_iu_uk_3_0.html) | Lemmatization | LemmatizerModel |
| te | [te.pos](https://nlp.johnsnowlabs.com/2021/03/10/pos_mtg_te.html) | [pos_mtg](https://nlp.johnsnowlabs.com/2021/03/10/pos_mtg_te.html) | Part of Speech Tagging | PerceptronModel |
| te | [te.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_mtg_te_3_0.html) | [pos_mtg](https://nlp.johnsnowlabs.com/2022/04/01/pos_mtg_te_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ta | [ta.pos](https://nlp.johnsnowlabs.com/2021/03/10/pos_ttb_ta.html) | [pos_ttb](https://nlp.johnsnowlabs.com/2021/03/10/pos_ttb_ta.html) | Part of Speech Tagging | PerceptronModel |
| ta | [ta.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_ttb_ta_3_0.html) | [pos_ttb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ttb_ta_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos](https://nlp.johnsnowlabs.com/2020/05/04/pos_ud_pdt_cs.html) | [pos_ud_pdt](https://nlp.johnsnowlabs.com/2020/05/04/pos_ud_pdt_cs.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos](https://nlp.johnsnowlabs.com/2021/03/08/pos_ud_pdt_cs.html) | [pos_ud_pdt](https://nlp.johnsnowlabs.com/2021/03/08/pos_ud_pdt_cs.html) | Part of Speech Tagging | PerceptronModel |
| bg | [bg.pos](https://nlp.johnsnowlabs.com/2021/03/23/pos_btb_bg.html) | [pos_btb](https://nlp.johnsnowlabs.com/2021/03/23/pos_btb_bg.html) | Part of Speech Tagging | PerceptronModel |
| bg | [bg.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_btb_bg_3_0.html) | [pos_btb](https://nlp.johnsnowlabs.com/2022/04/01/pos_btb_bg_3_0.html) | Part of Speech Tagging | PerceptronModel |
| af | [af.pos](https://nlp.johnsnowlabs.com/2021/03/16/pos_afribooms_af.html) | [pos_afribooms](https://nlp.johnsnowlabs.com/2021/03/16/pos_afribooms_af.html) | Part of Speech Tagging | PerceptronModel |
| af | [af.pos](https://nlp.johnsnowlabs.com/2021/04/06/pos_afribooms_af.html) | [pos_afribooms](https://nlp.johnsnowlabs.com/2021/04/06/pos_afribooms_af.html) | Part of Speech Tagging | PerceptronModel |
| af | [af.pos](https://nlp.johnsnowlabs.com/2022/04/01/pos_afribooms_af_3_0.html) | [pos_afribooms](https://nlp.johnsnowlabs.com/2022/04/01/pos_afribooms_af_3_0.html) | Part of Speech Tagging | PerceptronModel |
| es | [es.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_es_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_es_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_en_3_0.html) | [pos_ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| gd | [gd.pos.arcosg](https://nlp.johnsnowlabs.com/2022/04/01/pos_arcosg_gd_3_0.html) | [pos_arcosg](https://nlp.johnsnowlabs.com/2022/04/01/pos_arcosg_gd_3_0.html) | Part of Speech Tagging | PerceptronModel |
| el | [el.pos.gdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_gdt_el_3_0.html) | [pos_gdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_gdt_el_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hy | [hy.pos.armtdp](https://nlp.johnsnowlabs.com/2022/04/01/pos_armtdp_hy_3_0.html) | [pos_armtdp](https://nlp.johnsnowlabs.com/2022/04/01/pos_armtdp_hy_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pt | [pt.pos.bosque](https://nlp.johnsnowlabs.com/2022/04/01/pos_bosque_pt_3_0.html) | [pos_bosque](https://nlp.johnsnowlabs.com/2022/04/01/pos_bosque_pt_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.framenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_framenet_tr_3_0.html) | [pos_framenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_framenet_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos.cltt](https://nlp.johnsnowlabs.com/2022/04/01/pos_cltt_cs_3_0.html) | [pos_cltt](https://nlp.johnsnowlabs.com/2022/04/01/pos_cltt_cs_3_0.html) | Part of Speech Tagging | PerceptronModel |
| eu | [eu.pos.bdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_bdt_eu_3_0.html) | [pos_bdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_bdt_eu_3_0.html) | Part of Speech Tagging | PerceptronModel |
| et | [et.pos.ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_et_3_0.html) | [pos_ewt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ewt_et_3_0.html) | Part of Speech Tagging | PerceptronModel |
| da | [da.pos.ddt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ddt_da_3_0.html) | [pos_ddt](https://nlp.johnsnowlabs.com/2022/04/01/pos_ddt_da_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cy | [cy.pos.ccg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ccg_cy_3_0.html) | [pos_ccg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ccg_cy_3_0.html) | Part of Speech Tagging | PerceptronModel |
| lt | [lt.pos.alksnis](https://nlp.johnsnowlabs.com/2022/04/01/pos_alksnis_lt_3_0.html) | [pos_alksnis](https://nlp.johnsnowlabs.com/2022/04/01/pos_alksnis_lt_3_0.html) | Part of Speech Tagging | PerceptronModel |
| nl | [nl.pos.alpino](https://nlp.johnsnowlabs.com/2022/04/01/pos_alpino_nl_3_0.html) | [pos_alpino](https://nlp.johnsnowlabs.com/2022/04/01/pos_alpino_nl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fi | [fi.pos.ftb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ftb_fi_3_0.html) | [pos_ftb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ftb_fi_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.atis](https://nlp.johnsnowlabs.com/2022/04/01/pos_atis_tr_3_0.html) | [pos_atis](https://nlp.johnsnowlabs.com/2022/04/01/pos_atis_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ca | [ca.pos.ancora](https://nlp.johnsnowlabs.com/2022/04/01/pos_ancora_ca_3_0.html) | [pos_ancora](https://nlp.johnsnowlabs.com/2022/04/01/pos_ancora_ca_3_0.html) | Part of Speech Tagging | PerceptronModel |
| gl | [gl.pos.ctg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ctg_gl_3_0.html) | [pos_ctg](https://nlp.johnsnowlabs.com/2022/04/01/pos_ctg_gl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| de | [de.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_de_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_de_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fr | [fr.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_fr_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_fr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ja | [ja.pos.gsdluw](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsdluw_ja_3_0.html) | [pos_gsdluw](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsdluw_ja_3_0.html) | Part of Speech Tagging | PerceptronModel |
| it | [it.pos.isdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_isdt_it_3_0.html) | [pos_isdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_isdt_it_3_0.html) | Part of Speech Tagging | PerceptronModel |
| be | [be.pos.hse](https://nlp.johnsnowlabs.com/2022/04/01/pos_hse_be_3_0.html) | [pos_hse](https://nlp.johnsnowlabs.com/2022/04/01/pos_hse_be_3_0.html) | Part of Speech Tagging | PerceptronModel |
| nl | [nl.pos.lassysmall](https://nlp.johnsnowlabs.com/2022/04/01/pos_lassysmall_nl_3_0.html) | [pos_lassysmall](https://nlp.johnsnowlabs.com/2022/04/01/pos_lassysmall_nl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sv | [sv.pos.lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_sv_3_0.html) | [pos_lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_sv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| uk | [uk.pos.iu](https://nlp.johnsnowlabs.com/2022/04/01/pos_iu_uk_3_0.html) | [pos_iu](https://nlp.johnsnowlabs.com/2022/04/01/pos_iu_uk_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fr | [fr.pos.parisstories](https://nlp.johnsnowlabs.com/2022/04/01/pos_parisstories_fr_3_0.html) | [pos_parisstories](https://nlp.johnsnowlabs.com/2022/04/01/pos_parisstories_fr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_en_3_0.html) | [pos_partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| la | [la.pos.ittb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ittb_la_3_0.html) | [pos_ittb](https://nlp.johnsnowlabs.com/2022/04/01/pos_ittb_la_3_0.html) | Part of Speech Tagging | PerceptronModel |
| lzh | [lzh.pos.kyoto](https://nlp.johnsnowlabs.com/2022/04/01/pos_kyoto_lzh_3_0.html) | [pos_kyoto](https://nlp.johnsnowlabs.com/2022/04/01/pos_kyoto_lzh_3_0.html) | Part of Speech Tagging | PerceptronModel |
| id | [id.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_id_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_id_3_0.html) | Part of Speech Tagging | PerceptronModel |
| he | [he.pos.htb](https://nlp.johnsnowlabs.com/2022/04/01/pos_htb_he_3_0.html) | [pos_htb](https://nlp.johnsnowlabs.com/2022/04/01/pos_htb_he_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.kenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_kenet_tr_3_0.html) | [pos_kenet](https://nlp.johnsnowlabs.com/2022/04/01/pos_kenet_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| de | [de.pos.hdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdt_de_3_0.html) | [pos_hdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdt_de_3_0.html) | Part of Speech Tagging | PerceptronModel |
| qhe | [qhe.pos.hiencs](https://nlp.johnsnowlabs.com/2022/04/01/pos_hiencs_qhe_3_0.html) | [pos_hiencs](https://nlp.johnsnowlabs.com/2022/04/01/pos_hiencs_qhe_3_0.html) | Part of Speech Tagging | PerceptronModel |
| la | [la.pos.llct](https://nlp.johnsnowlabs.com/2022/04/01/pos_llct_la_3_0.html) | [pos_llct](https://nlp.johnsnowlabs.com/2022/04/01/pos_llct_la_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_en_3_0.html) | [pos_lines](https://nlp.johnsnowlabs.com/2022/04/01/pos_lines_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pcm | [pcm.pos.nsc](https://nlp.johnsnowlabs.com/2022/04/01/pos_nsc_pcm_3_0.html) | [pos_nsc](https://nlp.johnsnowlabs.com/2022/04/01/pos_nsc_pcm_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ko | [ko.pos.kaist](https://nlp.johnsnowlabs.com/2022/04/01/pos_kaist_ko_3_0.html) | [pos_kaist](https://nlp.johnsnowlabs.com/2022/04/01/pos_kaist_ko_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pt | [pt.pos.gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_pt_3_0.html) | [pos_gsd](https://nlp.johnsnowlabs.com/2022/04/01/pos_gsd_pt_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hi | [hi.pos.hdtb](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdtb_hi_3_0.html) | [pos_hdtb](https://nlp.johnsnowlabs.com/2022/04/01/pos_hdtb_hi_3_0.html) | Part of Speech Tagging | PerceptronModel |
| is | [is.pos.modern](https://nlp.johnsnowlabs.com/2022/04/01/pos_modern_is_3_0.html) | [pos_modern](https://nlp.johnsnowlabs.com/2022/04/01/pos_modern_is_3_0.html) | Part of Speech Tagging | PerceptronModel |
| en | [en.pos.gum](https://nlp.johnsnowlabs.com/2022/04/01/pos_gum_en_3_0.html) | [pos_gum](https://nlp.johnsnowlabs.com/2022/04/01/pos_gum_en_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fro | [fro.pos.srcmf](https://nlp.johnsnowlabs.com/2022/04/01/pos_srcmf_fro_3_0.html) | [pos_srcmf](https://nlp.johnsnowlabs.com/2022/04/01/pos_srcmf_fro_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sl | [sl.pos.ssj](https://nlp.johnsnowlabs.com/2022/04/01/pos_ssj_sl_3_0.html) | [pos_ssj](https://nlp.johnsnowlabs.com/2022/04/01/pos_ssj_sl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ru | [ru.pos.taiga](https://nlp.johnsnowlabs.com/2022/04/01/pos_taiga_ru_3_0.html) | [pos_taiga](https://nlp.johnsnowlabs.com/2022/04/01/pos_taiga_ru_3_0.html) | Part of Speech Tagging | PerceptronModel |
| grc | [grc.pos.perseus](https://nlp.johnsnowlabs.com/2022/04/01/pos_perseus_grc_3_0.html) | [pos_perseus](https://nlp.johnsnowlabs.com/2022/04/01/pos_perseus_grc_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sr | [sr.pos.set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_sr_3_0.html) | [pos_set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_sr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| orv | [orv.pos.rnc](https://nlp.johnsnowlabs.com/2022/04/01/pos_rnc_orv_3_0.html) | [pos_rnc](https://nlp.johnsnowlabs.com/2022/04/01/pos_rnc_orv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ug | [ug.pos.udt](https://nlp.johnsnowlabs.com/2022/04/01/pos_udt_ug_3_0.html) | [pos_udt](https://nlp.johnsnowlabs.com/2022/04/01/pos_udt_ug_3_0.html) | Part of Speech Tagging | PerceptronModel |
| got | [got.pos.proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_got_3_0.html) | [pos_proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_got_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sv | [sv.pos.talbanken](https://nlp.johnsnowlabs.com/2021/03/23/pos_talbanken_sv.html) | [pos_talbanken](https://nlp.johnsnowlabs.com/2021/03/23/pos_talbanken_sv.html) | Part of Speech Tagging | PerceptronModel |
| sv | [sv.pos.talbanken](https://nlp.johnsnowlabs.com/2022/04/01/pos_talbanken_sv_3_0.html) | [pos_talbanken](https://nlp.johnsnowlabs.com/2022/04/01/pos_talbanken_sv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| pl | [pl.pos.pdb](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdb_pl_3_0.html) | [pos_pdb](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdb_pl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fa | [fa.pos.seraji](https://nlp.johnsnowlabs.com/2022/04/01/pos_seraji_fa_3_0.html) | [pos_seraji](https://nlp.johnsnowlabs.com/2022/04/01/pos_seraji_fa_3_0.html) | Part of Speech Tagging | PerceptronModel |
| tr | [tr.pos.penn](https://nlp.johnsnowlabs.com/2022/04/01/pos_penn_tr_3_0.html) | [pos_penn](https://nlp.johnsnowlabs.com/2022/04/01/pos_penn_tr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hu | [hu.pos.szeged](https://nlp.johnsnowlabs.com/2022/04/01/pos_szeged_hu_3_0.html) | [pos_szeged](https://nlp.johnsnowlabs.com/2022/04/01/pos_szeged_hu_3_0.html) | Part of Speech Tagging | PerceptronModel |
| sk | [sk.pos.snk](https://nlp.johnsnowlabs.com/2021/03/23/pos_snk_sk.html) | [pos_snk](https://nlp.johnsnowlabs.com/2021/03/23/pos_snk_sk.html) | Part of Speech Tagging | PerceptronModel |
| sk | [sk.pos.snk](https://nlp.johnsnowlabs.com/2022/04/01/pos_snk_sk_3_0.html) | [pos_snk](https://nlp.johnsnowlabs.com/2022/04/01/pos_snk_sk_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ro | [ro.pos.simonero](https://nlp.johnsnowlabs.com/2022/04/01/pos_simonero_ro_3_0.html) | [pos_simonero](https://nlp.johnsnowlabs.com/2022/04/01/pos_simonero_ro_3_0.html) | Part of Speech Tagging | PerceptronModel |
| it | [it.pos.postwita](https://nlp.johnsnowlabs.com/2022/04/01/pos_postwita_it_3_0.html) | [pos_postwita](https://nlp.johnsnowlabs.com/2022/04/01/pos_postwita_it_3_0.html) | Part of Speech Tagging | PerceptronModel |
| gl | [gl.pos.treegal](https://nlp.johnsnowlabs.com/2022/04/01/pos_treegal_gl_3_0.html) | [pos_treegal](https://nlp.johnsnowlabs.com/2022/04/01/pos_treegal_gl_3_0.html) | Part of Speech Tagging | PerceptronModel |
| cs | [cs.pos.pdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdt_cs_3_0.html) | [pos_pdt](https://nlp.johnsnowlabs.com/2022/04/01/pos_pdt_cs_3_0.html) | Part of Speech Tagging | PerceptronModel |
| ro | [ro.pos.rrt](https://nlp.johnsnowlabs.com/2022/04/01/pos_rrt_ro_3_0.html) | [pos_rrt](https://nlp.johnsnowlabs.com/2022/04/01/pos_rrt_ro_3_0.html) | Part of Speech Tagging | PerceptronModel |
| orv | [orv.pos.torot](https://nlp.johnsnowlabs.com/2022/04/01/pos_torot_orv_3_0.html) | [pos_torot](https://nlp.johnsnowlabs.com/2022/04/01/pos_torot_orv_3_0.html) | Part of Speech Tagging | PerceptronModel |
| hr | [hr.pos.set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_hr_3_0.html) | [pos_set](https://nlp.johnsnowlabs.com/2022/04/01/pos_set_hr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| la | [la.pos.proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_la_3_0.html) | [pos_proiel](https://nlp.johnsnowlabs.com/2022/04/01/pos_proiel_la_3_0.html) | Part of Speech Tagging | PerceptronModel |
| fr | [fr.pos.partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_fr_3_0.html) | [pos_partut](https://nlp.johnsnowlabs.com/2022/04/01/pos_partut_fr_3_0.html) | Part of Speech Tagging | PerceptronModel |
| it | [it.pos.vit](https://nlp.johnsnowlabs.com/2022/04/01/pos_vit_it_3_0.html) | [pos_vit](https://nlp.johnsnowlabs.com/2022/04/01/pos_vit_it_3_0.html) | Part of Speech Tagging | PerceptronModel |

Bugfixes

- Improved Error Messages and integrated detection and stopping of endless loops which could occur during construction
of nlu pipelines



Additional NLU resources
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!


1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.4.2

Multilingual DeBERTa Transformer Embeddings for 100+ Languages, Spanish Deidentification and NER for Randomized Clinical Trials - John Snow Labs NLU 3.4.2

We are very excited NLU 3.4.2 has been released.
On the open source side we have 5 new DeBERTa Transformer models for English and Multi-Lingual for 100+ languages.
DeBERTa improves over BERT and RoBERTa by introducing two novel techniques.

For the healthcare side we have new NER models for randomized clinical trials (RCT) which can detect entities of type
`BACKGROUND`, `CONCLUSIONS`, `METHODS`, `OBJECTIVE`, `RESULTS` from clinical text.
Additionally, new Spanish Deidentification NER models for entities like `STATE`, `PATIENT`, `DEVICE`, `COUNTRY`, `ZIP`, `PHONE`, `HOSPITAL` and many more.

New Open Source Models

Integrates models from [Spark NLP 3.4.2](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.4.2) release

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:----------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------|:-----------|:------------------|
| en | [en.embed.deberta_v3_xsmall](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_xsmall_en_3_0.html) | [deberta_v3_xsmall](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_xsmall_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| en | [en.embed.deberta_v3_small](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_small_en_3_0.html) | [deberta_v3_small](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_small_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| en | [en.embed.deberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_base_en_3_0.html) | [deberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_base_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| en | [en.embed.deberta_v3_large](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_large_en_3_0.html) | [deberta_v3_large](https://nlp.johnsnowlabs.com/2022/03/10/deberta_v3_large_en_3_0.html) | Embeddings | DeBertaEmbeddings |
| xx | [xx.embed.mdeberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/mdeberta_v3_base_xx_3_0.html) | [mdeberta_v3_base](https://nlp.johnsnowlabs.com/2022/03/10/mdeberta_v3_base_xx_3_0.html) | Embeddings | DeBertaEmbeddings |


New Healthcare Models

Integrates models from [Spark NLP For Healthcare 3.4.2](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#342) release

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:--------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------|:-------------------------------------|
| en | [en.med_ner.clinical_trials](https://nlp.johnsnowlabs.com/2022/03/01/bert_sequence_classifier_rct_biobert_en_2_4.html) | [bert_sequence_classifier_rct_biobert](https://nlp.johnsnowlabs.com/2022/03/01/bert_sequence_classifier_rct_biobert_en_2_4.html) | Text Classification | MedicalBertForSequenceClassification |
| es | [es.med_ner.deid.generic.roberta](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_generic_roberta_augmented_es.html) | [ner_deid_generic_roberta_augmented](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_generic_roberta_augmented_es.html) | De-identification | MedicalNerModel |
| es | [es.med_ner.deid.subentity.roberta](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_subentity_roberta_augmented_es.html) | [ner_deid_subentity_roberta_augmented](https://nlp.johnsnowlabs.com/2022/02/15/ner_deid_subentity_roberta_augmented_es.html) | De-identification | MedicalNerModel |
| en | [en.med_ner.deid.generic_augmented](https://nlp.johnsnowlabs.com/2021/06/30/ner_deid_generic_augmented_en.html) | [ner_deid_generic_augmented](https://nlp.johnsnowlabs.com/2021/06/30/ner_deid_generic_augmented_en.html) | ['Named Entity Recognition', 'De-identification'] | MedicalNerModel |
| en | [en.med_ner.deid.subentity_augmented](https://nlp.johnsnowlabs.com/2021/06/01/ner_deid_subentity_augmented_en.html) | [ner_deid_subentity_augmented](https://nlp.johnsnowlabs.com/2021/06/01/ner_deid_subentity_augmented_en.html) | ['Named Entity Recognition', 'De-identification'] | MedicalNerModel |


Additional NLU resources
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!


1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.4.1

We are very excited to announce the release of NLU 3.4.1
which features 22 new models for 23 languages where the
The open-source side covers new Embeddings for Vietnamese and English Clinical domains and Multilingual Embeddings for 12 Indian and 9 African Languages.
Additionally, there are new Sequence classifiers for Multilingual NER for 9 African languages,
German Sentiment Classifiers and English Emotion and Typo Classifiers.
The healthcare side covers Medical Spanish models, Classifiers for Drugs, Gender, the Pico Framework, and Relation Extractors for Adverse Drug events and Temporality.
Finally, Spark 3.2.X is now supported and bugs related to Databricks environments have been fixed.



General NLU Improvements
- Support for Spark 3.2.x


New Open Source Models
Based on the amazing [3.4.1 Spark NLP Release](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.4.1)
integrates new Multilingual embeddings for 12 Major Indian languages,
embeddings for Vietnamese, French, and English Clinical domains.
Additionally new Multilingual NER model for 9 African languages, English 6 Class Emotion classifier and Typo detectors.


New Embeddings
- **Multilingual ALBERT - IndicBert** model pretrained exclusively on 12 major Indian languages with size smaller and performance on par or better than competing models. Languages covered are Assamese, Bengali, English, Gujarati, Hindi, Kannada, Malayalam, Marathi, Oriya, Punjabi, Tamil, Telugu.
Available with [xx.embed.albert.indic](https://nlp.johnsnowlabs.com/2022/01/26/albert_indic_xx.html)
- **Fine tuned Vietnamese DistilBERT** Base cased embeddings. Available with [vi.embed.distilbert.cased](https://nlp.johnsnowlabs.com/2022/01/13/distilbert_base_cased_vi.html)
- **Clinical Longformer Embeddings** which consistently out-performs ClinicalBERT for various downstream
tasks and on datasets. Available with [en.embed.longformer.clinical](https://nlp.johnsnowlabs.com/2022/02/08/clinical_longformer_en.html)
- **Fine tuned Static French Word2Vec Embeddings** in 3 sizes, 200d, 300d and 100d. Available with [fr.embed.word2vec_wiki_1000](https://nlp.johnsnowlabs.com/2022/01/26/word2vec_wiki_1000_fr.html), [fr.embed.word2vec_wac_200](https://nlp.johnsnowlabs.com/2022/02/01/word2vec_wac_200_fr.html) and [fr.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/02/03/w2v_cc_300d_fr.html)

New Transformer based Token and Sequence Classifiers
- **Multilingual NER Distilbert** model which detects entities `DATE`, `LOC`, `ORG`, `PER` for the languages 9 African languages (Hausa, Igbo, Kinyarwanda, Luganda, Nigerian, Pidgin, Swahili, Wolof, and Yorùbá).
Available with [xx.ner.masakhaner.distilbert](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html)
- **German News Sentiment Classifier** available with [de.classify.news_sentiment.bert](https://nlp.johnsnowlabs.com/2022/01/18/bert_sequence_classifier_news_sentiment_de.html)
- **English Emotion Classifier for 6 Classes** available with [en.classify.emotion.bert](https://nlp.johnsnowlabs.com/2022/01/14/bert_sequence_classifier_emotion_en.html)
- **English Typo Detector **: available with [en.classify.typos.distilbert](https://nlp.johnsnowlabs.com/2022/01/19/distilbert_token_classifier_typo_detector_en.html)


| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:---------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:---------------------------------|
| xx | [xx.embed.albert.indic](https://nlp.johnsnowlabs.com/2022/01/26/albert_indic_xx.html) | [albert_indic](https://nlp.johnsnowlabs.com/2022/01/26/albert_indic_xx.html) | Embeddings | AlbertEmbeddings |
| xx | [xx.ner.masakhaner.distilbert](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html) | [xlm_roberta_large_token_classifier_masakhaner](https://nlp.johnsnowlabs.com/2022/01/18/distilbert_base_token_classifier_masakhaner_xx.html) | Named Entity Recognition | DistilBertForTokenClassification |
| en | [en.embed.longformer.clinical](https://nlp.johnsnowlabs.com/2022/02/08/clinical_longformer_en.html) | [clinical_longformer](https://nlp.johnsnowlabs.com/2022/02/08/clinical_longformer_en.html) | Embeddings | LongformerEmbeddings |
| en | [en.classify.emotion.bert](https://nlp.johnsnowlabs.com/2022/01/14/bert_sequence_classifier_emotion_en.html) | [bert_sequence_classifier_emotion](https://nlp.johnsnowlabs.com/2022/01/14/bert_sequence_classifier_emotion_en.html) | Text Classification | BertForSequenceClassification |
| de | [de.classify.news_sentiment.bert](https://nlp.johnsnowlabs.com/2022/01/18/bert_sequence_classifier_news_sentiment_de.html) | [bert_sequence_classifier_news_sentiment](https://nlp.johnsnowlabs.com/2022/01/18/bert_sequence_classifier_news_sentiment_de.html) | Sentiment Analysis | BertForSequenceClassification |
| en | [en.classify.typos.distilbert](https://nlp.johnsnowlabs.com/2022/01/19/distilbert_token_classifier_typo_detector_en.html) | [distilbert_token_classifier_typo_detector](https://nlp.johnsnowlabs.com/2022/01/19/distilbert_token_classifier_typo_detector_en.html) | Named Entity Recognition | DistilBertForTokenClassification |
| fr | [fr.embed.word2vec_wiki_1000](https://nlp.johnsnowlabs.com/2022/01/26/word2vec_wiki_1000_fr.html) | [word2vec_wiki_1000](https://nlp.johnsnowlabs.com/2022/01/26/word2vec_wiki_1000_fr.html) | Embeddings | WordEmbeddingsModel |
| fr | [fr.embed.word2vec_wac_200](https://nlp.johnsnowlabs.com/2022/02/01/word2vec_wac_200_fr.html) | [word2vec_wac_200](https://nlp.johnsnowlabs.com/2022/02/01/word2vec_wac_200_fr.html) | Embeddings | WordEmbeddingsModel |
| fr | [fr.embed.w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/02/03/w2v_cc_300d_fr.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2022/02/03/w2v_cc_300d_fr.html) | Embeddings | WordEmbeddingsModel |
| vi | [vi.embed.distilbert.cased](https://nlp.johnsnowlabs.com/2022/01/13/distilbert_base_cased_vi.html) | [distilbert_base_cased](https://nlp.johnsnowlabs.com/2022/01/13/distilbert_base_cased_vi.html) | Embeddings | DistilBertEmbeddings |




New Healthcare Models
Integrated from the amazing [3.4.1 Spark NLP For Healthcare Release](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#341).
which makes 2 new Annotator Classes available, `MedicalBertForSequenceClassification` and `MedicalDistilBertForSequenceClassification`,
various medical Spanish models, RxNorm Resolvers,
Transformer based sequence classifiers for Drugs, Gender and the PICO framework,
and Relation extractors for Temporality and Causality of Drugs and Adverse Events.



New Medical Spanish Models
- **Spanish Word2Vec Embeddings** available with [es.embed.sciwiki_300d](https://nlp.johnsnowlabs.com/2020/05/27/embeddings_sciwiki_300d_es.html)
- **Spanish PHI Deidentification NER models** with two different subsets of entities extracted, available with [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_generic_es.html) and [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_subentity_es.html)

New Resolvers
- **RxNorm resolvers** with augmented concept data available with [en.med_ner.supplement_clinical](https://nlp.johnsnowlabs.com/2022/02/01/ner_supplement_clinical_en.html)

New Transformer based Sequence Classifiers
- **Adverse Drug Event Classifier Biobert based** available with [en.classify.ade.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_ade_en.html)
- **Patient Gender Classifier Biobert and Distilbert based** available with [en.classify.gender.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_gender_biobert_en.html)
and available with [en.classify.ade.seq_distilbert](https://nlp.johnsnowlabs.com/2022/02/08/distilbert_sequence_classifier_ade_en.html)
- **PiCO Framework Classifier** available with [en.classify.pico.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/07/bert_sequence_classifier_pico_biobert_en.html)

New Relation Extractors
- **Temporal Relation Extractor** available with [en.relation.temporal_events_clinical](https://nlp.johnsnowlabs.com/2020/09/28/re_temporal_events_clinical_en.html)
- **Adverse Drug Event Relation Extractors** one version Biobert Embeddings and one non-DL version available with [en.relation.adverse_drug_events.clinical](https://nlp.johnsnowlabs.com/2021/07/12/re_ade_clinical_en.html) available with [en.relation.adverse_drug_events.clinical.biobert](https://nlp.johnsnowlabs.com/2021/07/12/redl_ade_biobert_en.html)

| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:--------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:-------------------------------------------|
| es | [es.embed.sciwiki_300d](https://nlp.johnsnowlabs.com/2020/05/27/embeddings_sciwiki_300d_es.html) | [embeddings_sciwiki_300d](https://nlp.johnsnowlabs.com/2020/05/27/embeddings_sciwiki_300d_es.html) | Embeddings | WordEmbeddingsModel |
| es | [es.med_ner.deid.generic](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_generic_es.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_generic_es.html) | De-identification | MedicalNerModel |
| es | [es.med_ner.deid.subentity](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_subentity_es.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/01/18/ner_deid_subentity_es.html) | De-identification | MedicalNerModel |
| en | [en.med_ner.supplement_clinical](https://nlp.johnsnowlabs.com/2022/02/01/ner_supplement_clinical_en.html) | [ner_supplement_clinical](https://nlp.johnsnowlabs.com/2022/02/01/ner_supplement_clinical_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.resolve.rxnorm.augmented_re](https://nlp.johnsnowlabs.com/2022/02/09/sbiobertresolve_rxnorm_augmented_re_en.html) | [sbiobertresolve_rxnorm_augmented_re](https://nlp.johnsnowlabs.com/2022/02/09/sbiobertresolve_rxnorm_augmented_re_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.classify.ade.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_ade_en.html) | [bert_sequence_classifier_ade](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_ade_en.html) | Text Classification | MedicalBertForSequenceClassification |
| en | [en.classify.gender.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_gender_biobert_en.html) | [bert_sequence_classifier_gender_biobert](https://nlp.johnsnowlabs.com/2022/02/08/bert_sequence_classifier_gender_biobert_en.html) | Text Classification | MedicalBertForSequenceClassification |
| en | [en.classify.pico.seq_biobert](https://nlp.johnsnowlabs.com/2022/02/07/bert_sequence_classifier_pico_biobert_en.html) | [bert_sequence_classifier_pico_biobert](https://nlp.johnsnowlabs.com/2022/02/07/bert_sequence_classifier_pico_biobert_en.html) | Text Classification | MedicalBertForSequenceClassification |
| en | [en.classify.ade.seq_distilbert](https://nlp.johnsnowlabs.com/2022/02/08/distilbert_sequence_classifier_ade_en.html) | [distilbert_sequence_classifier_ade](https://nlp.johnsnowlabs.com/2022/02/08/distilbert_sequence_classifier_ade_en.html) | Text Classification | MedicalDistilBertForSequenceClassification |
| en | [en.relation.temporal_events_clinical](https://nlp.johnsnowlabs.com/2020/09/28/re_temporal_events_clinical_en.html) | [re_temporal_events_clinical](https://nlp.johnsnowlabs.com/2020/09/28/re_temporal_events_clinical_en.html) | Relation Extraction | RelationExtractionModel |
| en | [en.relation.adverse_drug_events.clinical](https://nlp.johnsnowlabs.com/2021/07/12/re_ade_clinical_en.html) | [re_ade_clinical](https://nlp.johnsnowlabs.com/2021/07/12/re_ade_clinical_en.html) | Relation Extraction | RelationExtractionModel |
| en | [en.relation.adverse_drug_events.clinical.biobert](https://nlp.johnsnowlabs.com/2021/07/12/redl_ade_biobert_en.html) | [redl_ade_biobert](https://nlp.johnsnowlabs.com/2021/07/12/redl_ade_biobert_en.html) | Relation Extraction | RelationExtractionDLModel |



Bugfixes
- Fixed bug that caused non-default output level of components to be sentence
- Fixed a bug that caused nlu references pointing to pretrained pipelines in spark nlp to crash in Databricks environments


Additional NLU resources
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!


1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.4.0

We are incredibly excited to announce John Snow Labs NLU 3.4.0 has been released!
This release features `11 new annotator classes` and `80` new models, including 3 `OCR Transformers` which enable you to extract text
from various file types, support for `GPT2` and new pretrained `T5` models for **Text Generation** and dozens more of new transformer based models
for **Token and Sequence Classification**.
This includes `8 new Sequence classifier models` which can be pretrained in Huggingface and imported into Spark NLP and NLU.
Finally, the NLU tutorial page of the [140+ notebooks has been updated](https://nlu.johnsnowlabs.com/docs/en/notebooks)


**New** NLU OCR Features
3 new OCR based spells are supported, which enable extracting `text` from files of type
`JPEG`, `PNG`, `BMP`, `WBMP`, `GIF`, `JPG`, `TIFF`, `DOCX`, `PDF` in just 1 line of code.
You need a Spark OCR license for using these, which is available for [free here](https://www.johnsnowlabs.com/spark-nlp-try-free/) and refer to the new
[OCR tutorial notebook](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/ocr/ocr_for_img_pdf_docx_files.ipynb)
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/ocr/ocr_for_img_pdf_docx_files.ipynb)
Find more details on the [NLU OCR documentation page](https://nlu.johnsnowlabs.com/docs/en/nlu_for_ocr)


**New** NLU Healthcare Features
The healthcare side features a new `MedicalBertForTokenClassifier` annotator which is a Bert based model for token classification problems like `Named Entity Recognition`,
`Parts of Speech` and much more. Overall there are `28` new models which include German De-Identification models, English NER models for extracting `Drug Development Trials`,
`Clinical Abbreviations and Acronyms`, NER models for chemical compounds/drugs and genes/proteins, updated `MedicalBertForTokenClassifier` NER models for the medical domains `Adverse drug Events`,
`Anatomy`, `Chemicals`, `Genes`,`Proteins`, `Cellular/Molecular Biology`, `Drugs`, `Bacteria`, `De-Identification` and general Medical and Clinical Named Entities.
For **Entity Relation Extraction** between entity pairs new models for interaction between `Drugs and Proteins`.
For **Entity Resolution** new models for resolving `Clinical Abbreviations and Acronyms` to their full length names and also a model for resolving `Drug Substance Entities` to the categories
`Clinical Drug`, `Pharmacologic Substance`, `Antibiotic`, `Hazardous` or `Poisonous Substance` and new resolvers for `LOINC` and `SNOMED` terminologies.



**New** NLU Open source Features
On the open source side we have new support for [Open Ai's `GPT2`](https://openai.com/blog/tags/gpt-2/) for various text sequence to sequence problems and
additionally the following new Transformer models are supported :
`RoBertaForSequenceClassification`, `XlmRoBertaForSequenceClassification`, `LongformerForSequenceClassification`,
`AlbertForSequenceClassification`, `XlnetForSequenceClassification`, `Word2Vec` with various pre-trained weights for various problems!

New **GPT2** models for generating text conditioned on some input,
New **T5 style transfer models** for `active to passive`, `formal to informal`, `informal to formal`, `passive to active` sequence to sequence generation.
Additionally, a new T5 model for generating SQL code from natural language input is provided.

On top of this dozens new Transformer based Sequence Classifiers and Token Classifiers have been released, this is includes for `Token Classifier` the following models :
Multi-Lingual general NER models for **10 African Languages** (`Amharic`, `Hausa`, `Igbo`, `Kinyarwanda`, `Luganda`, `Nigerian`, `Pidgin`, `Swahilu`, `Wolof`, and `Yorùbá`),
**10 high resourced languages** (10 high resourced languages (`Arabic`, `German`, `English`, `Spanish`, `French`, `Italian`, `Latvian`, `Dutch`, `Portuguese` and `Chinese`),
**6 Scandinavian languages** (`Danish`, `Norwegian-Bokmål`, `Norwegian-Nynorsk`, `Swedish`, `Icelandic`, `Faroese`) ,
Uni-Lingual NER models for general entites in the language `Chinese`, `Hindi`, `Islandic`, `Indonesian`
and finally English NER models for extracting entities related to `Stocks Ticker Symbols`, `Restaurants`, `Time`.

For `Sequence Classification` new models for classifying `Toxicity in Russian text` and English models for
`Movie Reviews`, `News Categorization`, `Sentimental Tone` and `General Sentiment`



New NLU OCR Models
The following Transformers have been integrated from [Spark OCR](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components)

| NLU Spell | Transformer Class |
|----------------------|-----------------------------------------------------------------------------------------|
| nlu.load(`img2text`) | [ImageToText](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components#imagetotext) |
| nlu.load(`pdf2text`) | [PdfToText](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components#pdftotext) |
| nlu.load(`doc2text`) | [DocToText](https://nlp.johnsnowlabs.com/docs/en/ocr_pipeline_components#doctotext) |





New Open Source Models

Integration for the 49 new models from the colossal [Spark NLP 3.4.0 release](https://nlp.johnsnowlabs.com/docs/en/release_notes#340)




| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:----------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:------------------------------------|
| en | [en.gpt2.distilled](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_distilled_en.html) | [gpt2_distilled](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_distilled_en.html) | Text Generation | GPT2Transformer |
| en | [en.gpt2](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_en.html) | [gpt2](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_en.html) | Text Generation | GPT2Transformer |
| en | [en.gpt2.medium](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_medium_en.html) | [gpt2_medium](https://nlp.johnsnowlabs.com/2021/12/03/gpt2_medium_en.html) | Text Generation | GPT2Transformer |
| en | [en.gpt2.large](https://nlp.johnsnowlabs.com/2021/12/03/gpt_large_en.html) | [gpt_large](https://nlp.johnsnowlabs.com/2021/12/03/gpt_large_en.html) | Text Generation | GPT2Transformer |
| en | [en.t5.active_to_passive_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_active_to_passive_styletransfer_en.html) | [t5_active_to_passive_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_active_to_passive_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.formal_to_informal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_formal_to_informal_styletransfer_en.html) | [t5_formal_to_informal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_formal_to_informal_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.grammar_error_corrector](https://nlp.johnsnowlabs.com/2022/01/12/t5_grammar_error_corrector_en.html) | [t5_grammar_error_corrector](https://nlp.johnsnowlabs.com/2022/01/12/t5_grammar_error_corrector_en.html) | Text Generation | T5Transformer |
| en | [en.t5.informal_to_formal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_informal_to_formal_styletransfer_en.html) | [t5_informal_to_formal_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_informal_to_formal_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.passive_to_active_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_passive_to_active_styletransfer_en.html) | [t5_passive_to_active_styletransfer](https://nlp.johnsnowlabs.com/2022/01/12/t5_passive_to_active_styletransfer_en.html) | Text Generation | T5Transformer |
| en | [en.t5.wikiSQL](https://nlp.johnsnowlabs.com/2022/01/12/t5_small_wikiSQL_en.html) | [t5_small_wikiSQL](https://nlp.johnsnowlabs.com/2022/01/12/t5_small_wikiSQL_en.html) | Text Generation | T5Transformer |
| xx | [xx.ner.masakhaner](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html) | [xlm_roberta_large_token_classifier_masakhaner](https://nlp.johnsnowlabs.com/2021/12/06/xlm_roberta_large_token_classifier_masakhaner_xx.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| xx | [xx.ner.high_resourced_lang](https://nlp.johnsnowlabs.com/2021/12/26/xlm_roberta_large_token_classifier_hrl_xx.html) | [xlm_roberta_large_token_classifier_hrl](https://nlp.johnsnowlabs.com/2021/12/26/xlm_roberta_large_token_classifier_hrl_xx.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| xx | [xx.ner.scandinavian](https://nlp.johnsnowlabs.com/2021/12/09/bert_token_classifier_scandi_ner_xx.html) | [bert_token_classifier_scandi_ner](https://nlp.johnsnowlabs.com/2021/12/09/bert_token_classifier_scandi_ner_xx.html) | Named Entity Recognition | BertForTokenClassification |
| en | [en.embed.electra.medical](https://nlp.johnsnowlabs.com/2022/01/04/electra_medal_acronym_en.html) | [electra_medal_acronym](https://nlp.johnsnowlabs.com/2022/01/04/electra_medal_acronym_en.html) | Embeddings | BertEmbeddings |
| en | [en.ner.restaurant](https://nlp.johnsnowlabs.com/2021/12/31/nerdl_restaurant_100d_en.html) | [nerdl_restaurant_100d](https://nlp.johnsnowlabs.com/2021/12/31/nerdl_restaurant_100d_en.html) | Named Entity Recognition | NerDLModel |
| en | [en.embed.word2vec.gigaword_wiki](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_wiki_300_en.html) | [word2vec_gigaword_wiki_300](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_wiki_300_en.html) | Embeddings | Word2VecModel |
| en | [en.embed.word2vec.gigaword](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_300_en.html) | [word2vec_gigaword_300](https://nlp.johnsnowlabs.com/2022/01/03/word2vec_gigaword_300_en.html) | Embeddings | Word2VecModel |
| en | [en.classify.xlm_roberta.imdb](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_imdb_en.html) | [xlm_roberta_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_imdb_en.html) | Text Classification | XlmRoBertaForSequenceClassification |
| en | [en.classify.xlm_roberta.ag_news](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_ag_news_en.html) | [xlm_roberta_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_ag_news_en.html) | Text Classification | XlmRoBertaForSequenceClassification |
| en | [en.classify.roberta.imdb](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_imdb_en.html) | [roberta_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_imdb_en.html) | Text Classification | RoBertaForSequenceClassification |
| en | [en.classify.roberta.ag_news](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_ag_news_en.html) | [roberta_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/roberta_base_sequence_classifier_ag_news_en.html) | Text Classification | RoBertaForSequenceClassification |
| en | [en.classify.albert.ag_news](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_ag_news_en.html) | [albert_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_ag_news_en.html) | Text Classification | AlbertForSequenceClassification |
| en | [en.classify.albert.imdb](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_imdb_en.html) | [albert_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/16/albert_base_sequence_classifier_imdb_en.html) | Text Classification | AlbertForSequenceClassification |
| en | [en.classify.ag_news.longformer](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | [longformer_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | Text Classification | LongformerForSequenceClassification |
| en | [en.classify.imdb.xlnet](https://nlp.johnsnowlabs.com/2021/12/23/xlnet_base_sequence_classifier_imdb_en.html) | [xlnet_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/23/xlnet_base_sequence_classifier_imdb_en.html) | Text Classification | XlnetForSequenceClassification |
| en | [en.classify.finance_sentiment](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_finbert_tone_en.html) | [bert_sequence_classifier_finbert_tone](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_finbert_tone_en.html) | Sentiment Analysis | BertForSequenceClassification |
| en | [en.classify.imdb.longformer](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_imdb_en.html) | [longformer_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_imdb_en.html) | Text Classification | LongformerForSequenceClassification |
| en | [en.classify.ag_news.longformer](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | [longformer_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/12/16/longformer_base_sequence_classifier_ag_news_en.html) | Text Classification | LongformerForSequenceClassification |
| en | [en.ner.time](https://nlp.johnsnowlabs.com/2021/12/28/roberta_token_classifier_timex_semeval_en.html) | [roberta_token_classifier_timex_semeval](https://nlp.johnsnowlabs.com/2021/12/28/roberta_token_classifier_timex_semeval_en.html) | Named Entity Recognition | RoBertaForTokenClassification |
| en | [en.ner.stocks_ticker](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_ticker_en.html) | [roberta_token_classifier_ticker](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_ticker_en.html) | Named Entity Recognition | RoBertaForTokenClassification |
| ru | [ru.classify.toxic](https://nlp.johnsnowlabs.com/2021/12/22/bert_sequence_classifier_toxicity_ru.html) | [bert_sequence_classifier_toxicity](https://nlp.johnsnowlabs.com/2021/12/22/bert_sequence_classifier_toxicity_ru.html) | Text Classification | BertForSequenceClassification |
| it | [it.classify.sentiment](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_sentiment_it.html) | [bert_sequence_classifier_sentiment](https://nlp.johnsnowlabs.com/2021/12/21/bert_sequence_classifier_sentiment_it.html) | Sentiment Analysis | BertForSequenceClassification |
| es | [es.ner](https://nlp.johnsnowlabs.com/2020/02/03/wikiner_6B_100_es.html) | [wikiner_6B_100](https://nlp.johnsnowlabs.com/2020/02/03/wikiner_6B_100_es.html) | Named Entity Recognition | NerDLModel |
| is | [is.ner](https://nlp.johnsnowlabs.com/2021/12/06/roberta_token_classifier_icelandic_ner_is.html) | [roberta_token_classifier_icelandic_ner](https://nlp.johnsnowlabs.com/2021/12/06/roberta_token_classifier_icelandic_ner_is.html) | Named Entity Recognition | RoBertaForTokenClassification |
| id | [id.pos](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_pos_tagger_id.html) | [roberta_token_classifier_pos_tagger](https://nlp.johnsnowlabs.com/2021/12/27/roberta_token_classifier_pos_tagger_id.html) | Part of Speech Tagging | RoBertaForTokenClassification |
| tr | [tr.ner](https://nlp.johnsnowlabs.com/2020/11/10/turkish_ner_840B_300_tr.html) | [turkish_ner_840B_300](https://nlp.johnsnowlabs.com/2020/11/10/turkish_ner_840B_300_tr.html) | Named Entity Recognition | NerDLModel |
| id | [id.ner](https://nlp.johnsnowlabs.com/2021/12/03/xlm_roberta_large_token_classification_ner_id.html) | [xlm_roberta_large_token_classification_ner](https://nlp.johnsnowlabs.com/2021/12/03/xlm_roberta_large_token_classification_ner_id.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| de | [de.ner](https://nlp.johnsnowlabs.com/2021/12/25/xlm_roberta_large_token_classifier_conll03_de.html) | [xlm_roberta_large_token_classifier_conll03](https://nlp.johnsnowlabs.com/2021/12/25/xlm_roberta_large_token_classifier_conll03_de.html) | Named Entity Recognition | XlmRoBertaForTokenClassification |
| hi | [hi.ner](https://nlp.johnsnowlabs.com/2021/12/27/bert_token_classifier_hi_en_ner_hi.html) | [bert_token_classifier_hi_en_ner](https://nlp.johnsnowlabs.com/2021/12/27/bert_token_classifier_hi_en_ner_hi.html) | Named Entity Recognition | BertForTokenClassification |
| nl | [nl.ner](https://nlp.johnsnowlabs.com/2020/05/10/wikiner_6B_100_nl.html) | [wikiner_6B_100](https://nlp.johnsnowlabs.com/2020/05/10/wikiner_6B_100_nl.html) | Named Entity Recognition | NerDLModel |
| zh | [zh.ner](https://nlp.johnsnowlabs.com/2021/12/07/bert_token_classifier_chinese_ner_zh.html) | [bert_token_classifier_chinese_ner](https://nlp.johnsnowlabs.com/2021/12/07/bert_token_classifier_chinese_ner_zh.html) | Named Entity Recognition | BertForTokenClassification |
| fr | [fr.classify.xlm_roberta.allocine](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_allocine_fr.html) | [xlm_roberta_base_sequence_classifier_allocine](https://nlp.johnsnowlabs.com/2021/12/23/xlm_roberta_base_sequence_classifier_allocine_fr.html) | Text Classification | XlmRoBertaForSequenceClassification |
| ur | [ur.classify.fakenews](https://nlp.johnsnowlabs.com/2021/12/29/classifierdl_urduvec_fakenews_ur.html) | [classifierdl_urduvec_fakenews](https://nlp.johnsnowlabs.com/2021/12/29/classifierdl_urduvec_fakenews_ur.html) | Text Classification | ClassifierDLModel |
| ur | [ur.classify.news](https://nlp.johnsnowlabs.com/2021/12/10/classifierdl_bert_news_ur.html) | [classifierdl_bert_news](https://nlp.johnsnowlabs.com/2021/12/10/classifierdl_bert_news_ur.html) | Text Classification | ClassifierDLModel |
| fi | [fi.embed_sentence.bert.uncased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | [bert_base_finnish_uncased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | Embeddings | BertSentenceEmbeddings |
| fi | [fi.embed_sentence.bert](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | [bert_base_finnish_uncased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_uncased_fi.html) | Embeddings | BertSentenceEmbeddings |
| fi | [fi.embed_sentence.bert.cased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_cased_fi.html) | [bert_base_finnish_cased](https://nlp.johnsnowlabs.com/2022/01/03/bert_base_finnish_cased_fi.html) | Embeddings | BertSentenceEmbeddings |
| te | [te.embed.distilbert](https://nlp.johnsnowlabs.com/2021/12/14/distilbert_uncased_te.html) | [distilbert_uncased](https://nlp.johnsnowlabs.com/2021/12/14/distilbert_uncased_te.html) | Embeddings | DistilBertEmbeddings |
| sw | [sw.embed.xlm_roberta](https://nlp.johnsnowlabs.com/2021/10/16/xlm_roberta_base_finetuned_swahili_sw.html) | [xlm_roberta_base_finetuned_swahili](https://nlp.johnsnowlabs.com/2021/10/16/xlm_roberta_base_finetuned_swahili_sw.html) | Embeddings | XlmRoBertaEmbeddings |





New Healthcare Models
Integration for the 28 new models from the amazing [Spark NLP for healthcare 3.4.0 release](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#340)


| Language | NLU Reference | Spark NLP Reference | Task | Annotator Class |
|:-----------|:------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|:----------------------------|
| en | [en.med_ner.chemprot.bert](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemprot_en.html) | [bert_token_classifier_ner_chemprot](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemprot_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.med_ner.chemprot.bert](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemprot_en.html) | [bert_token_classifier_ner_chemprot](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemprot_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_bacteria](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_bacteria_en.html) | [bert_token_classifier_ner_bacteria](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_bacteria_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_bacteria](https://nlp.johnsnowlabs.com/2022/01/07/bert_token_classifier_ner_bacteria_en.html) | [bert_token_classifier_ner_bacteria](https://nlp.johnsnowlabs.com/2022/01/07/bert_token_classifier_ner_bacteria_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_anatomy](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_anatomy_en.html) | [bert_token_classifier_ner_anatomy](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_anatomy_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_anatomy](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_anatomy_en.html) | [bert_token_classifier_ner_anatomy](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_anatomy_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_drugs](https://nlp.johnsnowlabs.com/2021/09/20/bert_token_classifier_ner_drugs_en.html) | [bert_token_classifier_ner_drugs](https://nlp.johnsnowlabs.com/2021/09/20/bert_token_classifier_ner_drugs_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_drugs](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_drugs_en.html) | [bert_token_classifier_ner_drugs](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_drugs_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl_slim](https://nlp.johnsnowlabs.com/2021/09/24/bert_token_classifier_ner_jsl_slim_en.html) | [bert_token_classifier_ner_jsl_slim](https://nlp.johnsnowlabs.com/2021/09/24/bert_token_classifier_ner_jsl_slim_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl_slim](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_slim_en.html) | [bert_token_classifier_ner_jsl_slim](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_slim_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2021/09/30/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | [bert_token_classifier_ner_ade](https://nlp.johnsnowlabs.com/2022/01/04/bert_token_classifier_ner_ade_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_deid](https://nlp.johnsnowlabs.com/2021/09/13/bert_token_classifier_ner_deid_en.html) | [bert_token_classifier_ner_deid](https://nlp.johnsnowlabs.com/2021/09/13/bert_token_classifier_ner_deid_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_deid](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_deid_en.html) | [bert_token_classifier_ner_deid](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_deid_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_clinical](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_clinical_en.html) | [bert_token_classifier_ner_clinical](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_clinical_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_clinical](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_clinical_en.html) | [bert_token_classifier_ner_clinical](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_clinical_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_jsl_en.html) | [bert_token_classifier_ner_jsl](https://nlp.johnsnowlabs.com/2021/08/28/bert_token_classifier_ner_jsl_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl](https://nlp.johnsnowlabs.com/2021/09/16/bert_token_classifier_ner_jsl_en.html) | [bert_token_classifier_ner_jsl](https://nlp.johnsnowlabs.com/2021/09/16/bert_token_classifier_ner_jsl_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_jsl](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_en.html) | [bert_token_classifier_ner_jsl](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_jsl_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_chemical](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemicals_en.html) | [bert_token_classifier_ner_chemicals](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemicals_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.ner_chemical](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemicals_en.html) | [bert_token_classifier_ner_chemicals](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_chemicals_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.bionlp](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_bionlp_en.html) | [bert_token_classifier_ner_bionlp](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_bionlp_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.bionlp](https://nlp.johnsnowlabs.com/2022/01/03/bert_token_classifier_ner_bionlp_en.html) | [bert_token_classifier_ner_bionlp](https://nlp.johnsnowlabs.com/2022/01/03/bert_token_classifier_ner_bionlp_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.cellular](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_cellular_en.html) | [bert_token_classifier_ner_cellular](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_cellular_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.classify.token_bert.cellular](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_cellular_en.html) | [bert_token_classifier_ner_cellular](https://nlp.johnsnowlabs.com/2022/01/06/bert_token_classifier_ner_cellular_en.html) | Named Entity Recognition | MedicalBertForTokenClassifier |
| en | [en.med_ner.abbreviation_clinical](https://nlp.johnsnowlabs.com/2021/12/30/ner_abbreviation_clinical_en.html) | [ner_abbreviation_clinical](https://nlp.johnsnowlabs.com/2021/12/30/ner_abbreviation_clinical_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.med_ner.drugprot_clinical](https://nlp.johnsnowlabs.com/2021/12/20/ner_drugprot_clinical_en.html) | [ner_drugprot_clinical](https://nlp.johnsnowlabs.com/2021/12/20/ner_drugprot_clinical_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.ner.drug_development_trials](https://nlp.johnsnowlabs.com/2021/12/17/bert_token_classifier_drug_development_trials_en.html) | [bert_token_classifier_drug_development_trials](https://nlp.johnsnowlabs.com/2021/12/17/bert_token_classifier_drug_development_trials_en.html) | Named Entity Recognition | BertForTokenClassification |
| en | [en.med_ner.chemprot](https://nlp.johnsnowlabs.com/2021/04/01/ner_chemprot_biobert_en.html) | [ner_chemprot_biobert](https://nlp.johnsnowlabs.com/2021/04/01/ner_chemprot_biobert_en.html) | Named Entity Recognition | MedicalNerModel |
| en | [en.relation.drugprot](https://nlp.johnsnowlabs.com/2022/01/05/redl_drugprot_biobert_en.html) | [redl_drugprot_biobert](https://nlp.johnsnowlabs.com/2022/01/05/redl_drugprot_biobert_en.html) | Relation Extraction | RelationExtractionDLModel |
| en | [en.relation.drugprot.clinical](https://nlp.johnsnowlabs.com/2022/01/05/re_drugprot_clinical_en.html) | [re_drugprot_clinical](https://nlp.johnsnowlabs.com/2022/01/05/re_drugprot_clinical_en.html) | Relation Extraction | RelationExtractionModel |
| en | [en.resolve.clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2021/12/11/sbiobertresolve_clinical_abbreviation_acronym_en.html) | [sbiobertresolve_clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2021/12/11/sbiobertresolve_clinical_abbreviation_acronym_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2022/01/03/sbiobertresolve_clinical_abbreviation_acronym_en.html) | [sbiobertresolve_clinical_abbreviation_acronym](https://nlp.johnsnowlabs.com/2022/01/03/sbiobertresolve_clinical_abbreviation_acronym_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.umls_drug_substance](https://nlp.johnsnowlabs.com/2021/12/06/sbiobertresolve_umls_drug_substance_en.html) | [sbiobertresolve_umls_drug_substance](https://nlp.johnsnowlabs.com/2021/12/06/sbiobertresolve_umls_drug_substance_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.loinc_cased](https://nlp.johnsnowlabs.com/2021/12/24/sbiobertresolve_loinc_cased_en.html) | [sbiobertresolve_loinc_cased](https://nlp.johnsnowlabs.com/2021/12/24/sbiobertresolve_loinc_cased_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.resolve.loinc_uncased](https://nlp.johnsnowlabs.com/2021/12/31/sbluebertresolve_loinc_uncased_en.html) | [sbluebertresolve_loinc_uncased](https://nlp.johnsnowlabs.com/2021/12/31/sbluebertresolve_loinc_uncased_en.html) | Entity Resolution | SentenceEntityResolverModel |
| en | [en.embed_sentence.biobert.rxnorm](https://nlp.johnsnowlabs.com/2021/12/23/sbiobert_jsl_rxnorm_cased_en.html) | [sbiobert_jsl_rxnorm_cased](https://nlp.johnsnowlabs.com/2021/12/23/sbiobert_jsl_rxnorm_cased_en.html) | Entity Resolution | BertSentenceEmbeddings |
| en | [en.embed_sentence.bert_uncased.rxnorm](https://nlp.johnsnowlabs.com/2021/12/23/sbert_jsl_medium_rxnorm_uncased_en.html) | [sbert_jsl_medium_rxnorm_uncased](https://nlp.johnsnowlabs.com/2021/12/23/sbert_jsl_medium_rxnorm_uncased_en.html) | Embeddings | BertSentenceEmbeddings |
| en | [en.embed_sentence.bert_uncased.rxnorm](https://nlp.johnsnowlabs.com/2022/01/03/sbert_jsl_medium_rxnorm_uncased_en.html) | [sbert_jsl_medium_rxnorm_uncased](https://nlp.johnsnowlabs.com/2022/01/03/sbert_jsl_medium_rxnorm_uncased_en.html) | Embeddings | BertSentenceEmbeddings |
| en | [en.resolve.snomed_drug](https://nlp.johnsnowlabs.com/2022/01/01/sbiobertresolve_snomed_drug_en.html) | [sbiobertresolve_snomed_drug](https://nlp.johnsnowlabs.com/2022/01/01/sbiobertresolve_snomed_drug_en.html) | Entity Resolution | SentenceEntityResolverModel |
| de | [de.med_ner.deid_subentity](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_subentity_de.html) | [ner_deid_subentity](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_subentity_de.html) | Named Entity Recognition | MedicalNerModel |
| de | [de.med_ner.deid_generic](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_generic_de.html) | [ner_deid_generic](https://nlp.johnsnowlabs.com/2022/01/06/ner_deid_generic_de.html) | Named Entity Recognition | MedicalNerModel |
| de | [de.embed.w2v](https://nlp.johnsnowlabs.com/2020/09/06/w2v_cc_300d_de.html) | [w2v_cc_300d](https://nlp.johnsnowlabs.com/2020/09/06/w2v_cc_300d_de.html) | Embeddings | WordEmbeddingsModel |





Additional NLU resources

* [NLU OCR tutorial notebook](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/ocr/ocr_for_img_pdf_docx_files.ipynb)
* [140+ NLU Tutorials](https://nlu.johnsnowlabs.com/docs/en/notebooks)
* [NLU in Action](https://nlp.johnsnowlabs.com/demo)
* [Streamlit visualizations docs](https://nlu.johnsnowlabs.com/docs/en/streamlit_viz_examples)
* The complete list of all 4000+ models & pipelines in 200+ languages is available on [Models Hub](https://nlp.johnsnowlabs.com/models).
* [Spark NLP publications](https://medium.com/spark-nlp)
* [NLU documentation](https://nlu.johnsnowlabs.com/docs/en/install)
* [Discussions](https://github.com/JohnSnowLabs/spark-nlp/discussions) Engage with other community members, share ideas, and show off how you use Spark NLP and NLU!


1 line Install NLU on Google Colab
!wget https://setup.johnsnowlabs.com/nlu/colab.sh -O - | bash
1 line Install NLU on Kaggle
!wget https://setup.johnsnowlabs.com/nlu/kaggle.sh -O - | bash
Install via PIP
! pip install nlu pyspark streamlit==0.80.0

3.3.1

We are incredibly excited to announce NLU 3.3.1 has been released with 48 new models in 9 languages!

It comes with 2 new types of state-of-the-art models,`distilBERT` and `BERT for sequence classification` with various pre-trained weights,
state-of-the-art bert based classifiers for problems in the domains of `Finance`, `Sentiment Classification`, `Industry`, `News`, and much more.

On the **healthcare** side, NLU features 22 new models in for English and Spanish with
with `entity Resolver Models` for LOINC, MeSH, NDC and SNOMED and UMLS Diseases,
`NER models` for `Biomarkers`, `NIHSS-Guidelines`, `COVID Trials` , `Chemical Trials`,
`Bert based Token Classifier models` for `biological`, `genetical`,`cancer`, `cellular` terms,
`Bert for Sequence Classification models` for `clinical question vs statement classification`
and finally `Spanish Clinical NER ` and `Resolver Models`

Once again, we would like to thank our community for making another amazing release possible!

New Open Source Models and Features
Integrates the amazing [Spark NLP](https://nlp.johnsnowlabs.com/docs/en/quickstart) [3.3.3](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.3.3) and [3.3.2](https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.3.2) releases, featuring:

- New state-of-the-art fine-tuned `BERT models for Sequence Classification` in `English`, `French`, `German`, `Spanish`, `Japanese`, `Turkish`, `Russian`, and multilingual languages.
- `DistilBertForSequenceClassification` models in `English`, `French` and `Urdu`
- `Word2Vec` models.
- `classify.distilbert_sequence.banking77` : `Banking NER model` trained on BANKING77 dataset, which provides a very fine-grained set of intents in a banking domain. It comprises 13,083 customer service queries labeled with 77 intents. It focuses on fine-grained single-domain intent detection. Can extract entities like activate_my_card, age_limit, apple_pay_or_google_pay, atm_support, automatic_top_up, balance_not_updated_after_bank_transfer, balance_not_updated_after_cheque_or_cash_deposit, beneficiary_not_allowed, cancel_transfer, card_about_to_expire, card_acceptance, card_arrival, card_delivery_estimate, card_linking, card_not_working, card_payment_fee_charged, card_payment_not_recognised, card_payment_wrong_exchange_rate, card_swallowed, cash_withdrawal_charge, cash_withdrawal_not_recognised, change_pin, compromised_card, contactless_not_working, country_support, declined_card_payment, declined_cash_withdrawal, declined_transfer, direct_debit_payment_not_recognised, disposable_card_limits, edit_personal_details, exchange_charge, exchange_rate, exchange_via_app, extra_charge_on_statement, failed_transfer, fiat_currency_support, get_disposable_virtual_card, get_physical_card, getting_spare_card, getting_virtual_card, lost_or_stolen_card, lost_or_stolen_phone, order_physical_card, passcode_forgotten, pending_card_payment, pending_cash_withdrawal, pending_top_up, pending_transfer, pin_blocked, receiving_money,
- `classify.distilbert_sequence.industry` : `Industry NER model` which can extract entities like Advertising, Aerospace & Defense, Apparel Retail, Apparel, Accessories & Luxury Goods, Application Software, Asset Management & Custody Banks, Auto Parts & Equipment, Biotechnology, Building Products, Casinos & Gaming, Commodity Chemicals, Communications Equipment, Construction & Engineering, Construction Machinery & Heavy Trucks, Consumer Finance, Data Processing & Outsourced Services, Diversified Metals & Mining, Diversified Support Services, Electric Utilities, Electrical Components & Equipment, Electronic Equipment & Instruments, Environmental & Facilities Services, Gold, Health Care Equipment, Health Care Facilities, Health Care Services.
- `xx.classify.bert_sequence.sentiment` : `Multi-Lingual Sentiment Classifier` This a bert-base-multilingual-uncased model finetuned for sentiment analysis on product reviews in six languages: English, Dutch, German, French, Spanish and Italian. It predicts the sentiment of the review as a number of stars (between 1 and 5). This model is intended for direct use as a sentiment analysis model for product reviews in any of the six languages above, or for further finetuning on related sentiment analysis tasks.
- `distilbert_sequence.policy` : `Policy Classifier` This model was trained on 129.669 manually annotated sentences to classify text into one of seven political categories: ‘Economy’, ‘External Relations’, ‘Fabric of Society’, ‘Freedom and Democracy’, ‘Political System’, ‘Welfare and Quality of Life’ or ‘Social Groups’.
- `classify.bert_sequence.dehatebert_mono` : `Hate Speech Classifier` This model was trained on 129.669 manually annotated sentences to classify text into one of seven political categories: ‘Economy’, ‘External Relations’, ‘Fabric of Society’, ‘Freedom and Democracy’, ‘Political System’, ‘Welfare and Quality of Life’ or ‘Social Groups’.

Complete List of Open Source Models :
| Language | NLU Reference | Spark NLP Reference | Task |
|:-----------|:-------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------|
| en | [en.classify.bert_sequence.imdb_large](https://nlp.johnsnowlabs.com/2021/11/01/bert_large_sequence_classifier_imdb_en.html) | [bert_large_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/11/01/bert_large_sequence_classifier_imdb_en.html) | Text Classification |
| en | [en.classify.bert_sequence.imdb](https://nlp.johnsnowlabs.com/2021/11/01/bert_base_sequence_classifier_imdb_en.html) | [bert_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/11/01/bert_base_sequence_classifier_imdb_en.html) | Text Classification |
| en | [en.classify.bert_sequence.ag_news](https://nlp.johnsnowlabs.com/2021/11/02/bert_base_sequence_classifier_ag_news_en.html) | [bert_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/11/02/bert_base_sequence_classifier_ag_news_en.html) | Text Classification |
| en | [en.classify.bert_sequence.dbpedia_14](https://nlp.johnsnowlabs.com/2021/11/01/bert_base_sequence_classifier_dbpedia_14_en.html) | [bert_base_sequence_classifier_dbpedia_14](https://nlp.johnsnowlabs.com/2021/11/01/bert_base_sequence_classifier_dbpedia_14_en.html) | Text Classification |
| en | [en.classify.bert_sequence.finbert](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_finbert_en.html) | [bert_sequence_classifier_finbert](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_finbert_en.html) | Text Classification |
| en | [en.classify.bert_sequence.dehatebert_mono](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_dehatebert_mono_en.html) | [bert_sequence_classifier_dehatebert_mono](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_dehatebert_mono_en.html) | Text Classification |
| tr | [tr.classify.bert_sequence.sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_turkish_sentiment_tr.html) | [bert_sequence_classifier_turkish_sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_turkish_sentiment_tr.html) | Text Classification |
| de | [de.classify.bert_sequence.sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_sentiment_de.html) | [bert_sequence_classifier_sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_sentiment_de.html) | Text Classification |
| ru | [ru.classify.bert_sequence.sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_rubert_sentiment_ru.html) | [bert_sequence_classifier_rubert_sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_rubert_sentiment_ru.html) | Text Classification |
| ja | [ja.classify.bert_sequence.sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_japanese_sentiment_ja.html) | [bert_sequence_classifier_japanese_sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_japanese_sentiment_ja.html) | Text Classification |
| es | [es.classify.bert_sequence.sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_beto_sentiment_analysis_es.html) | [bert_sequence_classifier_beto_sentiment_analysis](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_beto_sentiment_analysis_es.html) | Text Classification |
| es | [es.classify.bert_sequence.emotion](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_beto_emotion_analysis_es.html) | [bert_sequence_classifier_beto_emotion_analysis](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_beto_emotion_analysis_es.html) | Text Classification |
| xx | [xx.classify.bert_sequence.sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_multilingual_sentiment_xx.html) | [bert_sequence_classifier_multilingual_sentiment](https://nlp.johnsnowlabs.com/2021/11/03/bert_sequence_classifier_multilingual_sentiment_xx.html) | Text Classification |
| en | [en.classify.distilbert_sequence.sst2](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_sst2_en.html) | [distilbert_sequence_classifier_sst2](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_sst2_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.policy](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_policy_en.html) | [distilbert_sequence_classifier_policy](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_policy_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.industry](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_industry_en.html) | [distilbert_sequence_classifier_industry](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_industry_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.emotion](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_emotion_en.html) | [distilbert_sequence_classifier_emotion](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_emotion_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.banking77](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_banking77_en.html) | [distilbert_sequence_classifier_banking77](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_sequence_classifier_banking77_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.imdb](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_imdb_en.html) | [distilbert_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_imdb_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.amazon_polarity](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_amazon_polarity_en.html) | [distilbert_base_sequence_classifier_amazon_polarity](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_amazon_polarity_en.html) | Text Classification |
| en | [en.classify.distilbert_sequence.ag_news](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_ag_news_en.html) | [distilbert_base_sequence_classifier_ag_news](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_ag_news_en.html) | Text Classification |
| fr | [fr.classify.distilbert_sequence.allocine](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_multilingual_sequence_classifier_allocine_fr.html) | [distilbert_multilingual_sequence_classifier_allocine](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_multilingual_sequence_classifier_allocine_fr.html) | Text Classification |
| ur | [ur.classify.distilbert_sequence.imdb](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_imdb_ur.html) | [distilbert_base_sequence_classifier_imdb](https://nlp.johnsnowlabs.com/2021/11/21/distilbert_base_sequence_classifier_imdb_ur.html) | Text Classification |
| en | [en.embed_sentence.doc2vec](https://nlp.johnsnowlabs.com/2021/11/21/doc2vec_gigaword_300_en.html) | [doc2vec_gigaword_300](https://nlp.johnsnowlabs.com/2021/11/21/doc2vec_gigaword_300_en.html) | Embeddings |
| en | [en.embed_sentence.doc2vec.gigaword_300](https://nlp.johnsnowlabs.com/2021/11/21/doc2vec_gigaword_300_en.html) | [doc2vec_gigaword_300](https://nlp.johnsnowlabs.com/2021/11/21/doc2vec_gigaword_300_en.html) | Embeddings |
| en | [en.embed_sentence.doc2vec.gigaword_wiki_300](https://nlp.johnsnowlabs.com/2021/11/21/doc2vec_gigaword_wiki_300_en.html) | [doc2vec_gigaword_wiki_300](https://nlp.johnsnowlabs.com/2021/11/21/doc2vec_gigaword_wiki_300_en.html) | Embeddings |



New Healthcare models and Features
Integrates the incredible [Spark NLP for Healthcare](https://nlp.johnsnowlabs.com/docs/en/licensed_install) releases [3.3.4](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#334), [3.3.2](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#332) and [3.3.1](https://nlp.johnsnowlabs.com/docs/en/licensed_release_notes#331), featuring:
- New Clinical NER Models for protected health information(PHI),
- `ner_biomarker` for extracting extract biomarkers, therapies, oncological, and other general concepts
- Oncogenes, Tumor_Finding, UnspecificTherapy, Ethnicity, Age, ResponseToTreatment, Biomarker, HormonalTherapy, Staging, Drug, CancerDx, Radiotherapy, CancerSurgery, TargetedTherapy, PerformanceStatus, CancerModifier, Radiological_Test_Result, Biomarker_Measurement, Metastasis, Radiological_Test, Chemotherapy, Test, Dosage, Test_Result, Immunotherapy, Date, Gender, Prognostic_Biomarkers, Duration, Predictive_Biomarkers
- `ner_nihss` : NER model that can identify entities according to NIHSS guidelines for clinical stroke assessment to evaluate neurological status in acute stroke patients
- 11_ExtinctionInattention, 6b_RightLeg, 1c_LOCCommands, 10_Dysarthria, NIHSS, 5_Motor, 8_Sensory, 4_FacialPalsy, 6_Motor, 2_BestGaze, Measurement, 6a_LeftLeg, 5b_RightArm, 5a_LeftArm, 1b_LOCQuestions, 3_Visual, 9_BestLanguage, 7_LimbAtaxia, 1a_LOC .
- `redl_nihss_biobert` : relation extraction model that can relate scale items and their measurements according to NIHSS guidelines.
- `es.med_ner.roberta_ner_diag_proc` : New Spanish Clinical NER Models for extracting the entities DIAGNOSTICO, PROCEDIMIENTO
- `es.resolve.snomed`: New Spanish SNOMED Entity Resolvers
- `bert_sequence_classifier_question_statement_clinical`:New Clinical Question vs Statement for BertForSequenceClassification model
- `med_ner.covid_trials` : This model is trained to extract covid-specific medical entities in clinical trials. It supports the following entities ranging from virus type to trial design: Stage, Severity, Virus, Trial_Design, Trial_Phase, N_Patients, Institution, Statistical_Indicator, Section_Header, Cell_Type, Cellular_component, Viral_components, Physiological_reaction, Biological_molecules, Admission_Discharge, Age, BMI, Cerebrovascular_Disease, Date, Death_Entity, Diabetes, Disease_Syndrome_Disorder, Dosage, Drug_Ingredient, Employment, Frequency, Gender, Heart_Disease, Hypertension, Obesity, Pulse, Race_Ethnicity, Respiration, Route, Smoking, Time, Total_Cholesterol, Treatment, VS_Finding, Vaccine .
- `med_ner.chemd` : This model extract the names of chemical compounds and drugs in medical texts. The entities that can be detected are as follows : SYSTEMATIC, IDENTIFIERS, FORMULA, TRIVIAL, ABBREVIATION, FAMILY, MULTIPLE . For reference click here . https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4331685/
- `bert_token_classifier_ner_bionlp` : This model is BERT-based version of ner_bionlp model and can detect biological and genetics terms in cancer-related texts. (Amino_acid, Anatomical_system, Cancer, Cell, Cellular_component, Developing_anatomical_Structure, Gene_or_gene_product, Immaterial_anatomical_entity, Multi-tissue_structure, Organ, Organism, Organism_subdivision, Simple_chemical, Tissue
- `bert_token_classifier_ner_cellular` : This model is BERT-based version of ner_cellular model and can detect molecular biology-related terms (DNA, Cell_type, Cell_line, RNA, Protein) in medical texts.
- We have updated `med_ner.jsl.enriched` model by enriching the training data using clinical trials data to make it more robust. This model is capable of predicting up to 87 different entities and is based on ner_jsl model. Here are the entities this model can detect; Social_History_Header, Oncology_Therapy, Blood_Pressure, Respiration, Performance_Status, Family_History_Header, Dosage, Clinical_Dept, Diet, Procedure, HDL, Weight, Admission_Discharge, LDL, Kidney_Disease, Oncological, Route, Imaging_Technique, Puerperium, Overweight, Temperature, Diabetes, Vaccine, Age, Test_Result, Employment, Time, Obesity, EKG_Findings, Pregnancy, Communicable_Disease, BMI, Strength, Tumor_Finding, Section_Header, RelativeDate, ImagingFindings, Death_Entity, Date, Cerebrovascular_Disease, Treatment, Labour_Delivery, Pregnancy_Delivery_Puerperium, Direction, Internal_organ_or_component, Psychological_Condition, Form, Medical_Device, Test, Symptom, Disease_Syndrome_Disorder, Staging, Birth_Entity, Hyperlipidemia, O2_Saturation, Frequency, External_body_part_or_region, Drug_Ingredient, Vital_Signs_Header, Substance_Quantity, Race_Ethnicity, VS_Finding, Injury_or_Poisoning, Medical_History_Header, Alcohol, Triglycerides, Total_Cholesterol, Sexually_Active_or_Sexual_Orientation, Female_Reproductive_Status, Relationship_Status, Drug_BrandName, RelativeTime, Duration, Hypertension, Metastasis, Gender, Oxygen_Therapy, Pulse, Heart_Disease, Modifier, Allergen, Smoking, Substance, Cancer_Modifier, Fetus_NewBorn, Height
- `classify.bert_sequence.question_statement_clinical` : This model classifies sentences into one of these two classes: question (interrogative sentence) or statement (declarative sentence) and trained with BertForSequenceClassification. This model is at first trained on SQuAD and SPAADIA dataset and then fine tuned on the clinical visit documents and MIMIC-III dataset annotated in-house. Using this model, you can find the question statements and exclude & utilize in the downstream tasks such as NER and relation extraction models.
- `classify.token_bert.ner_chemical` : This model is BERT-based version of ner_chemicals model and can detect chemical compounds (CHEM) in the medical texts.
- `resolve.umls_disease_syndrome` : This model is trained on the Disease or Syndrome category using sbiobert_base_cased_mli embeddings.

Complete List of Healthcare Models :

| Language | NLU Reference | Spark NLP Reference | Task |
|:-----------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------|
| en | [en.med_ner.deid_subentity_augmented_i2b2](https://nlp.johnsnowlabs.com/2021/11/29/ner_deid_subentity_augmented_i2b2_en.html) | [ner_deid_subentity_augmented_i2b2](https://nlp.johnsnowlabs.com/2021/11/29/ner_deid_subentity_augmented_i2b2_en.html) | Named Entity Recognition |
| en | [en.med_ner.biomarker](https://nlp.johnsnowlabs.com/2021/11/26/ner_biomarker_en.html) | [ner_biomarker](https://nlp.johnsnowlabs.com/2021/11/26/ner_biomarker_en.html) | Named Entity Recognition |
| en | [en.med_ner.nihss](https://nlp.johnsnowlabs.com/2021/11/15/ner_nihss_en.html) | [ner_nihss](https://nlp.johnsnowlabs.com/2021/11/15/ner_nihss_en.html) | Named Entity Recognition |
| en | [en.extract_relation.nihss](https://nlp.johnsnowlabs.com/2021/11/16/redl_nihss_biobert_en.html) | [redl_nihss_biobert](https://nlp.johnsnowlabs.com/2021/11/16/redl_nihss_biobert_en.html) | Relation Extraction |
| en | [en.resolve.mesh](https://nlp.johnsnowlabs.com/2021/11/14/sbiobertresolve_mesh_en.html) | [sbiobertresolve_mesh](https://nlp.johnsnowlabs.com/2021/11/14/sbiobertresolve_mesh_en.html) | Entity Resolution |
| en | [en.resolve.mli](https://nlp.johnsnowlabs.com/2020/11/27/sbiobert_base_cased_mli_en.html) | [sbiobert_base_cased_mli](https://nlp.johnsnowlabs.com/2020/11/27/sbiobert_base_cased_mli_en.html) | Embeddings |
| en | [en.resolve.ndc](https://nlp.johnsnowlabs.com/2021/11/27/sbiobertresolve_ndc_en.html) | [sbiobertresolve_ndc](https://nlp.johnsnowlabs.com/2021/11/27/sbiobertresolve_ndc_en.html) | Entity Resolution |
| en | [en.resolve.loinc.augmented](https://nlp.johnsnowlabs.com/2021/11/23/sbiobertresolve_loinc_augmented_en.html) | [sbiobertresolve_loinc_augmented](https://nlp.johnsnowlabs.com/2021/11/23/sbiobertresolve_loinc_augmented_en.html) | Entity Resolution |
| en | [en.resolve.clinical_snomed_procedures_measurements](https://nlp.johnsnowlabs.com/2021/11/15/sbiobertresolve_clinical_snomed_procedures_measurements_en.html) | [sbiobertresolve_clinical_snomed_procedures_measurements](https://nlp.johnsnowlabs.com/2021/11/15/sbiobertresolve_clinical_snomed_procedures_measurements_en.html) | Entity Resolution |
| es | [es.embed.roberta_base_biomedical](https://nlp.johnsnowlabs.com/2021/11/01/roberta_base_biomedical_es.html) | [roberta_base_biomedical](https://nlp.johnsnowlabs.com/2021/11/01/roberta_base_biomedical_es.html) | Embeddings |
| es | [es.med_ner.roberta_ner_diag_proc](https://nlp.johnsnowlabs.com/2021/11/04/roberta_ner_diag_proc_es.html) | [roberta_ner_diag_proc](https://nlp.johnsnowlabs.com/2021/11/04/roberta_ner_diag_proc_es.html) | Named Entity Recognition |
| es | [es.resolve.snomed](https://nlp.johnsnowlabs.com/2021/11/03/robertaresolve_snomed_es.html) | [robertaresolve_snomed](https://nlp.johnsnowlabs.com/2021/11/03/robertaresolve_snomed_es.html) | Entity Resolution |
| en | [en.med_ner.covid_trials](https://nlp.johnsnowlabs.com/2021/11/05/ner_covid_trials_en.html) | [ner_covid_trials](https://nlp.johnsnowlabs.com/2021/11/05/ner_covid_trials_en.html) | Named Entity Recognition |
| en | [en.classify.token_bert.bionlp](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_bionlp_en.html) | [bert_token_classifier_ner_bionlp](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_bionlp_en.html) | Named Entity Recognition |
| en | [en.classify.token_bert.cellular](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_cellular_en.html) | [bert_token_classifier_ner_cellular](https://nlp.johnsnowlabs.com/2021/11/03/bert_token_classifier_ner_cellular_en.html) | Named Entity Recognition |
| en | [en.classify.token_bert.chemicals](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemicals_en.html) | [bert_token_classifier_ner_chemicals](https://nlp.johnsnowlabs.com/2021/10/19/bert_token_classifier_ner_chemicals_en.html) | Named Entity Recognition |
| en | [en.resolve.rxnorm_augmented](https://nlp.johnsnowlabs.com/2021/10/29/sbiobertresolve_rxnorm_augmented_en.html) | [sbiobertresolve_rxnorm_augmented](https://nlp.johnsnowlabs.com/2021/10/29/sbiobertresolve_rxnorm_augmented_en.html) | Entity Resolution |
| en | [en.resolve.rxnorm_augmented](https://nlp.johnsnowlabs.com/2021/11/04/sbiobertresolve_rxnorm_augmented_en.html) | [sbiobertresolve_rxnorm_augmented](https://nlp.johnsnowlabs.com/2021/11/04/sbiobertresolve_rxnorm_augmented_en.html) | Entity Resolution |
| en | [en.resolve.rxnorm_augmented](https://nlp.johnsnowlabs.com/2021/11/09/sbiobertresolve_rxnorm_augmented_en.html) | [sbiobertresolve_rxnorm_augmented](https://nlp.johnsnowlabs.com/2021/11/09/sbiobertresolve_rxnorm_augmented_en.html) | Entity Resolution |
| en | [en.resolve.umls_disease_syndrome](https://nlp.johnsnowlabs.com/2021/10/11/sbiobertresolve_umls_disease_syndrome_en.html) | [sbiobertresolve_umls_disease_syndrome](https://nlp.johnsnowlabs.com/2021/10/11/sbiobertresolve_umls_disease_syndrome_en.html) | Entity Resolution |
| en | [en.resolve.umls_clinical_drugs](https://nlp.johnsnowlabs.com/2021/10/11/sbiobertresolve_umls_clinical_drugs_en.html) | [sbiobertresolve_umls_clinical_drugs](https://nlp.johnsnowlabs.com/2021/10/11/sbiobertresolve_umls_clinical_drugs_en.html) | Entity Resolution |
| en | [en.classify.bert_sequence.question_statement_clinical](https://nlp.johnsnowlabs.com/2021/11/05/bert_sequence_classifier_question_statement_clinical_en.html) | [bert_sequence_classifier_question_statement_clinical](https://nlp.johnsnowlabs.com/2021/11/05/bert_sequence_classifier_question_statement_clinical_en.html) | Text Classification |

Page 5 of 12

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.