Questions tagged [spacy]

Industrial strength Natural Language Processing (NLP) with Python and Cython

85 questions
7
votes
1 answer

Address parsing using spaCy

I am trying to parse addresses from various documents using spaCy using NER but the results are not so accurate. I know this is bit generic question but it would be a great help if I could get reference of any past work or good articles or…
SKB
  • 574
  • 5
  • 18
7
votes
1 answer

Meaning of NER Training values using Spacy

I am trying to train custom entities using Spacy. During the training process I am getting number of values of LOSS, score etc. What is the meaning of these values ============================= Training pipeline ============================= ℹ…
Aniiya0978
  • 193
  • 1
  • 4
6
votes
3 answers

Chunking Sentences with Spacy

I have a lot of sentences (500k) which looks like this: "Penalty missed! Bad penalty by Felipe Brisola - Riga FC - shot with right foot is very close to the goal. Felipe Brisola should be disappointed." "Penalty saved! Damir Kojasevic - Sutjeska…
senty
  • 163
  • 3
6
votes
1 answer

Is there any way to define custom entities in Spacy

1) I have just started working on NLP the basic Idea is to extract meaningful information from text. For this I am using "Spacy". As far as I have studied Spacy has following entities. ORG PERSON DATE MONEY CARDINAL etc. But I want to add custom…
AddyProg
  • 163
  • 1
  • 6
5
votes
2 answers

Converting paragraphs into sentences

I'm looking for ways to extract sentences from paragraphs of text containing different types of punctuations and all. I used SpaCy's Sentencizer to begin with. Sample input python list abstracts: ["A total of 2337 articles were found, and, according…
Van Peer
  • 285
  • 1
  • 4
  • 12
5
votes
1 answer

How to train a spacy model for text classification?

Can i know the way or steps to train a spacy model for text classification. (binary classification in my case) Please help me with the process and way to approach.
5
votes
1 answer

Spacy custom POS tagging for medical concepts

We are a group of doctors trying to use linguistic features of "Spacy", especially the part of speech tagging to show relationships between medical concepts like: 'Femoral artery pseudoaneurysm as in ==> "femoral artery" ['Anatomical Location'] …
4
votes
2 answers

Does spaCy support multiple GPUs?

I was wondering if spaCy supports multi-GPU via mpi4py? I am currently using spaCy's nlp.pipe for Named Entity Recognition on a high-performance-computing cluster that supports the MPI protocol and has many GPUs. It says here that I would need to…
Jinhua Wang
  • 163
  • 8
4
votes
2 answers

How to make the spacy 3.0 custom NER model training to optimize on precision rather than recall?

My current custom NER model is good on recall but I want to focus on improving precision, is it possible to change the optimizer metric in config file?
3
votes
1 answer

Testing Spacy NER model

I've trained an NER model with the use of Spacy, and I would like to test the accuracy on a test dataset. What would be the best way to perform this?
Adnos
  • 81
  • 4
3
votes
0 answers

Medical NER for French language

I'm currently exploring the options to extract medical NER specifically for French language. I tried SpaCy's general French NER but it wasn't helpful to the cause (mainly because of the domain-specific requirements). I assume we cannot use Med7 or…
Van Peer
  • 285
  • 1
  • 4
  • 12
3
votes
2 answers

Is NLP suitable for my legal contract parsing problem?

My company has a product that involves the extraction of a variety of fields from legal contract PDFs. The current approach is very time consuming and messy, and I am exploring if NLP is a suitable alternative. The PDFs that need to be parsed…
Posionus
  • 31
  • 1
3
votes
1 answer

Smart sentence segmentation not splitting on abbreviations

Sentencer from SpaCy and NLTK does not catch the fact that typical abbreviations (e.g. Mio. for Million in German) and the resulting sentence split is not correct. I understand that sentencers are supposed to be simple and quick but I am wondering…
sophros
  • 209
  • 2
  • 12
3
votes
1 answer

Character-level embeddings in python

I'm working on an NLP task that requires the use of character level embeddings, and I've been trying to use Spacy. However, it seems that spacy uses word-level embeddings for the word vectors, and I need character-level embeddings. The only…
rmaguiar
  • 163
  • 2
  • 7
3
votes
0 answers

Change the way spacy works - Custom properties for training and prediction

Spacy detects the entities using its predefined algorithm. It parses tokens in text considering position of tokens with respect to tokens surrounding it. It also takes into consideration the POS tagging for these tokens. However, I believe it…
Sandeep Bhutani
  • 914
  • 1
  • 7
  • 26
1
2 3 4 5 6