Questions tagged [spacy]

Industrial strength Natural Language Processing (NLP) with Python and Cython

spaCy is a library for advanced Natural Language Processing in Python and Cython. Its features include tokenization, part-of-speech tagging, dependency parsing, sentence boundary detection, named entity recognition and training of statistical neural network models.



3742 questions
1
vote
1 answer

Get entity from dataframe and save in other column

I have a DataFrame with 2 columns: Column 1 Column 2 Text1 Text1 Text2 Text2 Text3 Text3. I need to add one column with the entities from column 1 and a second with the entities from column 2. For example: Column 1 Column 2 Entity…
Michał
  • 21
  • 3
1
vote
1 answer

OSError: E053 Could not read config.cfg Spacy on colab

I want to use SpacyTextBlob in Google Colab. When I follow the official installation steps, I get the error below: OSError: [E053] Could not read config.cfg from /usr/local/lib/python3.7/dist-packages/en_core_web_sm/en_core_web_sm-2.2.5/config.cfg What I do,…
zana saedpanah
  • 324
  • 3
  • 12
1
vote
0 answers

UserWarning: floor_divide is deprecated, and will be removed in a future version of pytorch

I am getting this error and it is preventing my code from running. I tried to filter the warning, but my code still stops. After many hours I still have not figured out how to overcome it. Where current connected sportswear…
kely789456123
  • 605
  • 1
  • 6
  • 21
1
vote
0 answers

Spacy download on Apple M1 chip

I'm on a MacBook Pro with an M1 Pro chip, and I've built a Docker image to install Tensorflow, spacy, etc. My Dockerfile: FROM --platform=linux/x86_64 python:3.8 ENV PYTHONUNBUFFERED 1 RUN apt-get update && apt-get -y upgrade RUN pip install --upgrade…
marcelps
  • 309
  • 2
  • 16
1
vote
2 answers

Unable to install spacy 3.2.1 in windows 10 pro PC, with python 3.9.0 installed

I have a Windows 10, 64-bit system that cannot be connected to the internet for security reasons. I have downloaded spaCy 3.2.1 with all its dependencies, but while installing spaCy, it shows the error "this is not a supported wheel for this platform"…
1
vote
0 answers

How to change the format of json to spacy/custom json format in python?

I have JSON generated by the docanno annotation tool, and I want to convert it into another format. The formats are shown below. Docanno json format : {"id": 2, "data": "My name is Nithin Reddy and i'm working as a Data…
Nithin Reddy
  • 580
  • 2
  • 8
  • 18
1
vote
1 answer

How to import Spacy to run with GCP Dataflow?

I would like to run spaCy lemmatization on a column within a ParDo on GCP Dataflow. My Dataflow project is composed of 3 files: main.py, which contains the script; myfile.json, which contains the service account key; and setup.py, which…
FairPluto
  • 697
  • 6
  • 28
1
vote
1 answer

Most efficient way to run spacy lemmatizer with Dataflow

I am trying to process data coming from BigQuery. I created a pipeline with Apache Beam as below: nlp = fr_core_news_lg.load() class CleanText(beam.DoFn): def process(self, row): row['descriptioncleaned'] = '…
1
vote
1 answer

How can I use EntityRuler for the Persian (Farsi) language?

I want to create an NER component to get the entities of a Persian (Farsi) sentence, but spaCy doesn't have the Persian language. The sentence looks like this: en --> 'I bought 5 apples from Richard for $ 45' fa --> 'من ۵ تا سیب از ریچارد خریدم به قیمت ۴۵…
1
vote
1 answer

How to transform character indices to SpaCy token indices?

I am using spaCy to find patterns in texts. For some patterns, such as single words, this is straightforward, and I am happy with the results. For example, import re import spacy from spacy.matcher import Matcher nlp =…
Bart
  • 123
  • 3
1
vote
1 answer

spacy regex with japanese characters

I need help with regex in spaCy for Japanese. I have this text: 道が凍っているから気を付けなさい。 I need to match every word up to the character "を", so essentially I need to get "道が凍っているから気を". I tried this code: nlp…
Laz22434
  • 373
  • 1
  • 12
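The target string in the Japanese question above can be illustrated with Python's standard `re` module alone. This is a hedged sketch of the character-level idea, not the spaCy `Matcher` approach the asker was attempting, and the variable names are hypothetical.

```python
import re

text = "道が凍っているから気を付けなさい。"

# Non-greedy match from the start of the string up to and including the first "を".
match = re.match(r".*?を", text)
prefix = match.group(0) if match else None
print(prefix)  # 道が凍っているから気を
```

This works on raw characters, so it ignores token boundaries. A token-level pattern would depend on how the Japanese tokenizer splits the sentence.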
1
vote
2 answers

Search for particular parts of speech (e.g. nouns) and print them along with a preceding word

I have a text which is made up of a list of basic sentences, such as "she is a doctor", "he is a good person", and so forth. I'm trying to write a program which will return only the nouns and the preceding pronoun (e.g. she, he, it). I need them to…
beatrixx
  • 13
  • 3
1
vote
1 answer

OSError: [E053] Could not read config.cfg from C:\Users

I'm trying to run spaCy's lemmatizer on a text by running the command nlp = spacy.load("en_core_web_sm", disable=["parser", "ner"]), but then I get the following error: OSError: [E053] Could not read config.cfg from C:\Users. I'm using spaCy version…
Mattis
  • 29
  • 2
1
vote
2 answers

Spacy matcher pattern with specific nouns

I'm trying to match a specific pattern: any verb with a noun ending with s, t or l. E.g.: Like cat, Eat meal, Make Spices. How can I do this? I know I was doing this: nlp =spacy.load("en_core_web_sm") matcher = Matcher(nlp.vocable) pattern =…
Laz22434
  • 373
  • 1
  • 12
1
vote
1 answer

Spacy.io DependencyMatcher Isn't Grouping MatchIDs

I have been working with the Spacy.io DependencyMatcher and I find it very powerful. But I have a question that I couldn't answer from the documentation. The match results are a list of tuples for the same MatchID instead of getting one tuple…
eboraks
  • 167
  • 1
  • 9