Questions tagged [presidio]

7 questions
0
votes
0 answers

Does Presidio Framework supports Parallel Processing?

I am using presidio framework for PII detection along with bert PII models. I tried with ray but, I am not able to achieve parallel processing. Does presidio is compatible for Parallel processing I tried Ray. When I give multiple sentences for…
0
votes
0 answers

Add new entity recognizer in Presidio

I'm testing a data anonymizer that I found on github. But I want to add a new entity, because the author of the repository only uses ["PERSON", "EMAIL_ADDRESS", "LOCATION", "PHONE_NUMBER"] as entities. So based on the presidio documentation I add…
suribe06
  • 69
  • 8
0
votes
2 answers

Azure DataBricks ImportError: cannot import name dataclass_transform

I have a python notebook running the following imports on a DataBricks cluster %pip install presidio_analyzer %pip install presidio_anonymizer import spacy.cli spacy.cli.download("en_core_web_lg") nlp = spacy.load("en_core_web_lg") import csv import…
RAH
  • 395
  • 2
  • 9
0
votes
1 answer

How to incorporate flair model into presidio framework?

I was looking for some sample code/tutorial how to implement flair based model within the presidio framework and I found this https://github.com/microsoft/presidio/blob/main/docs/samples/python/flair_recognizer.py, but actually, it is not wroking.…
0
votes
1 answer

Microsoft Presidio support for spark using scala

Is there a way to use Microsoft Presidio for pii data encryption in Scala Spark? Is it only possible using pySpark?
0
votes
1 answer

Why does Presidio StanzaNlpEngine throw NameError: name 'StanzaLanguage' is not defined?

I'm trying to use Stanza language models with Presidio and running into this blocker. import stanza stanza.download("en") from presidio_analyzer.nlp_engine import StanzaNlpEngine StanzaNlpEngine(models={"en": "en"}) Above throws... Traceback (most…
conner.xyz
  • 6,273
  • 8
  • 39
  • 65
-1
votes
0 answers

Difficulty having Presidio library in Python recognize addresses

I'm having a hard time anonymizing PII for a project I am working on using Presidio. For example, when I am trying to clean the data and I give in an address (i.e 123 Sesame Street, Los Angeles, California) it will give me back 123 Sesame Street,…