Questions tagged [snorkel]

11 questions
5 votes, 0 answers

Giving higher weight to a labeling function in Snorkel

I am using Snorkel to create labels for my training data. I currently have five labeling functions for the task, which I have stored in a list. I am using the following code to apply the labeling functions: lfs = [lf_a, lf_b, lf_c, lf_d, lf_e] applier…
1 vote, 1 answer

Snorkel: write several labelling functions automatically

My goal is to create N labelling functions (LFs in the snorkel package) in a more elegant way than writing them one by one. Since I expect to have many more individual regexes to reference, I would like to find a way to do this more automatically.…
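One common way to approach this kind of request is a factory function that closes over each regex, so every generated function keeps its own pattern. The following is only a minimal sketch in plain Python, not the asker's code or an accepted answer: `make_regex_lf`, `PATTERNS`, and the label values are hypothetical names chosen for illustration, and the data point is assumed to expose a `text` attribute.

```python
import re
from types import SimpleNamespace

ABSTAIN = -1  # Snorkel's usual convention for "no vote"

# Hypothetical mapping of regex -> label, invented for this sketch.
PATTERNS = {
    r"\bfree\b": 1,
    r"\bsubscribe\b": 1,
    r"\bthanks\b": 0,
}


def make_regex_lf(name, pattern, label):
    """Build one labeling-style function bound to a single regex."""
    compiled = re.compile(pattern, flags=re.IGNORECASE)

    def lf(x):
        # Vote `label` on a match, otherwise abstain.
        return label if compiled.search(x.text) else ABSTAIN

    lf.__name__ = name  # give each generated function a distinct name
    return lf


# Generate all functions in one pass instead of writing them by hand.
lfs = [
    make_regex_lf(f"lf_regex_{i}", pattern, label)
    for i, (pattern, label) in enumerate(PATTERNS.items())
]

row = SimpleNamespace(text="Subscribe for FREE stuff")
votes = [lf(row) for lf in lfs]
print(votes)  # → [1, 1, -1]
```

Going through a factory, rather than defining the inner function directly inside the loop, avoids Python's late-binding pitfall where every generated function would end up using the last pattern. With Snorkel installed, each generated function could presumably be wrapped in `snorkel.labeling.LabelingFunction(name=..., f=...)` before being passed to an applier.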
1 vote, 0 answers

conda says all requested packages are already installed, but the package is missing from conda list

My device is an M1 Mac. I want to install the Snorkel package; however, conda says all requested packages are already installed, but I can't find it in conda list and PyCharm can't import it either. Here is the log: sudo conda install snorkel Collecting package…
Allonsy Jia
1 vote, 0 answers

How to create n number of python functions based on user input?

Essentially, I'm looking to build a web app where the user can input n labels for a dataset and put them into a dictionary with keywords for each label. I'd like the same function to be created for each of the n labels, something like: # labeling…
1 vote, 0 answers

Problem with abstain class value and received result in Snorkel labeling package

I am using the Snorkel labeling package to programmatically label my unlabeled training data. I followed this link, https://www.snorkel.org/use-cases/01-spam-tutorial, where you write several labeling functions like: from textblob import…
Mahsa
1 vote, 1 answer

I'm using Dask to apply LabelingFunction using Snorkel on multiple datasets but it seems to take forever. Is this normal?

My problem is as follows: I have several datasets (900K, 1.7M, and 1.7M entries) in CSV format, which I load into multiple Dask DataFrames. Then I concatenate them all into one Dask DataFrame that I can feed to my Snorkel applier, which applies a bunch of…
1 vote, 0 answers

Efficiently labeling training data for spaCy's custom dependency parser

I am looking into utilizing spaCy's custom dependency parser (https://spacy.io/usage/training#tagger-parser - "Training a parser for custom semantics"). It seems as if you would have to manually label the heads and dependencies. Does anyone know of…
formicaman
1 vote, 2 answers

Snorkel: Can I use different features in a dataset for generating labelling functions vs. training a classifier?

I have a set of features to build labelling functions (set A) and another set of features to train a sklearn classifier (set B). The generative model will output a set of probabilistic labels which I can use to train my classifier. Do I need to add…
jxn
0 votes, 0 answers

TypeError: 'str' object does not support item assignment - when dynamically assigning names in dictionary

I have a list of numbers from which I create dictionaries and then assign results to them. The list looks like [243, 244, 445, 446], and the dictionaries are named _243, _244, _445, _446. I am assigning keys to each dictionary based on its name. For…
Shantanu Nandan
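The excerpt above is truncated, so the asker's actual code is unknown. As a hedged illustration only, this error typically appears when a variable holds the dictionary's name (a string) rather than the dictionary itself. The sketch below reproduces the error and shows one possible alternative: a single outer dictionary keyed by the generated names. The names `results` and `"value"`, and the doubling step, are hypothetical.

```python
numbers = [243, 244, 445, 446]

# Reproduce the error: a string that merely *names* a dictionary
# cannot be assigned into with [...] = ...
name = "_243"
try:
    name["value"] = 1
except TypeError as exc:
    error_message = str(exc)
print(error_message)  # → 'str' object does not support item assignment

# Possible alternative: keep one outer dict keyed by the generated names,
# with a real (inner) dict stored under each key.
results = {f"_{n}": {} for n in numbers}
for n in numbers:
    results[f"_{n}"]["value"] = n * 2  # placeholder computation

print(results["_243"])  # → {'value': 486}
```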
0 votes, 1 answer

What if my Snorkel labeling function has a very low coverage over a development set?

I am trying to label a span recognition dataset using Snorkel and am currently at the stage of improving labeling functions. One of the LFs has rather low coverage because it only labels a subclass of one of the entity spans. What would be the…
PinkBanter
0 votes, 2 answers

ModuleNotFoundError: No module named 'snorkel.labeling'

I installed Snorkel using conda, and when I try to run from snorkel.labeling import labeling_function, it throws the following error: ModuleNotFoundError: No module named 'snorkel.labeling'. I tried looking for a solution on GitHub, but…