Questions tagged [fst]

FST (or OpenFst) is a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs).
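
For readers new to the tag, here is a minimal sketch of what a weighted finite-state transducer does. It is plain Python and does not use the OpenFst API. The class `ToyFst` and its methods are hypothetical names made up for illustration. It assumes a deterministic machine whose arc weights are added along the path.

```python
from typing import Dict, List, Optional, Set, Tuple


class ToyFst:
    """Hypothetical toy transducer: deterministic, weights add along a path."""

    def __init__(self, start: int, finals: Set[int]) -> None:
        self.start = start
        self.finals = finals
        # (source state, input symbol) -> (output symbol, weight, destination state)
        self.arcs: Dict[Tuple[int, str], Tuple[str, float, int]] = {}

    def add_arc(self, src: int, isym: str, osym: str, weight: float, dst: int) -> None:
        self.arcs[(src, isym)] = (osym, weight, dst)

    def transduce(self, inputs: List[str]) -> Optional[Tuple[List[str], float]]:
        """Return (output symbols, total weight), or None if the input is rejected."""
        state = self.start
        outputs: List[str] = []
        total = 0.0
        for sym in inputs:
            arc = self.arcs.get((state, sym))
            if arc is None:
                return None  # no matching arc from this state
            osym, weight, state = arc
            outputs.append(osym)
            total += weight
        # Accept only if the path ends in a final state.
        return (outputs, total) if state in self.finals else None


toy = ToyFst(start=0, finals={2})
toy.add_arc(0, "a", "x", 0.5, 1)
toy.add_arc(1, "b", "y", 1.5, 2)

result = toy.transduce(["a", "b"])   # full path 0 -> 1 -> 2
rejected = toy.transduce(["a"])      # stops in non-final state 1
```

Real toolkits such as OpenFst also handle non-determinism, epsilon transitions, and other weight semirings, none of which this sketch covers.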

36 questions
1
vote
0 answers

Processing genind object with hierfstat, when NA in @pop

Does anyone know how to get the "basic.stats" (hierfstat), "wc" (hierfstat), and/or other hierfstat commands to work on a genind object that has NA in the @pop section? I can convert the genind to hierfstat, but the other commands are not happy with…
Neeraij
  • 11
  • 1
1
vote
1 answer

Finite state machine visualizer

I need an application that prints or visualizes input/output pairs while an FST runs. That is, for each state of the FST, it should print a tuple containing the input and the output for that state. Right now I can generate fst files that…
zwlayer
  • 1,752
  • 1
  • 18
  • 41
1
vote
2 answers

How to get input symbols from an FST model using the OpenFST Python extension?

OpenFST provides a Python extension. Is it possible to read input symbols (isyms) from a compiled FST model using that API? I can't find the right property or method to do that: >>> import fst >>> f =…
Jan Holthuis
  • 149
  • 10
0
votes
0 answers

Pass string into map_dfr to filter using tidyfst

I'm reading data stored in several fst-files using tidyfst and purrr::map_dfr. The following example works as expected. library(tidyverse) library(fst) library(tidyfst) # get all fst files in current directory files_fst <-…
mgrund
  • 1,415
  • 8
  • 10
0
votes
0 answers

read.fst() crashes R: workaround needed to detect corrupted file before read.fst()

This issue is already open on fstpackage's GitHub, but it seems the package author is no longer actively maintaining it. Meanwhile, I need a workaround for this crash, which is repeatable and occurs regularly but on a small subset of my…
Lazarus Thurston
  • 1,197
  • 15
  • 33
0
votes
0 answers

Union of two FSAs with different symbol lists

I have two finite-state acceptors, a.fst and b.fst. They are both unweighted, simply encoding sequences of words. Their symbol (word) lists have some overlap, but are not identical. Now I want the union of these two FSTs. I think some kind of symbol…
Jiaji Huang
  • 311
  • 4
  • 14
0
votes
1 answer

How to understand this formula in Direct Construction of Minimal Acyclic Subsequential Transducers?

In this paper "Direct Construction of Minimal Acyclic Subsequential Transducers" https://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=690EBF51DD6D52AB9160723AAE3A9541?doi=10.1.1.24.3698&rep=rep1&type=pdf in Definition 3, what does !(u(r,a))…
Chen Li
  • 320
  • 1
  • 10
0
votes
1 answer

Saving and reading lubridate intervals to/from disk

I am having problems recovering lubridate intervals when reading back from csv and fst formats. Does anyone have a suggestion for how to do this? library(tidyverse) library(fst) library(lubridate) test <- tibble( start = ymd_hms("2020-01-01…
evidently
  • 43
  • 4
0
votes
2 answers

There is a set of pairs (set_pairs) and I want to create a set (set_fsts) of first elements of those pairs. How can I do this in Isabelle?

There is a set of pairs (set_pairs) and I want to create a set (set_fsts) of first elements of those pairs. I am writing it in the following way: definition "set_fsts = {f . p ∈ set_piars ∧ fst p = f}" but Isabelle shows me this error message: Extra…
S2020
  • 1
  • 1
0
votes
1 answer

Fastest way to search for a long list of patterns in a text

Given a "large" list of patterns and a "short" text, what is the best/fastest way to search/tag those patterns in the text, where we are trying to find the pattern as a substring of the text? If there are multiple matches of a pattern in a text, we…
Satarupa Guha
  • 1,267
  • 13
  • 20
0
votes
0 answers

Is there any downside/risk to using fst to compress a serialized list?

The fst package http://www.fstpackage.org/fst/ offers multithreaded compression and reading and writing for data frames. I'm running Bayesian models with brms that are large and slow. I want to save the results to disk for future re-use. Using…
Joshua
  • 686
  • 3
  • 7
0
votes
1 answer

Migrate Lucene FST files from 5.1.0 to 8.9.0

I have files with FSTs created with Lucene 5.1.0. After upgrading to Lucene 8.9.0, I get an exception when I try to read an FST from file: org.apache.lucene.index.IndexFormatTooOldException: Format version is not supported (resource…
Egor
  • 1,334
  • 8
  • 22
0
votes
1 answer

Kaldi error while running script prepare_lang.sh -- GenericRegister::GetEntry: No such file or directory

I am using Kaldi to train an acoustic model on my own dataset (following this tutorial by Eleanor Chodroff). I'm using Cygwin on Windows 10. When I run the script prepare_lang.sh that should create the files for /data/lang, it generates the…
disha
  • 1
  • 2
0
votes
1 answer

OpenFST - creating FSTs from a list of words

I'm reading the top example: http://www.openfst.org/twiki/bin/view/FST/FstExamples about tokenization. In the example, they create three fsts: Mars.fst, Martian.fst, and man.fst, and manually run some fst commands to merge them into one big…
granduser
  • 67
  • 7
0
votes
0 answers

Summary statistics on out-of-memory file

I have a 120 GB csv file containing numerical values grouped by categorical variables, e.g. df<-as.data.frame(x=rbing(rep("BLO",100),rep("LR",100)), y=runif(200)) I would like to calculate some summary statistics using…
HCAI
  • 2,213
  • 8
  • 33
  • 65