FST (or OpenFst
) is a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs).
Questions tagged [fst]
36 questions
1
vote
0 answers
Processing genind object with hierfstat, when NA in @pop
Does anyone know how to get the "basic.stats" (hierfstat), "wc" (hierfstat), and/or other hierfstat commands to work on a genind object that has NA in the @pop section? I can convert the genind to hierfstat, but the other commands are not happy with…

Neeraij
- 11
- 1
1
vote
1 answer
Finete State machine visualizer
I need an application that prints/visualizes input/output pairs during the FST runs. I mean, for each state of the fst, it needs to print out a tuple that contains input for that state and output of the state. Right now I can generate fst files that…

zwlayer
- 1,752
- 1
- 18
- 41
1
vote
2 answers
How to get input symbols from a fst model using the OpenFST python extension?
OpenFST provides a python extension. Is ist possible to read input symbols (isyms) from a compiled FST model using that API?
I can't find the right property or method to do that:
>>> import fst
>>> f =…

Jan Holthuis
- 149
- 10
0
votes
0 answers
Pass string into map_dfr to filter using tidyfst
I'm reading data stored in several fst-files using tidyfst and purrr::map_dfr. The following example works as expected.
library(tidyverse)
library(fst)
library(tidyfst)
# get all fst files in current directory
files_fst <-…

mgrund
- 1,415
- 8
- 10
0
votes
0 answers
read.fst() crashes R : workaround needed to detect corrupted file before read.fst()
This is an issue already opened on fstpackage's github, but it seems the package author is no longer actively maintaining it.
Meanwhile, I need a workaround to this crash problem, which is repeatable and occurs regularly but on a small subset of my…

Lazarus Thurston
- 1,197
- 15
- 33
0
votes
0 answers
union two FSA's with different symbol lists
I have two finite state acceptors, a.fst and b.fst. They are both unweighted, simply encoding sequences of words. Their symbol (word) lists have some overlap, but are not identical. Now I want a union of these two fst's.
I think some kind of symbol…

Jiaji Huang
- 311
- 4
- 14
0
votes
1 answer
How to understand this formula in Direct Construction of Minimal Acyclic Subsequential Transducers?
In this paper "Direct Construction of Minimal Acyclic Subsequential Transducers" https://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=690EBF51DD6D52AB9160723AAE3A9541?doi=10.1.1.24.3698&rep=rep1&type=pdf
in Definition 3, what does !(u(r,a))…

Chen Li
- 320
- 1
- 10
0
votes
1 answer
Saving and readings lubridate intervals to/from disk
I am having problems recovering lubridate::intervals when reading back from csv, and fst formats.
Does anyone have a suggestion for how to do this?
library(tidyverse)
library(fst)
library(lubridate)
test <- tibble(
start = ymd_hms("2020-01-01…

evidently
- 43
- 4
0
votes
2 answers
There is a set of pairs (set_pairs) and I want to create a set (set_fsts) of first elements of those pairs. How can I do this in Isabelle?
There is a set of pairs (set_pairs) and I want to create a set (set_fsts) of first elements of those pairs. I am writing it in the following way
definition "set_fsts = {f . p ∈ set_piars ∧ fst p = f}"
but Isabelle shows me this err msg: Extra…

S2020
- 1
- 1
0
votes
1 answer
Fastest way to search for a long list of patterns in a text
Given a "large" list of patterns and a "short" text, what is the best/fastest way to search/tag those patterns in the text, where we are trying to find the pattern as a substring of the text? If there are multiple matches of a pattern in a text, we…

Satarupa Guha
- 1,267
- 13
- 20
0
votes
0 answers
Is there any downside/risk to using fst to compress a serialized list?
The fst package http://www.fstpackage.org/fst/ offers multithreaded compression and reading and writing for data frames.
I'm running Bayesian models with brms that are large and slow. I want to save the results to disk for future re-use. Using…

Joshua
- 686
- 3
- 7
0
votes
1 answer
Migrate lucene FST files from 5.1.0 to 8.9.0
I have files with FST's created with lucene 5.1.0.
After upgrading to lucene 8.9.0 I get exception when I am trying to read FST from file:
org.apache.lucene.index.IndexFormatTooOldException: Format version is not supported (resource…

Egor
- 1,334
- 8
- 22
0
votes
1 answer
Kaldi error while running script prepare_lang.sh -- GenericRegister::GetEntry: No such file or directory
I am using Kaldi to train an acoustic model on my own dataset (following this tutorial by Eleanor Chodroff). I'm using Cygwin on Windows 10. When I run the script prepare_lang.sh that should create the files for /data/lang, it generates the…

disha
- 1
- 2
0
votes
1 answer
OpenFST - creating FST's from list of words
I'm reading the top example: http://www.openfst.org/twiki/bin/view/FST/FstExamples about tokenization.
In the example, they create three fsts: Mars.fst, Martian.fst, and man.fst, and manually run some fst commands to merge them into one big…

granduser
- 67
- 7
0
votes
0 answers
Summary statistics on out-of-memory file
I have a csv file that's 120GB in size which is a set of numerical values grouped by categorical variables.
eg.
df<-as.data.frame(x=rbing(rep("BLO",100),rep("LR",100)), y=runif(200))
I would like to calculate some summary statistics using…

HCAI
- 2,213
- 8
- 33
- 65