Questions tagged [checkpoint]

Check Point Software Technologies is a widely deployed brand of firewalls and other security and networking products.

304 questions
3
votes
0 answers

Iterative spark with caching runs out of memory

I have a tiny PySpark dataframe with relations and a function that calculates the transitive closure. I already know a couple of ways in which I can improve the function (including getting rid of the groupBy), but let's stick with this. When I…
RickyG
  • 138
  • 6
3
votes
1 answer

How to set .libPaths (checkpoint) on workers when running parallel computation in R

I use the checkpoint package for reproducible data analysis. Some of the computations take a long time to compute, so I want to run those in parallel. When run in parallel however the checkpoint is not set on the workers, so I get an error message…
needRhelp
  • 2,948
  • 2
  • 24
  • 48
3
votes
3 answers

Keras doesn't load the model and weights when using checkpoint

I'm using Keras to build a deep autoencoder. I used its checkpointer to load the model and the weights, but the result is always None, which I think means that the checkpoint doesn't work correctly and is not saving weights. Here is the code how I…
Mari
  • 69
  • 1
  • 8
3
votes
1 answer

Docker container migration

I'm trying to use the experimental checkpoint feature to implement container migration between machines. I've found many examples of checkpointing and restoring on the same machine but I've only found this documentation about migrating checkpoints…
3
votes
1 answer

tensorflow cifar10 resume training from checkpoint file

While using TensorFlow, I am trying to resume CIFAR10 training using a checkpointed file. Referencing some other articles, I tried tf.train.Saver().restore with no success. Can someone shed light on how to proceed? Code snippet from TensorFlow…
emerson
  • 153
  • 1
  • 8
3
votes
0 answers

R: set 'Checkpoint' on Worker of Cluster

I use the following code to ... 1. create a parallel cluster 2. source test.R 3. and do some parallel work with functions defined in 'test.R' library(parallel) cl <- makeCluster(4) clusterEvalQ(cl, source("test.R")) ## do some parallel…
Bernd
  • 3,405
  • 3
  • 18
  • 21
3
votes
1 answer

Is it possible to use RRO without checkpoint package?

Revolution R Open (RRO) is distributed with the checkpoint package to improve the reproducibility of package references. However, some users might not care about reproducibility but really care about the performance enhancement RRO provides. Is it…
Kun Ren
  • 4,715
  • 3
  • 35
  • 50
3
votes
1 answer

python, multiprocessing and dmtcp: checkpointing one process in Pool?

Is it possible to use Python's integration of dmtcp to checkpoint a child process in parallel execution? My situation is as follows: I have a multiprocessing.Pool with several workers receiving async jobs (using apply_async). Certain big jobs…
3
votes
5 answers

Is it possible to interrupt a process and checkpoint it to resume it later on?

Let's say you have an application that is consuming all the computational power. Now you want to do some other necessary work. Is there any way on Linux to interrupt that application and checkpoint its state, so that later on it could be…
pythonic
  • 20,589
  • 43
  • 136
  • 219
2
votes
0 answers

Spark Streaming: Checkpoint corrupted

Today 4 streaming jobs started to fail due to: StreamingQueryException: [STREAM_FAILED] Query [id = ####, runId = ####] terminated with exception: dbfs:/mnt/path/my_table/sources/0/0 doesn't exist (latestId: 8, compactInterval: 10). These…
2
votes
2 answers

C# BackgroundWorker Cancellation checkpoint shortening

Is there a way to shorten my BackgroundWorker.CancellationPending checkpoint? For example, is there a way to encapsulate return like the example code below?: //REAL CODE (CURRENTLY USE THIS) if (this.TW.CancellationPending) return; //PSEUDO…
PiZzL3
  • 2,092
  • 4
  • 22
  • 30
2
votes
0 answers

Tensorflow Loading pretrained BERT checkpoint on Mac

I am currently trying to import a pretrained BERT model for text classification. After days of searching the internet I've not been able to find a fix. I'm using Python version 3.8.2 and TensorFlow version 2.4.1. The pretrained model can be found here:…
2
votes
2 answers

Path for Flink state.checkpoints.dir in docker-compose in Windows 10 environment

I have Windows 10 and docker-compose, and I want to work with the Apache Flink tutorial Playground. docker-compose starts correctly, but after several minutes of work, Apache Flink has to create checkpoints, but there is some…
Vadim
  • 753
  • 8
  • 22
2
votes
1 answer

TensorFlow: load checkpoint from changed model

For some reason I want to test the difference in performance between a detector and an otherwise identical version of it fine-tuned with some 3D convolutions. The detector model is Google's EfficientDet, and the weights are fine-tuned on custom data. I was…
Nodiz
  • 333
  • 2
  • 8
2
votes
1 answer

Trying to go back to a "Checkpoint" in ZSH script

OK, so I'm making a program in ZSH, and I've made all the client-side scripts that are run with commands. I'm having two problems where I need to set "Checkpoints", just like in a video game. First Problem: A="LUNGOTEVEREMARZIO-UMBERTO…
RCdS
  • 23
  • 3