Questions tagged [reproducible-research]

Reproducible research is the idea that the result of scientific research should be published with data and code in order to make it possible for other researchers to verify the results.

Reproducible research is the idea that the result of scientific research should be published with data and code in order to make it possible for other researchers to verify the results.

Reproducible research may be especially important to you if your investigation involves large amount of data or very complex calculations.

One possible set of tools for reproducible research is using with or .

Related links:

227 questions
0
votes
0 answers

Where did OpenCV get their algorithm from for correcting fisheye lenses?

I've been looking into correcting images taken with a fisheye lens and was wondering where OpenCV got their algorithm from? Was it based on a research paper someone published or is it an algorithm developed internally? I have tried looking around on…
0
votes
1 answer

Questions in a Character Degradation Model Paper

I have questions in the ‘Noise region definition’ and ‘Noise generation process’ of the paper “A character degradation model for grayscale ancient document images”. In Noise region definition, g controls the flatness of the regions. What does it…
alyssaeliyah
  • 2,214
  • 6
  • 33
  • 80
0
votes
3 answers

How can I always pull my latest docker image but still deterministically record its composition for future reproducibility?

I'm doing analytical work inside a "Lab" docker environment which I manage. I use Travis to build, tag and publish the lab image to a docker container registry (AWS ECR) and then always pull latest image when I start the container to do my…
0
votes
1 answer

How can I get reproducible results in keras for a convolutional neural network using data augmentation for image classification?

If I train the same convolutional neural network model architecture (on the same data) twice, clearing the session between runs, I get different results. I've set random seeds and thread config as follows: import numpy as np from numpy.random…
0
votes
1 answer

"TypeError: 'Session' object is not callable" error running sess = tf.compat.v1.Session()(graph=tf.compat.v1.get_default_graph(), config=session_conf)

I'm trying to set seeds and configure keras settings to ensure my experiments are reproducible. When I run the following (based on code in an answer to this question): # Import libraries import numpy as np import pandas as pd import tensorflow as…
0
votes
1 answer

Reproduce same results on each run - Keras, Google Colab

I run the following code in Google Colab(with GPU): import random random.seed(1) import numpy as np from numpy.random import seed seed(1) from tensorflow import set_random_seed set_random_seed(2) import pandas as pd from keras.layers.convolutional…
0
votes
0 answers

knitr: Better Reproducibility of Knitted Code

Consider the file foo.Rmd following: ```{r foo, echo=yes} print("Hello") ``` Note that the echoing of the chunk code is not constant, but depends on the value of the variable yes set outside foo.Rmd. One can set the value of the knitting variable,…
antonio
  • 10,629
  • 13
  • 68
  • 136
0
votes
1 answer

How to resolve issue with path with here package in R?

I had the following piece of code which is used for obtaining 4 csv files from a directory called RawData and combining the rows using rbind which works fine library(data.table) setwd("C:/Users/Gunathilakel/Desktop/Vera Wrap up…
Nathan123
  • 763
  • 5
  • 18
0
votes
1 answer

lm() and t.test(var.equal = TRUE) differ on one machine but not on another. Possible reasons?

I have noticed weird behaviour of lm() more specifically that the t.values do not work out. This behaviour is only observable on my machine, unregarding the loaded packages/objects in the global environment. Running the example from…
AaronP
  • 185
  • 10
0
votes
1 answer

How to provide singularity images where users can add a custom set of software from a catalogue provided by us

We want to improve the reproducibility of the analyses at our institute. To this effect, we contemplate on implementing a system based on Singularity. The idea is that at the beginning of the analysis, the user can choose a machine configuration…
0
votes
0 answers

Why is the working directory overwritten to the directory of the current Rmd file?

I have an R.proj file called Food_Choices.Rproj that is supposed to be setting my working directory to ~/Desktop/Food_Choices, a folder containing reproducibility files according to the TIER system like But it's not setting the working directory…
0
votes
1 answer

I am going to do a sentiment analysis research on facebook comments.any idea how can I coolect facebook comments for my research?

I am doing sentimental analysis research for my final year research for facebook comments. But I don't know how can I collect facebook comment's data.
0
votes
1 answer

Simple TensorFlow computation not reproducible on different systems (macOS, Colab, Azure)

I am investigating the reproducibility of code in TensorFlow on my macOS machine, on Google Colab, and on Azure with Docker. I understand that I can set a graph-level seed and an operation-level seed. I am using eager mode (so no parallelism…
miguelmorin
  • 5,025
  • 4
  • 29
  • 64
0
votes
2 answers

Not being able to reproduce the Same results over multiple runs for an LSTM model in tensorflow

I was training an LSTM network in tensorflow. My model has the following configuration: time_steps = 1700 Cell size: 120 Number of input features x = 512. Batch size: 34 Optimizer: AdamOptimizer with learning rate = 0.01 Number of epochs = 20 I…
I. A
  • 2,252
  • 26
  • 65
0
votes
1 answer

Can H2O deeplearning models in R be reproducible while remaining multithreaded?

I've been working on validating models developed using h2o. Specificially I've been testing a neural net implemented using h2o.deeplearning. I've been attempting to generate consistent results by setting a seed in the H2O function, but even doing…