Questions tagged [git-annex]

git-annex allows managing files with Git, without checking the file contents into Git. While that may seem paradoxical, it is useful when dealing with files larger than Git can currently easily handle, whether due to limitations in memory, time, or disk space.

git-annex is an alternative to git-lfs; both systems work by replacing the files that are to be managed by, but not stored in, Git with "link" files. Git then manages the link file instead of the actual data. The actual data for each commit must be stored independently somewhere. The interesting choices here are the "somewhere", how and when the versioned-outside-Git files are synchronized, and how the system itself (git-annex or git-lfs) comes up with names for each versioned file.
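As a rough illustration of the "link file" idea and of content-derived naming, here is a minimal Python sketch. It is not how git-annex is implemented: the function names `annex_style_key` and `stash_and_link` and the flat `store_dir` layout are hypothetical, and the key shape (`SHA256E-s<size>--<sha256><ext>`) only imitates the general form of git-annex's SHA256E keys, which in the real tool follow additional rules and live in a nested directory structure inside the repository.

```python
import hashlib
import os
import shutil
import tempfile


def annex_style_key(path):
    """Build a content-derived name shaped like SHA256E-s<size>--<sha256><ext>."""
    digest = hashlib.sha256()
    size = 0
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so large files never have to fit in memory.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
            size += len(chunk)
    ext = os.path.splitext(path)[1]
    return f"SHA256E-s{size}--{digest.hexdigest()}{ext}"


def stash_and_link(path, store_dir):
    """Move the content into store_dir under its key and leave a symlink behind.

    store_dir is a hypothetical flat object store used only for this sketch.
    """
    key = annex_style_key(path)
    os.makedirs(store_dir, exist_ok=True)
    target = os.path.join(store_dir, key)
    if os.path.exists(target):
        os.remove(path)  # identical content is already stored once
    else:
        shutil.move(path, target)
    # A relative link keeps working if the whole tree is moved elsewhere.
    link_dir = os.path.dirname(os.path.abspath(path))
    os.symlink(os.path.relpath(target, link_dir), path)
    return key


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as work:
        big = os.path.join(work, "video.mp4")
        with open(big, "wb") as fh:
            fh.write(b"pretend this is large")
        print(stash_and_link(big, os.path.join(work, "store")))
        print(os.readlink(big))
```

In this sketch only the small symlink would be committed to Git; the stored object would have to be synchronized by some other channel, which is exactly the "somewhere" choice described above.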

The host site for git-annex is https://git-annex.branchable.com/

62 questions
0
votes
1 answer

Cause of non-deterministic `git fsck` errors on MacOS 11.2.3, even after deleting and re-cloning?

I have several git and git-annex repositories on my personal laptop, which runs MacOS 11.2.3 with git version 2.24.3 (Apple Git-128) and git-annex 8.20210310. After upgrading my laptop to 11.2.x (Big Sur), I began noticing a much higher level of…
0
votes
0 answers

How can I represent a class of an existing Python API in Django?

I'm trying to integrate an existing Python API (not a web API, but a Python library called Datalad) into a Django application. Going deeper, I'd like to create a Django model that represents a dataset class, that is the representation of a…
Befree
0
votes
1 answer

How to permanently delete a commit in git (annex)?

I've started using datalad, a wrapper for git annex, to version control data and experiments in my lab. It works great except the .git folder can silently grow enormous, especially when going back and forth in git history to repeat certain steps.…
Tim
0
votes
1 answer

Using datalad with Google Cloud Storage

I am new to Datalad. I am trying to get version history and commit details for every person who makes changes to my Datalad dataset. So far, I have been able to create a sibling of my local dataset in a cloud storage bucket and to export…
0
votes
0 answers

Relative Symlinks on an NFS Share

I've set up an NFS share on my Linux server, with this line in `/etc/exports`: `/vol/int0 192.168.0.0/24(rw,sync,no_subtree_check)`. However, this share primarily contains git repositories, or more specifically, git-annexes. That is, the repositories…
Xaldew
0
votes
1 answer

Is there a way to download only part of a datalad dataset?

For example, `datalad install -g ///adhd200/RawData/Brown` downloads the entire dataset. But let's say I only want: RawData / Brown / 0026001 / session_1 / anat_1. Is there a way to achieve this?
bdfy
0
votes
2 answers

Alternate hashing function based on file extension

I'd like to use git to track media files as well as associated playlists. Tracking playlists is easy, because these are text files. As for the binary files, I've already taken a look at git-lfs and git-annex, but would want to explore the following…
kalou.net
0
votes
1 answer

git annex: 'put' functionality as opposed to 'get'

The git annex documentation (https://git-annex.branchable.com/walkthrough/#index5h2) gives an example of using get: `git annex sync laptop`, then `git annex get .`, which prints `get my_cool_big_file (from laptop...) ok` and `get iso/debian.iso (from laptop...) ok`. Is there a…
Rob Stewart
0
votes
1 answer

Version control for binary files that doesn't store the files

OK, I have a large folder with millions of binary files. It is possible that these files are altered by a remote process, and I need to know when that happens, BUT… I don't want to store a second copy of these files (inside a repo) and I cannot…
jaydisc
0
votes
1 answer

How to delete one of several git-annex replicas?

Say I have several (normal) git-annex replicas. Now, for some reason I want to give one of the machines or hard drives away, so I want to reduce the number of replicas by one, i.e. delete one replica. The first thing I can do is to ensure by git…
Peter Slotko
0
votes
0 answers

How to undo `git annex initremote`

I used `git annex initremote myrsync type=rsync rsyncurl=ssh://... encryption=none` to initialise a remote. However, when attempting a `git annex copy`, I get errors like: rsync failed -- run git annex again to resume file transfer failed copy ...…
gmatht
0
votes
1 answer

git annex get unable to establish SSL connection with dropbox

I was trying to build and compile a project called RoboComp on GitHub. When I tried to download some packages using the command `git annex get .`, it failed because it couldn't establish an SSL connection with https://dropboxusercontent.com/s/..., which I find confusing. I…
Xingyu Lin
0
votes
1 answer

Git annex test add file, sync

I have created a git annex repo, added data. I then went to check it out in another location in the following way (my goal is to checkout origin, add a test file, push it back to origin). git clone ../test_repo/ cd test_repo/ git status git…
Tad
0
votes
1 answer

Git-Annex hangs

I am experimenting with git-annex and I think I must be doing something wrong. I have git-annex installed on the machine. I then did the following $ mkdir annex $ cd annex/ $ git init Initialized empty Git repository in /annex/.git/ $ git…
Tad
-1
votes
1 answer

Cannot run java in my git-annex'ed project

I have code of a research project managed by datalad (which is a frontend for git and git-annex). It contains my code together with a Singularity container for reproducibility. I installed java manually into this directory. I could run java -version…
akraf