git@vger.kernel.org list mirror (unofficial, one of many)
 help / color / mirror / code / Atom feed
From: <rsbecker@nexbridge.com>
To: "'Addison Klinke'" <addison@baller.tv>,
	"'Jason Pyeron'" <jpyeron@pdinc.us>
Cc: "'Junio C Hamano'" <gitster@pobox.com>, <git@vger.kernel.org>,
	"'Addison Klinke'" <agk38@case.edu>
Subject: RE: [FR] supporting submodules with alternate version control systems (new contributor)
Date: Tue, 10 May 2022 14:26:33 -0400	[thread overview]
Message-ID: <03ca01d8649b$7d6a3310$783e9930$@nexbridge.com> (raw)
In-Reply-To: <CAE9CXujPzu3_95pBDVRXKFU_z40j9Y7v5_1y3c+WnFpz1_oY4w@mail.gmail.com>

On May 10, 2022 1:27 PM, Addison Klinke wrote:
>Thanks for the quick replies
>
>> Junio Hamano: When you bind an external repository (be it stored in
>> Git or
>somebody else's system) as a submodule, each commit in the superproject
>records which exact commit in the submodule is used with the rest of the
>superproject tree.
>
>This should be fine then - at least the data versioning tools I'm familiar with can all
>specify their current commit and checkout by commit hash. Does it matter how
>the hashes are structured/stored internally? For example, I believe Dolt keeps
>them in a MySQL table that connects to Noms under the hood.
>
> > Junio Hamano: not judging "if it is realistic" at this point
>
>What would be the best approach for answering this portion?

Basically, answer the following: Can you implement a command like the cvs2git that can be re-executed on an idempotent (repeatedly with the same result) basis?

If yes, then you can build your own automation to move code into a submodule from your own VCS system into a git repository and the work with the submodule without the git code-base knowing about this.

If you can go the other way, from git to your other VCS system, repeatedly, then you can go back again. This is likely to be much harder as git has a much richer representation model than is typical of VCS systems.

One way may be sufficient for your purposes. Research how cvs2git works and see whether you are able to emulate its functions.

>> Jason Pyeron: The submodule "type" could create an object (hashed and
>> stored) that contains the needed "translation" details
>
>That sounds like an interesting idea. Since I'd like to offload the burden of
>maintaining these translation files to the 3rd party developers, it would be nice if
>they got copied to a standard location (i.e. ~/.gitmodules/translations/tool_x)
>during the 3rd party install.
>Then when a submodule is added with "type = tool_x", git checks that the
>appropriate translation file is available, and if so, copies it into the parent
>repository.
>
>On Tue, May 10, 2022 at 11:20 AM Jason Pyeron <jpyeron@pdinc.us> wrote:
>>
>> > -----Original Message-----
>> > From: Junio C Hamano
>> > Sent: Tuesday, May 10, 2022 1:01 PM
>> > To: Addison Klinke <addison@baller.tv>
>> >
>> > Addison Klinke <addison@baller.tv> writes:
>> >
>> > > Is something along these lines feasible?
>> >
>> > Offhand, I only think of one thing that could make it fundamentally
>> > infeasible.
>> >
>> > When you bind an external repository (be it stored in Git or
>> > somebody else's system) as a submodule, each commit in the
>> > superproject records which exact commit in the submodule is used
>> > with the rest of the superproject tree.  And that is done by
>> > recording the object name of the commit in the submodule.
>> >
>> > What it means for the foreign system that wants to "plug into" a
>> > superproject in Git as a submodule?  It is required to do two
>> > things:
>> >
>> >  * At the time "git commit" is run at the superproject level, the
>> >    foreign system has to be able to say "the version I have to be
>> >    used in the context of this superproject commit is X", with X
>> >    that somehow can be stored in the superproject's tree object
>> >    (which is sized 20-byte for SHA-1 repositories; in SHA-256
>> >    repositories, it is a bit wider).
>> >
>> >  * At the time "git chekcout" is run at the superproject level, the
>> >    superproject will learn the above X (i.e. the version of the
>> >    submodule that goes with the version of the superproject being
>> >    checked out).  The foreign system has to be able to perform a
>> >    "checkout" given that X.
>> >
>> > If a foreign system cannot do the above two, then it fundamentally
>> > would be incapable of participating in such a "superproject and
>> > submodule" relationship.
>>
>> The submodule "type" could create an object (hashed and stored) that contains
>the needed "translation" details. The object would be hashed using SHA1 or
>SHA256 depending on the git config. The format of the object's contents would be
>defined by the submodule's "code".

I would not try to do this inside the git infrastructure. What you may be able to do in my suggestion above, is to restrict how your other VCS system is used and restrict how your team uses git to make the mapping repeatable. This is typical of some environments where there is an SVN repo and a git repo that are mirrored. This does simplify matters particularly if you do not have to modify either system but are building a façade or wrapper around both.

Keep this as simple as possible to meet a minimum viable set of requirements.
--Randal 


  reply	other threads:[~2022-05-10 18:27 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-05-10 16:11 Addison Klinke
2022-05-10 17:00 ` Junio C Hamano
2022-05-10 17:20   ` Jason Pyeron
2022-05-10 17:26     ` Addison Klinke
2022-05-10 18:26       ` rsbecker [this message]
2022-05-10 20:54     ` Philip Oakley
2022-06-01 12:44       ` Addison Klinke
2022-06-03 23:06         ` Philip Oakley
2022-06-04  2:01           ` rsbecker
2022-06-04 13:27             ` Philip Oakley
2022-06-04 15:57               ` rsbecker
2022-06-05 21:52                 ` Philip Oakley
2022-06-06 14:53                   ` Addison Klinke

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: http://vger.kernel.org/majordomo-info.html

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='03ca01d8649b$7d6a3310$783e9930$@nexbridge.com' \
    --to=rsbecker@nexbridge.com \
    --cc=addison@baller.tv \
    --cc=agk38@case.edu \
    --cc=git@vger.kernel.org \
    --cc=gitster@pobox.com \
    --cc=jpyeron@pdinc.us \
    --subject='RE: [FR] supporting submodules with alternate version control systems (new contributor)' \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Code repositories for project(s) associated with this inbox:

	https://80x24.org/mirrors/git.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).