diff options
author | Eric Wong <e@yhbt.net> | 2020-05-10 22:37:12 +0000 |
---|---|---|
committer | Eric Wong <e@yhbt.net> | 2020-05-12 06:15:59 +0000 |
commit | 098fecd1fe516a00fbfd49622b82be382ebcdab6 (patch) | |
tree | 0b38bb0910e6f9b8c3600ba6ef105aac43bafed6 /Documentation/public-inbox-v2-format.pod | |
parent | cd8cda10c9687533949a8a358fd7b858f704da6e (diff) | |
download | public-inbox-098fecd1fe516a00fbfd49622b82be382ebcdab6.tar.gz |
The old name may be confused with "Content-ID" as described in RFC 2392, so use an alternate name to avoid confusing future readers.
Diffstat (limited to 'Documentation/public-inbox-v2-format.pod')
-rw-r--r-- | Documentation/public-inbox-v2-format.pod | 12 |
1 files changed, 6 insertions, 6 deletions
diff --git a/Documentation/public-inbox-v2-format.pod b/Documentation/public-inbox-v2-format.pod index d87a717d..9e284a75 100644 --- a/Documentation/public-inbox-v2-format.pod +++ b/Documentation/public-inbox-v2-format.pod @@ -159,7 +159,7 @@ top-level of the directory. =head1 OBJECT IDENTIFIERS -There are three distinct type of identifiers. content_id is the +There are three distinct type of identifiers. content_hash is the new one for v2 and should make message removal and deduplication easier. object_id and Message-ID are already known. @@ -179,11 +179,11 @@ The email header; duplicates allowed for archival purposes. This remains a searchable field in Xapian. Note: it's possible for emails to have multiple Message-ID headers (and L<git-send-email(1)> had that bug for a bit); so we take all of them into account. -In case of conflicts detected by content_id below, we generate a new -Message-ID based on content_id; if the generated Message-ID still +In case of conflicts detected by content_hash below, we generate a new +Message-ID based on content_hash; if the generated Message-ID still conflicts, a random one is generated. -=item content_id +=item content_hash A hash of relevant headers and raw body content for purging of unwanted content. This is not stored anywhere, @@ -193,7 +193,7 @@ For now, the relevant headers are: Subject, From, Date, References, In-Reply-To, To, Cc -Received, List-Id, and similar headers are NOT part of content_id as +Received, List-Id, and similar headers are NOT part of content_hash as they differ across lists and we will want removal to be able to cross lists. @@ -203,7 +203,7 @@ raw body risks being broken by list signatures; but we can use filters (e.g. PublicInbox::Filter::Vger) to clean the body for imports. -content_id is SHA-256 for now; but can be changed at any time +content_hash is SHA-256 for now; but can be changed at any time without making DB changes. =back |