Bug#280844: Another patch to fix downloaded files corruption

To: Michael Vogt <mvogt@acm.org>
Cc: 280844@bugs.debian.org
Subject: Bug#280844: Another patch to fix downloaded files corruption
From: Petr Vandrovec <vandrove@vc.cvut.cz>
Date: Sun, 23 Oct 2005 17:55:26 +0200
Message-id: <435BB26E.2040105@vc.cvut.cz>
Reply-to: Petr Vandrovec <vandrove@vc.cvut.cz>, 280844@bugs.debian.org
In-reply-to: <20051023112700.GD18240@top.ping.de>
References: <20051022160004.GA10379@vana.vc.cvut.cz> <20051023112700.GD18240@top.ping.de>

Michael Vogt wrote:

On Sat, Oct 22, 2005 at 06:00:04PM +0200, Petr Vandrovec wrote:
I have no idea what I'm doing wrong. So I wrote something that looks more
reasonable to me than Geller's original patch. Am I really the only one who cares
that apt is corrupting the packages it downloads? This time tetex-base_3.0.orig.tar.gz
is corrupted on redownload, as its last byte is CR (0D) (it is replaced with 'H',
see previous messages in this bug report and bug 290694).
I applied your patch to the apt I uploaded to experimental. I'm not
entirely sure about possible side effects of the patch, so I would like
to see it tested in experimental first.

Thanks. The original code seems to think that the line delimiter is either LF or LF-CR. But the RFC says quite clearly that it is CR-LF. So usually you have:


Content-type: application/octet-stream<CR><LF>
<CR><LF>
binarydata

which the current parser parses as 'Content-type: application/octet-stream<CR><LF><CR>' and then finds an empty line with <LF> only.

But if the binary data starts with <CR>, the old parser finds an empty line with <LF><CR>, eating the first byte of the data payload. Then the whole payload is shifted by one byte, and at the end the first byte of the following 'HTTP/1.0 ...' response is eaten, causing a failure for the subsequently downloaded package, as the parser does not understand the 'TTP/1.0' header.
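
The failure can be reproduced with a small model of the old delimiter rule. This is only a sketch in Python, not apt's actual code: the function name old_split, the Content-Length header, and the sample payload are all made up for illustration.

```python
def old_split(buf: bytes):
    """Hypothetical model of the old rule: a line ends at LF,
    and a CR directly after that LF is swallowed as part of the delimiter."""
    lines = []
    pos = 0
    while True:
        lf = buf.index(b"\n", pos)       # raises ValueError if headers are incomplete
        line = buf[pos:lf]
        pos = lf + 1
        if buf[pos:pos + 1] == b"\r":    # LF-CR treated as one delimiter
            pos += 1
        if line == b"":                  # empty line: end of headers
            return lines, buf[pos:]
        lines.append(line)


body = b"\rPAYLOAD"                      # 8 bytes, the first one is CR
first = b"HTTP/1.0 200 OK\r\nContent-Length: 8\r\n\r\n" + body
stream = first + b"HTTP/1.0 200 OK\r\n\r\n"   # second response follows directly

lines, rest = old_split(stream)
got, leftover = rest[:8], rest[8:]       # read 8 bytes as Content-Length says
print(got)                               # b'PAYLOADH'
print(leftover[:7])                      # b'TTP/1.0'
```

The leading CR of the payload is gone, the 'H' of the next response ends up as the last payload byte, and the next response starts with 'TTP/1.0'.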

With my fixes it parses the example above as 'Content-type: application/octet-stream<CR><LF>' followed by an empty '<CR><LF>' line, and it should never look beyond this last <LF> at the transferred data.
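
The intended behaviour can be sketched the same way. Again this is an illustration in Python under my own assumptions, not the patch itself: split_headers is a hypothetical name, and the sketch accepts only strict CR-LF line endings, which is stricter than a real parser would have to be.

```python
def split_headers(buf: bytes):
    """Hypothetical sketch: lines end with CR-LF, and nothing after the
    LF of the empty line is examined, so the payload is left untouched."""
    lines = []
    pos = 0
    while True:
        end = buf.find(b"\r\n", pos)
        if end < 0:
            return None                  # headers incomplete, more data needed
        line = buf[pos:end]
        pos = end + 2                    # step over exactly one CR-LF
        if not line:                     # empty line: end of headers
            return lines, buf[pos:]
        lines.append(line)


response = (b"HTTP/1.0 200 OK\r\n"
            b"Content-type: application/octet-stream\r\n"
            b"\r\n"
            b"\rbinarydata")             # payload starts with CR
headers, payload = split_headers(response)
print(headers)
print(payload)
```

Here the payload keeps its leading CR, because the scan stops right after the LF that closes the empty line.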

Maybe this loop should skip all <CR> bytes while doing the copy, so that subsequent code can rely on lines separated by a single <LF> only, but as it seems that everybody already handles random <CR>s scattered through the headers, it would be just a cleanup...
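
Such a cleanup might look roughly like this. It is a Python sketch of the idea only: copy_headers_without_cr is a hypothetical helper, not a function in apt, and it assumes the header block does not begin with an empty line.

```python
def copy_headers_without_cr(buf: bytes):
    """Hypothetical sketch: copy the header block, dropping every CR,
    and stop right after the LF that closes the empty line."""
    out = bytearray()
    for i, byte in enumerate(buf):
        if byte == 0x0D:                 # skip CR inside the headers
            continue
        out.append(byte)
        if out.endswith(b"\n\n"):        # empty line reached
            # Return before looking at any payload byte, so a payload
            # that starts with CR is not modified.
            return bytes(out), buf[i + 1:]
    return None                          # headers incomplete


clean, payload = copy_headers_without_cr(b"A: 1\r\nB: 2\r\n\r\n\rdata")
print(clean)                             # b'A: 1\nB: 2\n\n'
print(payload)                           # b'\rdata'
```

The point of returning immediately at the second LF is that the CR skipping applies to headers only and never reaches the transferred data.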

Can somebody at least tell me why this important data-corrupting bug has been ignored for more than a year?
Probably because this is a very central piece of the code and any
mistake here is fatal. Anyway, it's in experimental now and let's hope
we find enough people to test it :)


Thanks.
								Petr Vandrovec
