--- Begin Message ---
- To: submit@bugs.debian.org
- Subject: Performance regression: slow sequential reads for some block devices (readpage vs readpages) - patched in 3.18+
- From: Nick Thomas <nick@bytemark.co.uk>
- Date: Thu, 25 Jun 2015 14:13:40 +0100
- Message-id: <558BFE84.5010604@bytemark.co.uk>
Package: linux-image-3.16.0-4-amd64
Version: 3.16.7-ckt11-1
Initially discovered inside a QEMU guest, by doing the following:
# hdparm -t /dev/vda
Under the Wheezy 3.2 kernel:
Timing buffered disk reads: 384 MB in 3.01 seconds = 127.50 MB/sec
Under the Jessie 3.16 kernel:
Timing buffered disk reads: 46 MB in 3.07 seconds = 14.97 MB/sec
After some work swapping kernels, I discovered that this behaviour
exists in the 3.16 and 3.17 kernels, but not in 3.2-3.15 or 3.18+.
Watching iostat as the I/O is happening indicated that the 3.16/3.17
guests were performing the I/O in 8-sector (512 bytes per sector)
chunks; in the other kernels, the request sizes were 254 sectors instead.
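(For reference, the average read request size that iostat reports can also be derived by diffing two /proc/diskstats samples and dividing sectors read by reads completed. A minimal Python sketch, illustrative only and not part of the original report:)

```python
# Estimate the average read request size, in 512-byte sectors, from two
# /proc/diskstats samples. Per Documentation/iostats.txt, each line is:
#   major minor name reads_completed reads_merged sectors_read ...

def avg_read_request_sectors(sample_before, sample_after, device):
    def read_counters(sample):
        for line in sample.splitlines():
            parts = line.split()
            if len(parts) >= 6 and parts[2] == device:
                # (reads completed, sectors read)
                return int(parts[3]), int(parts[5])
        raise ValueError("device %r not found in sample" % device)

    reads0, sectors0 = read_counters(sample_before)
    reads1, sectors1 = read_counters(sample_after)
    reads = reads1 - reads0
    return (sectors1 - sectors0) / reads if reads else 0.0
```

Feeding it two snapshots of /proc/diskstats taken around an hdparm -t run shows ~8 sectors/request on the affected kernels and ~254 on the others.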
Some further work identified this patch in 3.18:
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/diff/?id=447f05bb488bff4282088259b04f47f0f9f76760
diff --git a/fs/block_dev.c b/fs/block_dev.c
index 6d72746..e2f3ad08 100644
--- a/fs/block_dev.c
+++ b/fs/block_dev.c
@@ -304,6 +304,12 @@ static int blkdev_readpage(struct file * file, struct page * page)
 	return block_read_full_page(page, blkdev_get_block);
 }
 
+static int blkdev_readpages(struct file *file, struct address_space *mapping,
+			struct list_head *pages, unsigned nr_pages)
+{
+	return mpage_readpages(mapping, pages, nr_pages, blkdev_get_block);
+}
+
 static int blkdev_write_begin(struct file *file, struct address_space *mapping,
 			loff_t pos, unsigned len, unsigned flags,
 			struct page **pagep, void **fsdata)
@@ -1622,6 +1628,7 @@ static int blkdev_releasepage(struct page *page, gfp_t wait)
 static const struct address_space_operations def_blk_aops = {
 	.readpage	= blkdev_readpage,
+	.readpages	= blkdev_readpages,
 	.writepage	= blkdev_writepage,
 	.write_begin	= blkdev_write_begin,
 	.write_end	= blkdev_write_end,
It applies cleanly to 3.16 and 3.17; with the patch applied, the
larger request sizes are again seen and performance returns to the
previous level.
Could we apply this to the 3.16 kernel in Jessie?
--- End Message ---
--- Begin Message ---
- To: Nick Thomas <nick@bytemark.co.uk>, 789941-done@bugs.debian.org
- Subject: Re: Bug#789941: Performance regression: slow sequential reads for some block devices (readpage vs readpages) - patched in 3.18+
- From: Salvatore Bonaccorso <carnil@debian.org>
- Date: Sat, 8 May 2021 21:22:14 +0200
- Message-id: <YJbk5s8yTqiuL9ba@eldamar.lan>
- In-reply-to: <558BFE84.5010604@bytemark.co.uk>
- References: <558BFE84.5010604@bytemark.co.uk>
Source: linux
Source-Version: 3.18-1~exp1
On Thu, Jun 25, 2015 at 02:13:40PM +0100, Nick Thomas wrote:
> Package: linux-image-3.16.0-4-amd64
> Version: 3.16.7-ckt11-1
>
> Initially discovered inside a QEMU guest, by doing the following:
>
> # hdparm -t /dev/vda
>
> Under the Wheezy 3.2 kernel:
>
> Timing buffered disk reads: 384 MB in 3.01 seconds = 127.50 MB/sec
>
> Under the Jessie 3.16 kernel:
>
> Timing buffered disk reads: 46 MB in 3.07 seconds = 14.97 MB/sec
>
> After some work swapping kernels, I discovered that this behaviour
> exists in the 3.16 and 3.17 kernels, but not in 3.2->3.15 or 3.18+
>
> Watching iostat as the I/O is happening indicated that the 3.16/3.17
> guests were performing the I/O in 8-sector (512 bytes per sector)
> chunks; in the other kernels, the request sizes were 254 sectors instead.
>
> Some further work identified this patch in 3.18:
>
> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/diff/?id=447f05bb488bff4282088259b04f47f0f9f76760
This commit landed upstream in 3.18-rc1 and was first included in
Debian in 3.18-1~exp1.
Closing the report.
Regards,
Salvatore
--- End Message ---