[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: [netfilter-core] [sparc64] possible circular locking / deadlock



Hi Jozsef,

On Mon, Jun 17, 2019 at 05:02:51PM +0200, Jozsef Kadlecsik wrote:
> Hi,
> 
> On Mon, 17 Jun 2019, Anatoly Pugachev wrote:
> 
> > Getting the following git kernel trace on boot with rc.local having :
> > 
> > ipset create sshguard4 hash:net
> > iptables -A INPUT -p tcp --dport 22 -m set --match-set sshguard4 src -j DROP
> 
> In spite of "iptables", it must be the nftables compat backend.
>  
> > current git kernel:
> > 
> > $ uname -a
> > Linux ttip 5.2.0-rc5 #981 SMP Mon Jun 17 09:52:04 MSK 2019 sparc64 GNU/Linux
> > linux-2.6$ git desc
> > v5.2-rc5
> > 
> > 
> > $ dmesg
> > <cut>
> > [   10.356388] Adding 787176k swap on /dev/vdiska4.  Priority:-2
> > extents:1 across:787176k FS
> > [   10.471900] EXT4-fs (vdiska1): mounting ext3 file system using the
> > ext4 subsystem
> > [   10.487226] EXT4-fs (vdiska1): mounted filesystem with ordered data
> > mode. Opts: (null)
> > [   11.158102] random: crng init done
> > [   11.158155] random: 7 urandom warning(s) missed due to ratelimiting
> > 
> > [   11.697866] ======================================================
> > [   11.697875] WARNING: possible circular locking dependency detected
> > [   11.697886] 5.2.0-rc5 #981 Not tainted
> > [   11.697894] ------------------------------------------------------
> > [   11.697902] iptables/732 is trying to acquire lock:
> > [   11.697913] 000000004f61aa56 (&table[i].mutex){+.+.}, at:
> > nfnl_lock+0x24/0x40 [nfnetlink]
> > [   11.697937]
> >                but task is already holding lock:
> > [   11.697946] 000000000d652829 (&net->nft.commit_mutex){+.+.}, at:
> > nf_tables_valid_genid+0x18/0x60 [nf_tables]
> > [   11.697973]
> >                which lock already depends on the new lock.
> > 
> > [   11.697983]
> >                the existing dependency chain (in reverse order) is:
> > [   11.697992]
> >                -> #1 (&net->nft.commit_mutex){+.+.}:
> > [   11.698012]        __mutex_lock+0x48/0x920
> > [   11.698021]        mutex_lock_nested+0x1c/0x40
> > [   11.698033]        nf_tables_valid_genid+0x18/0x60 [nf_tables]
> > [   11.698043]        nfnetlink_rcv_batch+0x24c/0x620 [nfnetlink]
> > [   11.698053]        nfnetlink_rcv+0x110/0x140 [nfnetlink]
> > [   11.698067]        netlink_unicast+0x12c/0x1e0
> > [   11.698076]        netlink_sendmsg+0x324/0x360
> > [   11.698091]        sock_sendmsg+0x34/0x80
> > [   11.698099]        ___sys_sendmsg+0x228/0x240
> > [   11.698108]        __sys_sendmsg+0x4c/0x80
> > [   11.698116]        sys_sendmsg+0x18/0x40
> > [   11.698131]        linux_sparc_syscall+0x34/0x44
> > [   11.698138]
> >                -> #0 (&table[i].mutex){+.+.}:
> > [   11.698157]        lock_acquire+0x1a4/0x1c0
> > [   11.698165]        __mutex_lock+0x48/0x920
> > [   11.698173]        mutex_lock_nested+0x1c/0x40
> > [   11.698181]        nfnl_lock+0x24/0x40 [nfnetlink]
> > [   11.698196]        ip_set_nfnl_get_byindex+0x19c/0x280 [ip_set]
> > [   11.698207]        set_match_v1_checkentry+0x14/0xc0 [xt_set]
> 
> set_match_v1_checkentry() from ipset always assumed that it's called via 
> the old xtables/setsockopt interface. Thus it calls 
> ip_set_nfnl_get_byindex() which is then calls 
> nfnl_lock(NFNL_SUBSYS_IPSET). Here comes the circular dependency.
> 
> I suppose the only solution is to check wether the mutex is already held 
> or not. Until I send the patch, the only way to avoid the issue is to use 
> the old legacy xtables interface.

There's par->nft_compat in xt_tgchk_param that allows you to know if
you are in the context of the xt over nft infrastructure.


Reply to: