
Re: delete lines that contain duplicated column items



On Tue, Apr 03, 2007 at 08:19:10PM +0800, Jeff Zhang wrote:
> I have a simple txt file, like:
> ...
> a a
> aa a
> b b
> ba b
> ...
> 
> I want to keep only the first line in which each column-2 value appears,
> and delete the following lines that duplicate it in column 2.
> then it will like:
> ...
> a a
> b b
> ...
> 
> I've tried `uniq -f1`  but it didn't work.

`uniq' only removes *adjacent* duplicate lines, so unless the input is
sorted on that field it will miss most of them.  awk's associative arrays
(or Perl's hashes) are a better fit for this sort of thing, e.g.,

  $ awk '!seen[$2]{print; seen[$2]=1}' < file

awk scans the input line by line, tests each line against the pattern(s),
and runs the associated action(s).  Here, if the 2nd-column value hasn't
been seen yet, the line is printed and recorded as seen.
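To see it in action, here is the one-liner run against the sample data
from the question (the printf is just a stand-in for the real file):

```shell
# Sample input from the question: column 2 repeats across lines.
printf 'a a\naa a\nb b\nba b\n' |
# Print a line only the first time its 2nd-column value appears.
awk '!seen[$2]{print; seen[$2]=1}'
# prints:
# a a
# b b
```

A common shorthand for the same pattern is `awk '!seen[$2]++'`, which
bumps the counter and tests it in one expression.  The Perl equivalent
uses a hash the same way: `perl -ane 'print unless $seen{$F[1]}++' file`
(-a autosplits each line into @F, which is zero-indexed).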

-- 
Ken Irving


