Remove duplicates based on a condition with awk/bash

- June 15, 2015

I want to remove duplicates from a dataset that has 3 columns:

a       0   3238
b       0   3367
c       0   3130
d       1   3130

I need to remove lines that contain duplicate values in the third column, preferentially keeping the line with the value '1' in the second column. I know how to remove duplicates using awk, but I can't work out how to add in the conditional statement.

Thanks.

Give this line a try:

awk '{if($3 in a)a[$3]=$2==1?$0:a[$3];else a[$3]=$0}END{for(i in a)print a[i]}' file
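For readability, the same idea can be spelled out as a small shell function. This is only a sketch: the function name dedupe_prefer_one and the array name best are made up for illustration, and it assumes whitespace-separated input. Since awk's for (k in array) visits keys in an unspecified order, the result is piped through sort to make the output predictable.

```shell
# Hypothetical helper: keep one line per distinct column-3 value,
# preferring a line whose column 2 equals 1.
dedupe_prefer_one() {
    awk '
        # First time this column-3 value is seen: remember the whole line.
        !($3 in best) { best[$3] = $0; next }
        # Value seen before: replace the stored line only if column 2 is 1.
        $2 == 1 { best[$3] = $0 }
        # Print the surviving line for each distinct column-3 value.
        END { for (k in best) print best[k] }
    ' "$1" | sort
}

# Sample data from the question, written to a temporary file.
tmp=$(mktemp)
printf '%s\n' 'a 0 3238' 'b 0 3367' 'c 0 3130' 'd 1 3130' > "$tmp"
dedupe_prefer_one "$tmp"
rm -f "$tmp"
```

With the sample data, line c is dropped in favour of line d, because both share 3130 in the third column and d has 1 in the second. If no line in a duplicate group has 1 in the second column, the first line seen is kept.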
