Questions tagged [awk]

AWK is an interpreted programming language designed for text processing and typically used as a data extraction and reporting tool. AWK is used largely with Unix systems.

AWK is an interpreted programming language (the name stands for Aho, Weinberger, and Kernighan) designed for text processing and typically used as a data extraction and reporting tool. It is a standard feature of most Unix-like operating systems.

Source: Wikipedia.

An awk program is a series of pattern-action pairs, written as:

condition { action }
condition { action }
...

where condition is typically an expression and action is a series of one or more commands separated by semicolons (;). The input is split into records, and each record is split into fields; by default, records are separated by newlines and fields by runs of spaces and tabs. For each record, every condition is checked in order and, if it is true, the commands in its action block are executed. Within the action block, fields are accessed by a 1-based index, e.g. $2 for the second field, while $0 is the whole record. If the condition is missing, the action block is executed for every record. If the condition is present but the action block is absent, the default action is print $0, which prints the current record, including any modifications made to it by earlier actions. Because a non-zero number is treated as true, awk '1' file instructs awk to perform the default action (print) for every line.
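The rules above can be illustrated with a minimal sketch. The sample data and the variable name sample are made up for this illustration and are not part of the tag wiki:

```shell
# Hypothetical sample input: three whitespace-separated fields per record.
sample='alice 30 london
bob 25 paris
carol 35 london'

# Condition and action: print field 1 of records whose field 2 exceeds 28.
printf '%s\n' "$sample" | awk '$2 > 28 { print $1 }'

# Condition only: the default action (print $0) runs for matching records.
printf '%s\n' "$sample" | awk '$3 == "london"'

# Action only: the block runs for every record.
printf '%s\n' "$sample" | awk '{ print $2, $1 }'

# A constant true condition with no action prints every record unchanged.
printf '%s\n' "$sample" | awk '1'
```

The first command prints alice and carol, the second prints the two london records in full, and the last reproduces the input as-is.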

An awk program can also have optional BEGIN and END blocks: the BEGIN action is run before any input is read, and the END action is run after all input has been read:

BEGIN     { action } 
condition { action }
condition { action }
...
END       { action }
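As a small sketch of how BEGIN, a per-record action, and END fit together, the following sums a column of numbers. The input values and the variable name sum are chosen only for illustration:

```shell
# Feed three one-field records (3, 4, 5) to a program with all three parts.
printf '%s\n' 3 4 5 | awk '
BEGIN { print "start" }      # runs once, before any input is read
      { sum += $1 }          # no condition, so it runs for every record
END   { print "sum:", sum }  # runs once, after the last record
'
```

This prints start followed by sum: 12. Uninitialised awk variables such as sum start out as zero in a numeric context, so no explicit initialisation in BEGIN is needed here.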

Awk was originally developed by Alfred Aho, Brian Kernighan and Peter Weinberger in 1977 and updated in 1985. Since then, various versions and dialects of awk have emerged. The most common are:

awk - the most common; found on most Unix-like systems. Its behaviour is specified by POSIX (IEEE Std 1003.1).
mawk - a fast AWK implementation whose code base is built around a byte-code interpreter.
nawk - during the development of AWK, the developers released an updated version under the name "new awk" to avoid confusion with the original; it is itself now very old and lacks functionality present in all POSIX awks.
gawk - also known as GNU awk. The only version whose developers have attempted to add i18n support. It also lets users write their own C shared libraries to extend it with "plug-ins". It is the default awk on many Linux distributions.

When asking questions about data processing using awk, please include complete input and desired output.

Books:

The AWK Programming Language by Aho, Kernighan & Weinberger (archive.org link)
Effective awk Programming, 4th edition by Robbins (see The GNU Awk User's Guide below for the latest online version)
Effective awk Programming, 3rd edition by Robbins
Sed & Awk, 2nd edition by Dougherty & Robbins
Sed & Awk Pocket Reference, 2nd Edition by Arnold Robbins
AWK Language Programming - free book
Awk One-Liners Explained
GNU AWK one-liners by Sundeep Agarwal (includes a chapter on regular expressions)

Resources:

Awk.Info (archive.org link)
The GNU Awk User's Guide
POSIX specification of awk
Idiomatic awk
The awk programming language tutorial site
Awk one-liners
Awk one-liners explained

Related tags:

gawk (GNU's version of awk)
nawk (A very old, pre-POSIX version also from AT&T)
mawk (A different interpreter written by Mike Brennan)
sed (A kindred tool often mentioned in the same breath)

32722 questions

2 answers

Multiline pattern matching in bash

I have a long file of the type Processin SCRIPT10 file.. Submitted batch job 1715572 Processin SCRIPT100 file.. Processin SCRIPT1000 file.. Submitted batch job 1715574 Processin SCRIPT10000 file.. Processin SCRIPT10001 file.. Processin SCRIPT10002…

bash awk

asked May 24 '17 at 09:59

VojtaK

1 answer

How to escape a percent sign in AWK printf?

I'm making an awk statement that will allow me to print a number of unicode nop's to the screen (in testing, 18 of them). It currently looks like the following: awk 'BEGIN {while (c++<18) printf "%u9090"}' When this executes this returns a run time…

awk printf

asked Apr 13 '17 at 07:04

Michael A

2 answers

Remove \r\n in awk

I have a simple awk command that converts a date from MM/DD/YYYY to YYYY/MM/DD. However, the file I'm using has \r\n at the end of the lines, and sometimes the date is at the end of the line. awk ' BEGIN { FS = OFS = "|" } { split($27, date,…

linux awk

asked Apr 09 '17 at 17:34

richie

10 answers

How to add html attributes and values for all lines quickly with vim and plugins?

My os:debian8. uname -a Linux debian 3.16.0-4-amd64 #1 SMP Debian 3.16.39-1+deb8u2 (2017-03-07) x86_64 GNU/Linux Here is my base file. home help variables compatibility modelines searching selection markers indenting reformatting folding…

html awk sed sublimetext3 emmet

asked Mar 26 '17 at 02:37

showkey

2 answers

How to compare two csv files in UNIX and create delta ( modified/ new records )

I have two csv files old.csv and new.csv. I need only new or updated records from the new.csv file. Delete records from new.csv if they exist in old.csv.…

unix awk

asked Mar 22 '17 at 16:26

user6742120

2 answers

How to pass BASH shell variables into AWK statement

I want to pass 2 shell (bash) variables into an awk statement. In this example the variables are marker1 and marker2 ubuntu@ubuntutest:/tmp$ echo $marker1 ###___showhost___### ubuntu@ubuntutest:/tmp$ echo…

bash awk

asked Mar 15 '17 at 15:39

vtecdec

1 answer

grep -vf too slow with large files

I am trying to filter data from data.txt using patterns stored in a file filter.txt. Like below, grep -v -f filter.txt data.txt > op.txt This grep takes more than 10-15 minutes for 30-40K lines in filter.txt and ~300K lines in data.txt. Is there any…

bash performance shell awk grep

asked Mar 09 '17 at 18:04

user3150037

2 answers

using variables in search pattern in awk script

#!/usr/local/bin/gawk -f ` { awkvar2="/id=22/"; awkvar3="/end/"; if ($0 ~ awkvar2) { triggered=1; } if (triggered) { print; if ($0 ~ awkvar3) { triggered=0; print…

shell awk

asked Nov 24 '10 at 12:30

Omkar

3 answers

Subtracting lines in one file from another file

I couldn't find an answer that truly subtracts one file from another. My goal is to remove lines in one file that occur in another file. Multiple occurrences should be respected, which means for example if one line occurs 4 times in file A and only…

unix awk sed

asked Mar 06 '17 at 10:53

Hawk

4 answers

Add prefix in bash command output

I would like to add a prefix to each new line of my command output. I would like to do this because I will run multiple commands in parallel which will log to the same output log. I tried to do this with AWK without success runcommand1 | "[prefix1]" +…

bash logging awk

asked Feb 27 '17 at 09:49

Lombric

2 answers

Sum durations in bash

I am getting execution time of various processes in a file from their respective log files. The file with execution time looks similar to following (it may have hundreds of entries) 1:00:01.11 2:2.20 1.02 The first line is hours:minutes:seconds,…

bash perl awk

asked Feb 20 '17 at 20:58

zatka

5 answers

How to increment a column value with an increasing number in a csv file

I have a text file with 3 columns as below. $ cat test.txt 1,A,300 1,B,300 1,C,300 Till now i have tried as, awk -F, '{$3=$3+1;print}' OFS=, test.txt But output is coming as: 1,A,301 1,B,301 1,C,301 & below is my desired output Now i want to…

awk sed

asked Feb 02 '17 at 05:36

swapneil

4 answers

Extract email addresses from log with grep or sed

Jan 23 00:46:24 portal postfix/smtp[31481]: 1B1653FEA1: to=, relay=mta5.am0.yahoodns.net[98.138.112.35]:25, delay=5.4, delays=0.02/3.2/0.97/1.1, dsn=5.0.0, status=bounced (host mta5.am0.yahoodns.net[98.138.112.35] said: 554…

regex awk sed grep cut

asked Jan 26 '17 at 11:38

sherpaurgen

6 answers

UNIX(AIX) script to process a file using only awk or other file processing utilities

I have a task to write a script that will filter an input from an MQ runmqsc command and redirect the output into another file. I have been working around using many other Linux commands piped together and it seems to work just fine in Linux, but my…

linux bash awk ibm-mq aix

asked Jan 23 '17 at 06:35

Cristian Baciu

4 answers

Remove lines with specific pattern with Bash

I have a file with one word/character for each line. Example: a abandonado esta estabelecimento o onibus c casa police I need to remove lines with a specific pattern (ex. pattern "esta"). I tried with awk cat file | awk '!/^esta/' but this solution…

linux bash awk sed

asked Dec 30 '16 at 17:47

vivas
