4.1.1 Introduction to Processing

Course subject(s): Module 4. Processing


The third stage of the pipeline processes the data we selected in the previous stage. This topic covers tools that sort data or process sorted files to extract more complex results; two mini-languages, sed and awk, that let us process specific records; and tools dedicated to particular tasks.

In the following lessons we will learn how to sort data by various criteria, find differences between files, find common elements in files, evaluate expressions, transform between character sets, perform relational joins on files, convert between data formats, manipulate graph structures, and transform multimedia files.
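As a small taste of three of these operations (sorting, relational joins, and finding common elements), here is a minimal sketch. The file names and their contents are made up for illustration and are not part of the course material; the commands used (sort, join, cut, comm) are standard Unix tools.

```shell
# Work in a scratch directory so no existing files are touched.
tmp=$(mktemp -d)

# Two small, hypothetical input files: "id name" and "id city".
printf '3 carol\n1 alice\n2 bob\n' > "$tmp/names.txt"
printf '2 Delft\n3 Leiden\n4 Utrecht\n' > "$tmp/cities.txt"

# Sort both files on the first field; join and comm expect sorted input.
sort -k1,1 "$tmp/names.txt" > "$tmp/names.sorted"
sort -k1,1 "$tmp/cities.txt" > "$tmp/cities.sorted"

# Relational join on the first field: keeps only ids present in both files.
joined=$(join "$tmp/names.sorted" "$tmp/cities.sorted")
echo "$joined"

# Common elements: extract the id column of each file, then keep
# only the lines that appear in both (comm -12 hides columns 1 and 2).
cut -d' ' -f1 "$tmp/names.sorted" > "$tmp/ids.a"
cut -d' ' -f1 "$tmp/cities.sorted" > "$tmp/ids.b"
common=$(comm -12 "$tmp/ids.a" "$tmp/ids.b")
echo "$common"

# Remove the scratch directory.
rm -rf "$tmp"
```

Sorting comes first because both join and comm assume their inputs are already ordered on the field being compared.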

Unix Tools: Data, Software and Production Engineering by TU Delft OpenCourseWare is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Based on a work at https://online-learning.tudelft.nl/courses/unix-tools-data-software-and-production-engineering/