4.1.1 Introduction to Processing
Course subject(s)
Module 4. Processing
The third part of the processing pipeline involves processing the data we have selected in the previous stage. Under this topic fall tools that sort the data or process sorted files to extract more complex results, two mini-languages, sed and awk, that allow us to process specific records, as well as tools specific for particular tasks.
In the following lessons we will learn how to sort data by various criteria, find differences between files, find common elements in files, evaluate expressions, transform between character sets, perform relational joins on files, convert between data formats, manipulate graph structures, and transform multimedia files.
Unix Tools: Data, Software and Production Engineering by TU Delft OpenCourseWare is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Based on a work at https://online-learning.tudelft.nl/courses/unix-tools-data-software-and-production-engineering/ /