4.1.1 Introduction to Processing
Module 4. Processing
The third part of the processing pipeline involves processing the data we have selected in the previous stage. Under this topic fall tools that sort the data or process sorted ﬁles to extract more complex results, two mini-languages, sed and awk, that allow us to process speciﬁc records, as well as tools speciﬁc for particular tasks.
In the following lessons we will learn how to sort data by various criteria, ﬁnd diﬀerences between ﬁles, ﬁnd common elements in ﬁles, evaluate expressions, transform between character sets, perform relational joins on ﬁles, convert between data formats, manipulate graph structures, and transform multimedia ﬁles.
Unix Tools: Data, Software and Production Engineering by TU Delft OpenCourseWare is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Based on a work at https://online-learning.tudelft.nl/courses/unix-tools-data-software-and-production-engineering/ /