Missing Semester Lecture 2 - Shell Tools and Scripting
MIT The Missing semester Lecture of Your CS Education Lecture 2 - Shell Tools and Scripting
Shell Scripting
To assign variables in bash, use the syntax foo=bar
and access the value of the variable with $foo
. Note that foo = bar
will not work since it it interpreted as calling the foo
program with arguments =
and bar
. In general, in shell scripts the space character will perform argument splitting.
Strings in bash can be defined with '
and "
delimiters, but they are not equivalent. Strings delimited with '
are literal strings and will not substitute variable values whereas "
delimited strings will.
1 | foo=bar |
Here is an example of a function that creates a directory and cd
s into it:
1 | mcd () { |
Here $1
is the first argument to the script/function. Unlike other scripting languages, bash uses a variety of special variables to refer to arguments, error codes and other relevant variables. Below is a list of some of them.
$0
- Name of the script$1
to$9
- Arguments to the script.$1
is the first argument and so on.$@
- All the arguments$#
- Number of arguments$?
- Return code of the previous command (A value of 0 usually means everything went OK; anything different from 0 means an error occurred.)$$
- PID for the current script!!
- Entire last command, including arguments.$_
- Last argument from the last command.A more comprehensive list can be found here.
Exit codes can be used to conditionally execute commands using &&
and ||
, both of which are short-circuiting operators. Commands can also be separated within the same line using a semicolon ;
.
1 | false || echo "Oops, fail" |
Another common pattern is wanting to get the output of a command as a variable. This can be done with command substitution. Whenever you place $( CMD )
it will execute CMD
, get the output of the command and substitute it in place. For example, if you do for file in $(ls)
, the shell will first call ls
and then iterate over those values.
A lesser known similar feature is process substitution, <( CMD )
will execute CMD
and place the output in a temporary file and substitute the <()
with that file’s name. This is useful when commands expect values to be passed by file instead of by STDIN. For example, diff <(ls foo) <(ls bar)
will show differences between files in dirs foo
and bar
.
Let’s see an example that showcases some of these features. It will iterate through the arguments we provide, grep
for the string foobar
, and append it to the file as a comment if it’s not found.
1 |
|
Here
grep foobar "$file" > /dev/null
means throw away the output ofgrep
.2
is a file descriptor in bash meansstderr
, so2> /dev/null
means rewire thestderr
tonull
.
Shell globbing
- Wildcards - Whenever you want to perform some sort of wildcard matching, you can use
?
and*
to match one or any amount of characters respectively. For instance, given filesfoo
,foo1
,foo2
,foo10
andbar
, the commandrm foo?
will deletefoo1
andfoo2
whereasrm foo*
will delete all butbar
. - Curly braces
{}
- Whenever you have a common substring in a series of commands, you can use curly braces for bash to expand this automatically. This comes in very handy when moving or converting files.
1 | convert image.{png,jpg} |
Exercises
Read
man ls
and write anls
command that lists files in the following manner- Includes all files, including hidden files
- Sizes are listed in human readable format (e.g. 454M instead of 454279954)
- Files are ordered by recency
- Output is colorized
A sample output would look like this
1
2
3
4
5-rw-r--r-- 1 user group 1.1M Jan 14 09:53 baz
drwxr-xr-x 5 user group 160 Jan 14 09:53 .
-rw-r--r-- 1 user group 514 Jan 14 06:42 bar
-rw-r--r-- 1 user group 106M Jan 13 12:12 foo
drwx------+ 47 user group 1.5K Jan 12 18:08 ..
1 | ls -a -t -h -l --color |
- Write bash functions
marco
andpolo
that do the following. Whenever you executemarco
the current working directory should be saved in some manner, then when you executepolo
, no matter what directory you are in,polo
shouldcd
you back to the directory where you executedmarco
. For ease of debugging you can write the code in a filemarco.sh
and (re)load the definitions to your shell by executingsource marco.sh
.
1 | marco() { |
- Say you have a command that fails rarely. In order to debug it you need to capture its output but it can be time consuming to get a failure run. Write a bash script that runs the following script until it fails and captures its standard output and error streams to files and prints everything at the end. Bonus points if you can also report how many runs it took for the script to fail.
1 |
|
1 |
|
- As we covered in the lecture
find
’s-exec
can be very powerful for performing operations over the files we are searching for. However, what if we want to do something with all the files, like creating a zip file? As you have seen so far commands will take input from both arguments and STDIN. When piping commands, we are connecting STDOUT to STDIN, but some commands liketar
take inputs from arguments. To bridge this disconnect there’s thexargs
command which will execute a command using STDIN as arguments. For examplels | xargs rm
will delete the files in the current directory. Your task is to write a command that recursively finds all HTML files in the folder and makes a zip with them. Note that your command should work even if the files have spaces (hint: check-d
flag forxargs
).
1 | [root@localhost missing]# find . -name "*.html" | xargs -d "\n" tar -cf htmls.tar |
- (Advanced) Write a command or script to recursively find the most recently modified file in a directory. More generally, can you list all files by recency?
1 | find . -type f | ls -t | head -1 |
1 | find . -type f | ls -t |