We are only going to spend one week on relational databases, but we could easily spend the entire semester on them. We will focus on what is probably the feature that is most likely to be of use to you, and that Excel does not support well: one to many relational joins. Unless you decide to learn more about MS Access on your own, outside of class, you will probably enter data in Excel, import it into Access, join your tables, then export the joined tables back into Excel. That is what we will do today.

1. Start MS Access and make a blank database. Before we do any importing of data from Excel, we'll make a blank database in which to put the data. Note that Access is not available for the Mac, so if you're using your own Mac you'll need to use Citrix for today.

Start Access 365 and you will see a fairly cluttered startup screen that looks like this. We are going to ignore much of the clutter, but most of it is providing easy access to templates for a variety of common business databases (Access is marketed heavily towards business users).

All we need is to click the "Blank database" button, which will bring up a "Blank database" creation window. Click on the "open folder" icon next to the file name field, and then navigate to the location on OneDrive, name the database "mice" and click "OK".

Then, back in the "Blank database" window click "Create" to make the new, empty database. You will now see the Access interface with a new, blank table called Table1 (it will look like this). We'll import the data from an Excel file, which we'll go download now.

2. Download the data. Download this spreadsheet (mice.xlsx), and save it to your OneDrive (it's convenient to put it in the same place as your mice.accdb that you just created). Once you have it downloaded open mice.xlsx in Excel to see how the data are organized.

Since there will only ever be one row in the Mouse table for each mouse, but there will be 10 rows for each mouse in the Weights table, there is a one to many relationship between the Mouse and the Weights table.

Be sure to close Excel before you proceed - Access won't open this file while Excel has it open.

3. Import the first worksheet, Mouse, into Access. To import the data we'll use the "External Data" import tools.

With the import completed you will see a new table called "Mouse" in the list on the left of the screen, and if you double-click it the table will open and show you the data (it should look like this).

4. Import the Weights sheet from the Excel file. Repeat the import steps you just completed, but this time import sheet Weights. When you are asked to define the data types for the fields use:

Allow Access to create a primary key for this table - since there are multiple records for weights of the same mice we can't use Mouse ID as a primary key for the weights table.

When you're done you'll see a second new table, Weights, in your list. Double-click it and you'll see this.

Relational joins and queries

5. Joining the tables using a query. Now we will join the Mouse and Weights data based on matching Mouse ID's. Joining is done in a query, which is a term that refers to all manner of operations that retrieve data from a database. Queries can be designed and then saved, so we use "query" both as a verb (to query the database is to retrieve data from it) and a noun (a query is a set of commands that retrieve the data). We will design a query (noun) in order to query the joined mouse and weights data (verb).

To open the query design view switch to the "Create" tab, and select the "Query Design" option. In the query designer form that pops up do the following:

Note that this query has all the ages, sexes, and diets repeated for each day of measurement, without having to actually enter this information multiple times.

Also, be aware that while the tables are permanently stored on disk, the query is a temporary, on-the-fly reporting of data from the tables. Queries can be saved and re-executed to re-create a view of the joined files, and queries can even be used as though they are tables in other queries. It's possible to use queries to combine data from tables for viewing, and only if the data need to be combined together permanently is it necessary to make a new table with the query output.

To use the query-joined data to do some analysis in another program, such as Excel, you will need to export the query. That's what you'll do next.

Moving the data from an Access query to an Excel spreadsheet

6. Copy and paste the query data to Excel. The simplest, quickest way to get the data back into Excel is to copy and paste it. To do this:

Another way to get the data out of Access and into Excel is to export the query as an Excel file - we'll do that next, after learning to filter the data based on one of the query fields.

7. Filter the data based on sex, and then export to an Excel file. In addition to using queries to pick which fields we want to export we can also filter the data within a variable like we did in Excel. To do this:

You can now export this version of the query by:

If you open the file in Excel you'll see the female data in it.

Relational joins in Excel

Although the relational database management capabilities of Excel are much more limited than in Access, it is possible to establish relationships between tables based on matching columns, and then use the joined tables as though they are one in a PivotTable. Let's briefly see how this is done, using the mouse.xlsx file.

1. Open the mouse.xlsx file in Excel. Since we imported our data from mouse.xlsx into Access for joining, it is already in the right format to set up a join in Excel. In general, though, to join tables in Excel they have to be organized as stacked data - rows that indicate individual observations, columns that indicate variables.

2. Format the Mouse data as a table. To use data for relational joins we first have to format the data sets as tables, and give them names. To do this:

Now, do the same for the data in the Weights sheet - call the table you create "weight".

3. Establish the relationship between mouse and weight. Next we need to tell Excel how our tables are related:

The relationship between the tables is now established, and we can use it to make a PivotTable that uses data from both of them.

4. Use the joined tables in a PivotTable. To use the joined data we will insert a PivotTable, but with one difference from our previous ones...

You will get a blank PivotTable with both of your tables listed in the PivotTable Fields. You can drag these into row, column, and sigma-values boxes as you like to make a PivotTable that uses the joined data. For example, if we want to calculate the average weights for male and female mice you can drag Sex from mouse into the rows, and Weight from weights into the Σ Values field, then change the value field settings to Average - this will use the sexes from one table for grouping, and the weights in the other table for the calculation.

There are some potential pitfalls, though, in using Excel for this - we can see the problem if we use the PivotTable to count up the number of measurement for each mouse. Let's make a new PivotTable to explore this:

Why are these two counts of sample sizes different? Because the PivotTable still treats the joined data as two separate tables, and counts how many rows there are with the same Mouse ID in whichever table you've used - one for Mouse, and ten for weights.

This would not happen in Access, because the joined data reported by the query is treated as a single table. When the query is run and Mouse ID is reported there are ten repetitions of each ID, regardless of whether Mouse ID came from the Mouse table or the Weights table.

The other thing to be aware of in using joins in Excel is that they are limited to joining two tables at a time, whereas any number of tables can be joined in Access. We only had two tables to join, but if we had additional ones we could have joined them as well as long as they had a matching column with one of the tables.

Assignment

When you're done, post a copy of your Access database file, the Excel files with your exported query data, and your mouse.xlsx file with the joined tables and PivotTable to the class web site.