Remember to
- Remind students to install Firefox.
Walk students through installing SQLite Manager Add-on.
Why use a database management system
- Simple and effective software for storing, managing and retrieving information
- Widely used to store and maintain lots of existing data
- Fast processing across large amounts of data
- Out-of-memory and remote processing for really large data
- Can handle spatially explicit queries (GIS)
- Improve quality control of data entry
- Set valid data type and value constraints.
- Use data entry forms in Excel, Access, Filemaker, etc.
- Use quality control scripts to test entered data.
- Concepts are core to understanding data manipulation in other languages
Relational databases
- We will be working with ‘Relational Databases’.
- Data is stored in tables.
- One table per type of data
- Tables can be linked together to combine information.
- Each row contains a single record.
- A single observation or data point
- Each column or field contains a single attribute.
- A single type of information
- One table per type of data
- The concepts of relational database management are core to understanding how to do similar things using programming languages such as R or Python.
- Data is stored in tables.
- We will primarily use SQLite.
- Simple to use
- Almost no work to set up
- Stored in single file
- But, there are lots of alternatives
- Access - commonly used, GUI
- PostgreSQL - fast/powerful, lots of users
Importing table into SQLite Manager
- Download the Portal Project survey data
- Open SQLite Manager
Database
->New Database
-> nameportal_mammals
->OK
->Open
(don’t select any files)Import
button orDatabase
->Import
Select File
-> navigate to just downloadedsurveys.csv
->Open
- Check
First row contains column names
& selectComma
option OK
to modify the data- Name table
surveys
-
Select data types
- Fields in databases have types that define the kind of data they contain.
- Each field/column can only have one type.
- We have to define the types in advance.
- Types include
- Integer
- Text (varchar)
- Decimal/Double/Real/Float
- Types are highly configurable for when space is limiting
- Maximum value of integers
- Maximum length of text
- How many values before and after the decimal place
-
Select
record_id
as thePrimary Key
and clickOK
.- Every table “needs” a column (or set of columns) that is unique across records/row.
- This is called the
Primary Key
. - The easiest way to do this is to use an
INTEGER
that increments every time a new record is added. - Many databases that you import will already have a field like this.
- Otherwise autogenerates one.
Show students the
- objects panel with
Tables
andViews
.Browse & Search
tab.Structure
tab.
We’ve just done most of Exercise 3. Do Exercise 3.11 - Importing Data.