Active Record
Although Yii DAO can handle virtually any database-related task, chances are that we would spend 90% of our time in writing some SQL statements which perform the common CRUD (create, read, update and delete) operations. It is also difficult to maintain our code when they are mixed with SQL statements. To solve these problems, we can use Active Record.
Active Record (AR) is a popular Object-Relational Mapping (ORM) technique.
Each AR class represents a database table (or view) whose attributes are
represented as the AR class properties, and an AR instance represents a row
in that table. Common CRUD operations are implemented as AR methods. As a
result, we can access our data in a more object-oriented way. For example,
we can use the following code to insert a new row to the Post
table:
$post=new Post;
$post->title='sample post';
$post->content='post body content';
$post->save();
In the following we describe how to set up AR and use it to perform CRUD
operations. We will show how to use AR to deal with database relationships
in the next section. For simplicity, we use the following database table
for our examples in this section. Note that if you are using MySQL database,
you should replace AUTOINCREMENT
with AUTO_INCREMENT
in the following SQL.
CREATE TABLE Post ( id INTEGER NOT NULL PRIMARY KEY AUTOINCREMENT, title VARCHAR(128) NOT NULL, content TEXT NOT NULL, createTime INTEGER NOT NULL );
Note: AR is not meant to solve all database-related tasks. It is best used for modeling database tables in PHP constructs and performing queries that do not involve complex SQLs. Yii DAO should be used for those complex scenarios.
1. Establishing DB Connection ¶
AR relies on a DB connection to perform DB-related operations. By default,
it assumes that the db
application component gives the needed
CDbConnection instance which serves as the DB connection. The following
application configuration shows an example:
return array(
'components'=>array(
'db'=>array(
'class'=>'system.db.CDbConnection',
'connectionString'=>'sqlite:path/to/dbfile',
// turn on schema caching to improve performance
// 'schemaCachingDuration'=>3600,
),
),
);
Tip: Because Active Record relies on the metadata about tables to determine the column information, it takes time to read the metadata and analyze it. If the schema of your database is less likely to be changed, you should turn on schema caching by configuring the CDbConnection::schemaCachingDuration property to be a value greater than 0.
Support for AR is limited by DBMS. Currently, only the following DBMS are supported:
Note: The support for Microsoft SQL Server has been available since version 1.0.4; And the support for Oracle has been available since version 1.0.5.
If you want to use an application component other than db
, or if you
want to work with multiple databases using AR, you should override
CActiveRecord::getDbConnection(). The CActiveRecord class is the base
class for all AR classes.
Tip: There are two ways to work with multiple databases in AR. If the schemas of the databases are different, you may create different base AR classes with different implementation of getDbConnection(). Otherwise, dynamically changing the static variable CActiveRecord::db is a better idea.
2. Defining AR Class ¶
To access a database table, we first need to define an AR class by
extending CActiveRecord. Each AR class represents a single database
table, and an AR instance represents a row in that table. The following
example shows the minimal code needed for the AR class representing the
Post
table.
class Post extends CActiveRecord
{
public static function model($className=__CLASS__)
{
return parent::model($className);
}
}
Tip: Because AR classes are often referenced in many places, we can import the whole directory containing the AR class, instead of including them one by one. For example, if all our AR class files are under
protected/models
, we can configure the application as follows:return array( 'import'=>array( 'application.models.*', ), );
By default, the name of the AR class is the same as the database table name. Override the tableName() method if they are different. The model() method is declared as such for every AR class (to be explained shortly).
Column values of a table row can be accessed as properties of the
corresponding AR class instance. For example, the following code sets the
title
column (attribute):
$post=new Post;
$post->title='a sample post';
Although we never explicitly declare the title
property in the Post
class, we can still access it in the above code. This is because title
is
a column in the Post
table, and CActiveRecord makes it accessible as a
property with the help of the PHP __get()
magic method. An exception will
be thrown if we attempt to access a non-existing column in the same way.
Info: In this guide, we name columns using camel cases (e.g.
createTime
). This is because columns are accessed in the way as normal object properties which also uses camel-case naming. While using camel case does make our PHP code look more consistent in naming, it may introduce case-sensitivity problem for some DBMS. For example, PostgreSQL treats column names as case-insensitive by default, and we must quote a column in a query condition if the column contains mixed-case letters. For this reason, it may be wise to name columns (and also tables) only in lower-case letters (e.g.create_time
) to avoid any potential case-sensitivity issues.
3. Creating Record ¶
To insert a new row into a database table, we create a new instance of the corresponding AR class, set its properties associated with the table columns, and call the save() method to finish the insertion.
$post=new Post;
$post->title='sample post';
$post->content='content for the sample post';
$post->createTime=time();
$post->save();
If the table's primary key is auto-incremental, after the insertion the AR
instance will contain an updated primary key. In the above example, the
id
property will reflect the primary key value of the newly inserted
post, even though we never change it explicitly.
If a column is defined with some static default value (e.g. a string, a number) in the table schema, the corresponding property in the AR instance will automatically has such a value after the instance is created. One way to change this default value is by explicitly declaring the property in the AR class:
class Post extends CActiveRecord
{
public $title='please enter a title';
......
}
$post=new Post;
echo $post->title; // this would display: please enter a title
Starting from version 1.0.2, an attribute can be assigned a value of CDbExpression
type before the record is saved (either insertion or updating) to the database.
For example, in order to save a timestamp returned by the MySQL NOW()
function,
we can use the following code:
$post=new Post;
$post->createTime=new CDbExpression('NOW()');
// $post->createTime='NOW()'; will not work because
// 'NOW()' will be treated as a string
$post->save();
Tip: While AR allows us to perform database operations without writing cumbersom SQL statements, we often want to know what SQL statements are executed by AR underneath. This can be achieved by turning on the logging feature of Yii. For example, we can turn on CWebLogRoute in the application configuration, and we will see the executed SQL statements being displayed at the end of each Web page. Since version 1.0.5, we can set CDbConnection::enableParamLogging to be true in the application configuration so that the parameter values bound to the SQL statements are also logged.
4. Reading Record ¶
To read data in a database table, we call one of the find
methods as
follows.
// find the first row satisfying the specified condition
$post=Post::model()->find($condition,$params);
// find the row with the specified primary key
$post=Post::model()->findByPk($postID,$condition,$params);
// find the row with the specified attribute values
$post=Post::model()->findByAttributes($attributes,$condition,$params);
// find the first row using the specified SQL statement
$post=Post::model()->findBySql($sql,$params);
In the above, we call the find
method with Post::model()
. Remember
that the static method model()
is required for every AR class. The method
returns an AR instance that is used to access class-level methods
(something similar to static class methods) in an object context.
If the find
method finds a row satisfying the query conditions, it will
return a Post
instance whose properties contain the corresponding column
values of the table row. We can then read the loaded values like we do with
normal object properties, for example, echo $post->title;
.
The find
method will return null if nothing can be found in the database
with the given query condition.
When calling find
, we use $condition
and $params
to specify query
conditions. Here $condition
can be string representing the WHERE
clause
in a SQL statement, and $params
is an array of parameters whose values
should be bound to the placeholders in $condition
. For example,
// find the row with postID=10
$post=Post::model()->find('postID=:postID', array(':postID'=>10));
Note: In the above, we may need to escape the reference to the
postID
column for certain DBMS. For example, if we are using PostgreSQL, we would have to write the condition as"postID"=:postID
, because PostgreSQL by default will treat column names as case-insensitive.
We can also use $condition
to specify more complex query conditions.
Instead of a string, we let $condition
be a CDbCriteria instance, which
allows us to specify conditions other than the WHERE
clause. For example,
$criteria=new CDbCriteria;
$criteria->select='title'; // only select the 'title' column
$criteria->condition='postID=:postID';
$criteria->params=array(':postID'=>10);
$post=Post::model()->find($criteria); // $params is not needed
Note, when using CDbCriteria as query condition, the $params
parameter
is no longer needed since it can be specified in CDbCriteria, as shown
above.
An alternative way to CDbCriteria is passing an array to the find
method.
The array keys and values correspond to the criteria's property name and value,
respectively. The above example can be rewritten as follows,
$post=Post::model()->find(array(
'select'=>'title',
'condition'=>'postID=:postID',
'params'=>array(':postID'=>10),
));
Info: When a query condition is about matching some columns with the specified values, we can use findByAttributes(). We let the
$attributes
parameters be an array of the values indexed by the column names. In some frameworks, this task can be achieved by calling methods likefindByNameAndTitle
. Although this approach looks attractive, it often causes confusion, conflict and issues like case-sensitivity of column names.
When multiple rows of data matching the specified query condition, we can
bring them in all together using the following findAll
methods, each of
which has its counterpart find
method, as we already described.
// find all rows satisfying the specified condition
$posts=Post::model()->findAll($condition,$params);
// find all rows with the specified primary keys
$posts=Post::model()->findAllByPk($postIDs,$condition,$params);
// find all rows with the specified attribute values
$posts=Post::model()->findAllByAttributes($attributes,$condition,$params);
// find all rows using the specified SQL statement
$posts=Post::model()->findAllBySql($sql,$params);
If nothing matches the query condition, findAll
would return an empty
array. This is different from find
who would return null if nothing is
found.
Besides the find
and findAll
methods described above, the following
methods are also provided for convenience:
// get the number of rows satisfying the specified condition
$n=Post::model()->count($condition,$params);
// get the number of rows using the specified SQL statement
$n=Post::model()->countBySql($sql,$params);
// check if there is at least a row satisfying the specified condition
$exists=Post::model()->exists($condition,$params);
5. Updating Record ¶
After an AR instance is populated with column values, we can change them and save them back to the database table.
$post=Post::model()->findByPk(10);
$post->title='new post title';
$post->save(); // save the change to database
As we can see, we use the same save() method to
perform insertion and updating operations. If an AR instance is created
using the new
operator, calling save() would insert
a new row into the database table; if the AR instance is the result of some
find
or findAll
method call, calling save() would
update the existing row in the table. In fact, we can use
CActiveRecord::isNewRecord to tell if an AR instance is new or not.
It is also possible to update one or several rows in a database table without loading them first. AR provides the following convenient class-level methods for this purpose:
// update the rows matching the specified condition
Post::model()->updateAll($attributes,$condition,$params);
// update the rows matching the specified condition and primary key(s)
Post::model()->updateByPk($pk,$attributes,$condition,$params);
// update counter columns in the rows satisfying the specified conditions
Post::model()->updateCounters($counters,$condition,$params);
In the above, $attributes
is an array of column values indexed by column
names; $counters
is an array of incremental values indexed by column
names; and $condition
and $params
are as described in the previous
subsection.
6. Deleting Record ¶
We can also delete a row of data if an AR instance has been populated with this row.
$post=Post::model()->findByPk(10); // assuming there is a post whose ID is 10
$post->delete(); // delete the row from the database table
Note, after deletion, the AR instance remains unchanged, but the corresponding row in the database table is already gone.
The following class-level methods are provided to delete rows without the need of loading them first:
// delete the rows matching the specified condition
Post::model()->deleteAll($condition,$params);
// delete the rows matching the specified condition and primary key(s)
Post::model()->deleteByPk($pk,$condition,$params);
7. Data Validation ¶
When inserting or updating a row, we often need to check if the column values comply to certain rules. This is especially important if the column values are provided by end users. In general, we should never trust anything coming from the client side.
AR performs data validation automatically when save() is being invoked. The validation is based on the rules specified by in the rules() method of the AR class. For more details about how to specify validation rules, refer to the Declaring Validation Rules section. Below is the typical workflow needed by saving a record:
if($post->save())
{
// data is valid and is successfully inserted/updated
}
else
{
// data is invalid. call getErrors() to retrieve error messages
}
When the data for inserting or updating is submitted by end users in an HTML form, we need to assign them to the corresponding AR properties. We can do so like the following:
$post->title=$_POST['title'];
$post->content=$_POST['content'];
$post->save();
If there are many columns, we would see a long list of such assignments. This can be alleviated by making use of the attributes property as shown below. More details can be found in the Securing Attribute Assignments section and the Creating Action section.
// assume $_POST['Post'] is an array of column values indexed by column names
$post->attributes=$_POST['Post'];
$post->save();
8. Comparing Records ¶
Like table rows, AR instances are uniquely identified by their primary key values. Therefore, to compare two AR instances, we merely need to compare their primary key values, assuming they belong to the same AR class. A simpler way is to call CActiveRecord::equals(), however.
Info: Unlike AR implementation in other frameworks, Yii supports composite primary keys in its AR. A composite primary key consists of two or more columns. Correspondingly, the primary key value is represented as an array in Yii. The primaryKey property gives the primary key value of an AR instance.
9. Customization ¶
CActiveRecord provides a few placeholder methods that can be overridden in child classes to customize its workflow.
beforeValidate and afterValidate: these are invoked before and after validation is performed.
beforeSave and afterSave: these are invoked before and after saving an AR instance.
beforeDelete and afterDelete: these are invoked before and after an AR instance is deleted.
afterConstruct: this is invoked for every AR instance created using the
new
operator.beforeFind: this is invoked before an AR finder is used to perform a query (e.g.
find()
,findAll()
). This has been available since version 1.0.9.afterFind: this is invoked after every AR instance created as a result of query.
10. Using Transaction with AR ¶
Every AR instance contains a property named dbConnection which is a CDbConnection instance. We thus can use the transaction feature provided by Yii DAO if it is desired when working with AR:
$model=Post::model();
$transaction=$model->dbConnection->beginTransaction();
try
{
// find and save are two steps which may be intervened by another request
// we therefore use a transaction to ensure consistency and integrity
$post=$model->findByPk(10);
$post->title='new post title';
$post->save();
$transaction->commit();
}
catch(Exception $e)
{
$transaction->rollBack();
}
11. Named Scopes ¶
Note: The support for named scopes has been available since version 1.0.5. The original idea of named scopes came from Ruby on Rails.
A named scope represents a named query criteria that can be combined with other named scopes and applied to an active record query.
Named scopes are mainly declared in the CActiveRecord::scopes() method as name-criteria pairs. The following code declares two named scopes, published
and recently
, in the Post
model class:
class Post extends CActiveRecord
{
......
public function scopes()
{
return array(
'published'=>array(
'condition'=>'status=1',
),
'recently'=>array(
'order'=>'createTime DESC',
'limit'=>5,
),
);
}
}
Each named scope is declared as an array which can be used to initialize a CDbCriteria instance. For example, the recently
named scope specifies that the order
property to be createTime DESC
and the limit
property to be 5, which translates to a query criteria that should bring back the most recent 5 posts.
Named scopes are mostly used as modifiers to the find
method calls. Several named scopes may be chained together and result in a more restrictive query result set. For example, to find the recently published posts, we can use the following code:
$posts=Post::model()->published()->recently()->findAll();
In general, named scopes must appear to the left of a find
method call. Each of them provides a query criteria, which is combined with other criterias, including the one passed to the find
method call. The net effect is like adding a list of filters to a query.
Starting from version 1.0.6, named scopes can also be used with update
and delete
methods. For example, the following code would delete all recently published posts:
Post::model()->published()->recently()->delete();
Note: Named scopes can only be used with class-level methods. That is, the method must be called using
ClassName::model()
.
Parameterized Named Scopes
Named scopes can be parameterized. For example, we may want to customize the number of posts specified by the recently
named scope. To do so, instead of declaring the named scope in the CActiveRecord::scopes method, we need to define a new method whose name is the same as the scope name:
public function recently($limit=5)
{
$this->getDbCriteria()->mergeWith(array(
'order'=>'createTime DESC',
'limit'=>$limit,
));
return $this;
}
Then, we can use the following statement to retrieve the 3 recently published posts:
$posts=Post::model()->published()->recently(3)->findAll();
If we do not supply the parameter 3 in the above, we would retrieve the 5 recently published posts by default.
Default Named Scope
A model class can have a default named scope that would be applied for all queries (including relational ones) about the model. For example, a website supporting multiple languages may only want to display contents that are in the language the current user specifies. Because there may be many queries about the site contents, we can define a default named scope to solve this problem. To do so, we override the CActiveRecord::defaultScope method as follows,
class Content extends CActiveRecord
{
public function defaultScope()
{
return array(
'condition'=>"language='".Yii::app()->language."'",
);
}
}
Now, if the following method call will automatically use the query criteria as defined above:
$contents=Content::model()->findAll();
Note that default named scope only applies to SELECT
queries. It is ignored for INSERT
, UPDATE
and DELETE
queries.