Introduction
A derived table is a select statement inside parenthesis,
with an alias, used as a table in a join.
A simple example is:
SELECT a.LastName, b.State
FROM UserTable a INNER JOIN
(SELECT UserID, State FROM AddressTable)
b
ON a.UserID = b.UserID
The table with the alias b is the derived table. Of course, in the above example you could
just as easily do the join without the derived table. All code in this article was tested on SQL
Server 7.0 and SQL Server 2000.
Derived Table Example 1
I have found two cases where a derived table can be useful
First we need to set up a few test tables. You can copy and paste the following code
into Query Analyzer:
CREATE TABLE #Components (UserID
int, ComponentName varchar(20),
SerialNumber varchar(10))
CREATE TABLE #Users (UserID int, LocationID int, UserName varchar(20))
CREATE TABLE #Locations (LocationID
int, State char(2))
INSERT INTO #Locations (LocationID,
State)
SELECT 1, 'VA'
UNION
SELECT 2, 'MD'
INSERT INTO #Users (UserID, LocationID, UserName)
SELECT 1, 1, 'Robert Marda'
UNION
SELECT 2, 1, 'Arthur Landers'
UNION
SELECT 3, 2, 'Flecha Ardiente'
UNION
SELECT 4, 2, 'Lia Bakker'
INSERT INTO #Components (UserID, ComponentName, SerialNumber)
SELECT 1, 'Computer', 'A003B2'
UNION
SELECT 1, 'Computer', 'A003B5'
UNION
SELECT 2, 'Computer', 'A003B7'
UNION
SELECT 2, 'Laptop', 'CG15R4D'
UNION
SELECT 3, 'Computer', 'A003B10'
UNION
SELECT 3, 'Laptop', 'CG15R1Z'
UNION
SELECT 3, 'Computer', 'A003B0'
UNION
SELECT 4, 'Laptop', 'CG15R4W'
UNION
SELECT 4, 'Computer', 'A003B6'
UNION
SELECT 4, 'Laptop', 'CG15R3E'
Now for our first result set lets
suppose we need to know how many users are at each location and how many
components are at each location. Here is
a query I have seen developers use to try and get the desired results:
SELECT DISTINCT l.State, COUNT(u.LocationID) AS UserCount, COUNT(c.UserID) AS ComponentCount
FROM #Locations l
INNER JOIN #Users u ON l.LocationID
= u.LocationID
INNER JOIN #Components c ON c.UserID
= u.UserID
GROUP BY l.State
This query will return the following results:
State UserCount ComponentCount
----- ----------- --------------
MD 6 6
VA 4 4
(2 row(s) affected)
The query returns the correct number of components but not
the correct number of users. One way to
get the desired results without changing the database structure is to use a
derived table like this:
SELECT DISTINCT l.State, COUNT(u.LocationID) AS UserCount, c.ComponentCount
FROM #Locations l
INNER JOIN #Users u ON l.LocationID
= u.LocationID
INNER JOIN (SELECT u.LocationID, COUNT(c.ComponentName) AS ComponentCount
FROM #Components c
INNER JOIN #Users u ON c.UserID
= u.UserID
GROUP
BY u.LocationID) c ON c.LocationID
= l.LocationID
GROUP BY l.State,c.ComponentCount
ORDER BY l.State
Executing this query returns the following results:
State UserCount ComponentCount
----- ----------- --------------
MD 2 6
VA 2 4
(2 row(s) affected)
The derived table is everything in the parenthesis and has
the alias name c. To get the desired
results we simply moved one of the count functions into the derived table and
the results came back as needed.
Derived Table Example 2
This next example shows a case where a simple join will not work. This occurs when you need to use the key word distinct and you are using the case statement in the order by clause. Using the same
tables and test data previously created lets say we need to know what
components are at each location. We want
to be able to sort by the state or the component name by setting a
variable. The following query could be
used:
DECLARE @orderby int
SET @orderby = 1
SELECT
l.State, ComponentName
FROM #Locations l
INNER JOIN #Users u ON l.LocationID
= u.LocationID
INNER JOIN #Components c ON c.UserID
= u.UserID
ORDER BY CASE @orderby
WHEN 1 THEN l.State
WHEN 2 THEN ComponentName
END
Here are the results:
State ComponentName
----- --------------------
MD Laptop
MD Laptop
MD Computer
MD Computer
MD Laptop
MD Computer
VA Computer
VA Computer
VA Computer
VA Laptop
(10 row(s) affected)
Now we need to eliminate the duplicates and so we add the
distinct key word to the query:
DECLARE @orderby int
SET @orderby = 1
SELECT DISTINCT
l.State, ComponentName
FROM #Locations l
INNER JOIN #Users u ON l.LocationID
= u.LocationID
INNER JOIN #Components c ON c.UserID
= u.UserID
ORDER BY CASE @orderby
WHEN 1 THEN l.State
WHEN 2 THEN ComponentName
END
However, when we execute the modified query we get the
following error:
Server: Msg 145, Level 15, State
1, Line 5
ORDER BY items must appear in the select list if SELECT
DISTINCT is specified.
I will not claim to understand why the error is given since
as far as I can see I have specified both columns in the order by and the
select list. Just adding the keyword
distinct breaks the query. The way I
found to get around this problem is to use a derived table and put the SELECT
DISTINCT in the derived table. The
modified query looks like this:
DECLARE @orderby int
SET @orderby = 1
SELECT --DISTINCT
l.State, ComponentName
FROM #Locations l
INNER JOIN
(SELECT DISTINCT LocationID, ComponentName FROM #Users u
INNER JOIN #Components c ON c.UserID
= u.UserID) derived ON l.LocationID
= derived.LocationID
ORDER BY CASE @orderby
WHEN 1 THEN l.State
WHEN 2 THEN ComponentName
END
This query gives us the following result set:
State ComponentName
----- --------------------
MD Computer
MD Laptop
VA Computer
VA Laptop
(4 row(s) affected)
In the above query we converted a join between the table
#Users and #Components into the derived table and moved the distinct keyword to
the derived table and were able to get the desired results.
Conclusions
I use derived tables whenever convenient. It is often easier than creating a temp table and usually works better. The above two examples show cases when it can be useful, perhaps required to use something other than a simple join and derived tables work well in both the examples I have given.