Useless joins…

Sometimes when you’re building queries you end up joining tables you don’t really need (anymore). They are only there to reach some other table that holds the data you do need; the join merely follows the foreign key references. It can, however, greatly impact the speed of your query. If the database doesn’t need to look at a table at all, the query is normally much faster (if you don’t have to do the work, you’re always finished).

Look at the following test script and see how it can be done without the extra join:

select blog1.*
     , blog2.*
  from blog1
  join blog3 on blog1.blog3_id = blog3.id
  join blog2 on blog2.blog3_id = blog3.id

and:

select blog1.*
     , blog2.*
  from blog1
  join blog2 on blog2.blog3_id = blog1.blog3_id

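The results for both queries are the same, but the second query has less work to do. The shortcut is safe because blog3.id is the primary key and both blog3_id columns are foreign keys to it: every non-null blog3_id matches exactly one row in blog3, so the detour through blog3 neither adds nor removes rows. When you look at the explain plans you can see the difference: the second plan no longer contains the NESTED LOOPS step and the INDEX UNIQUE SCAN on PK_BLOG3_ID.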

PLAN_TABLE_OUTPUT
--------------------------------------------------------------------------------
Plan hash value: 3385940288

--------------------------------------------------------------------------------
| Id  | Operation           | Name        | Rows  | Bytes | Cost (%CPU)| Time
--------------------------------------------------------------------------------
|   0 | SELECT STATEMENT    |             |     3 |   279 |     7  (15)| 00:00:0
|*  1 |  HASH JOIN          |             |     3 |   279 |     7  (15)| 00:00:0
|   2 |   NESTED LOOPS      |             |     3 |   159 |     3   (0)| 00:00:0
|   3 |    TABLE ACCESS FULL| BLOG1       |     3 |   120 |     3   (0)| 00:00:0
|*  4 |    INDEX UNIQUE SCAN| PK_BLOG3_ID |     1 |    13 |     0   (0)| 00:00:0
|   5 |   TABLE ACCESS FULL | BLOG2       |     3 |   120 |     3   (0)| 00:00:0
--------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - access("BLOG2"."BLOG3_ID"="BLOG3"."ID")
   4 - access("BLOG1"."BLOG3_ID"="BLOG3"."ID")

Note
-----
   - dynamic sampling used for this statement

22 rows selected

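PLAN_TABLE_OUTPUT
--------------------------------------------------------------------------------
Plan hash value: 2974663523

----------------------------------------------------------------------------
| Id  | Operation          | Name  | Rows  | Bytes | Cost (%CPU)| Time     |
----------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |       |     3 |   240 |     7  (15)| 00:00:01 |
|*  1 |  HASH JOIN         |       |     3 |   240 |     7  (15)| 00:00:01 |
|   2 |   TABLE ACCESS FULL| BLOG1 |     3 |   120 |     3   (0)| 00:00:01 |
|   3 |   TABLE ACCESS FULL| BLOG2 |     3 |   120 |     3   (0)| 00:00:01 |
----------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - access("BLOG2"."BLOG3_ID"="BLOG1"."BLOG3_ID")

Note
-----
   - dynamic sampling used for this statement

19 rows selected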

Note that in this demo very little work is done, so you may not even notice the difference in performance. With real tables holding real data, however, exterminating the useless joins can speed up your queries considerably, sometimes by an order of magnitude.
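The claim that both queries return the same rows leans on the foreign keys from blog1 and blog2 to blog3. As a quick sanity check, here is a minimal sketch in Python that rebuilds the three tables in an in-memory SQLite database and compares the results. SQLite is only a stand-in for Oracle here, and the helper names (build_db, rows, compare) and the orphan rows with blog3_id 99 are my own assumptions for illustration, not part of the original test script.

```python
import sqlite3

# The two queries from the post: one through blog3, one without it.
WITH_BLOG3 = """
    select blog1.*, blog2.*
      from blog1
      join blog3 on blog1.blog3_id = blog3.id
      join blog2 on blog2.blog3_id = blog3.id
"""
WITHOUT_BLOG3 = """
    select blog1.*, blog2.*
      from blog1
      join blog2 on blog2.blog3_id = blog1.blog3_id
"""


def build_db(enforce_fk):
    """Create the three demo tables in SQLite (a stand-in for Oracle)."""
    con = sqlite3.connect(":memory:")
    # SQLite only checks foreign keys when this pragma is switched on.
    con.execute(f"pragma foreign_keys = {'on' if enforce_fk else 'off'}")
    con.executescript("""
        create table blog3 (id integer primary key, value1 text, value2 text);
        create table blog1 (id integer primary key, value1 text, value2 text,
                            blog3_id integer references blog3 (id));
        create table blog2 (id integer primary key, value1 text, value2 text,
                            blog3_id integer references blog3 (id));
    """)
    data = [(1, "one", "first"), (2, "two", "second"), (3, "three", "third")]
    con.executemany("insert into blog3 values (?, ?, ?)", data)
    for table in ("blog1", "blog2"):
        con.executemany(f"insert into {table} values (?, ?, ?, ?)",
                        [(i, v1, v2, i) for i, v1, v2 in data])
    return con


def rows(con, sql):
    """Run a query and return its rows in a stable order for comparison."""
    return sorted(con.execute(sql).fetchall())


def compare(enforce_fk, add_orphans=False):
    """Return the results of both queries as a (with_blog3, without_blog3) pair."""
    con = build_db(enforce_fk)
    if add_orphans:
        # Hypothetical rows pointing at a blog3 row that does not exist.
        for table in ("blog1", "blog2"):
            con.execute(f"insert into {table} values (4, 'four', 'fourth', 99)")
    return rows(con, WITH_BLOG3), rows(con, WITHOUT_BLOG3)


with_fk = compare(enforce_fk=True)
print("same rows with foreign keys enforced:", with_fk[0] == with_fk[1])

no_fk = compare(enforce_fk=False, add_orphans=True)
print("row counts with orphan rows:", len(no_fk[0]), "versus", len(no_fk[1]))
```

With the foreign keys enforced both queries return the same three rows. Once orphan rows can exist, the query without blog3 returns an extra row, so it is worth checking that the constraints are really there (and enabled) before dropping a join like this.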

The entire test script:

clear screen
set serveroutput on
set echo off

-- drop leftovers from a previous run
drop table blog1 purge;
drop table blog2 purge;
drop table blog3 purge;

-- truncate the explain plan table
truncate table plan_table;

create table blog1
( id       number(35)
, value1   varchar2(10)
, value2   varchar2(10)
, blog3_id number(35)
);

create table blog2
( id       number(35)
, value1   varchar2(10)
, value2   varchar2(10)
, blog3_id number(35)
);

create table blog3
( id       number(35)
, value1   varchar2(10)
, value2   varchar2(10)
);

-- add the constraints
alter table blog1 add constraint pk_blog1_id primary key (ID);
alter table blog2 add constraint pk_blog2_id primary key (ID);
alter table blog3 add constraint pk_blog3_id primary key (ID);
alter table blog1 add constraint fk_blog1_blog3_id foreign key (BLOG3_ID) references blog3 (ID);
alter table blog2 add constraint fk_blog2_blog3_id foreign key (BLOG3_ID) references blog3 (ID);

-- add some records
insert into blog3(id, value1, value2) values (1, 'one', 'first');
insert into blog3(id, value1, value2) values (2, 'two', 'second');
insert into blog3(id, value1, value2) values (3, 'three', 'third');
--
insert into blog1(id, value1, value2, blog3_id) values (1, 'one', 'first', 1);
insert into blog1(id, value1, value2, blog3_id) values (2, 'two', 'second', 2);
insert into blog1(id, value1, value2, blog3_id) values (3, 'three', 'third', 3);
--
insert into blog2(id, value1, value2, blog3_id) values (1, 'one', 'first', 1);
insert into blog2(id, value1, value2, blog3_id) values (2, 'two', 'second', 2);
insert into blog2(id, value1, value2, blog3_id) values (3, 'three', 'third', 3);

set timing on

-- the query with the extra join through blog3
select blog1.*
     , blog2.*
  from blog1
  join blog3 on blog1.blog3_id = blog3.id
  join blog2 on blog2.blog3_id = blog3.id;

-- the same query without the extra join
select blog1.*
     , blog2.*
  from blog1
  join blog2 on blog2.blog3_id = blog1.blog3_id;

-- explain plan for the query with the extra join
EXPLAIN PLAN FOR
select blog1.*
     , blog2.*
  from blog1
  join blog3 on blog1.blog3_id = blog3.id
  join blog2 on blog2.blog3_id = blog3.id;

SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());
--
-- explain plan for the query without the extra join
EXPLAIN PLAN FOR
select blog1.*
     , blog2.*
  from blog1
  join blog2 on blog2.blog3_id = blog1.blog3_id;

SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());

commit;

set timing off

-- clean up
drop table blog1 purge;
drop table blog2 purge;
drop table blog3 purge;

links used in this post:

http://download.oracle.com/docs/cd/B28359_01/server.111/b28274/ex_plan.htm

Bar Solutions Weblog
