Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

Database Tuning: Query Optimization, Index Selection, and Schema Design, Slides of Introduction to Database Management Systems

University of Waterloo Introduction to Database Management Systems

An introduction to database tuning, focusing on techniques to speed up queries, such as generating equivalent plans and determining their cost. It also covers cost models for query optimization and strategies for selecting the best query plan. Examples and strategies for using indexes and scanning relations.

Typology: Slides

2011/2012

Uploaded on 02/12/2012

richiey 🇨🇦

32 documents

1 / 15

This page cannot be seen from the preview

Don't miss anything!

Database Tuning and Physical Design:

Query Optimization, Index Selection, and

Schema, Query, and Transaction Tuning

David Toman

School of Computer Science

University of Waterloo

Introduction to Databases CS348

David Toman (University of Waterloo) DB Tuning 1 / 15

Partial preview of the text

Download Database Tuning: Query Optimization, Index Selection, and Schema Design and more Slides Introduction to Database Management Systems in PDF only on Docsity!

Database Tuning and Physical Design:

Query Optimization, Index Selection, and

Schema, Query, and Transaction Tuning

David Toman

School of Computer Science University of Waterloo

Introduction to Databases CS

Database Tuning

Goal

Make a set of applications execute “as fast as possible”.

(^) optimize for response time?
optimize for overall throughput?

How can we affect performance (as DBAs)?

(^) make queries run faster (data structures, clustering, replication)
make updates run faster (locality of data items)
(^) minimize congestion due to concurrency

How does it Search for Plans?

Query equivalence is undecidable!!

always good transformations, e.g.,

A=:p (R on S ) 7! (A=:p (R)) on S

rewrites using integrity constraints
(^) cost-based access path selection and join ordering ... the query optimizer may not find the right plan (query tuning)

Cost Models for Query Optimization

How do we figure out how fast a particular plan is??

For every stored relation R with an attribute A we keep:

jRj: the cardinality of R (the number of tuples in R)
b(R): the blocking factor for R
min(R; A): the minimum value for A in R
max(R; A): the maximum value for A in R
distinct (R; A): the number of distinct values of A

) in practice much more complex statistics are kept

Based on these values we try to estimate the cost of query plans ) costs estimated in number of disk I/O operations

Strategy 1: Use CourseInd

Assuming uniform distribution of tuples over the courses, there will be about jMarkj/100 = 100 tuples with Course = PHYS.

Searching the CourseInd index has a cost of 2. Retrieval of the 100 matching tuples adds a cost of 100 =b(Mark) data blocks.

The total cost of 4.

Selection of N tuples from relation R using a clustered index has a cost of 2 + N =b(R).

Strategy 2: Use StudnumInd

Assuming uniform distribution of tuples over student numbers, there will be about jMarkj/500 = 20 tuples for each student.

Searching the StudnumInd has a cost of 2. Since this is not a clustered index, we will make the pessimistic assumption that each matching record is on a separate data block, i.e., 20 blocks will need to be read.

The total cost is 22.

Selection of N tuples from relation R using a non-clustered index has a cost of 2 + N.

Query Plan Tools (EXPLAIN)

How do we know what plan is used (and what the estimated cost is)? ) db2expln and dynexpln tools

select name from author, wrote where aid=author

(without index)

(index on wrote(author))

Estimated Cost = 25 Estimated Cardinality = 120 Optimizer Plan: RETURN | MSJOIN /
TBSCAN IXSCAN | /
SORT Index: Table: | AW WROTE TBSCAN | Table: AUTHOR

More complex Designs

Multi-attribute Indices complex search/join conditions (in lexicographical order!) index-only plans (several clustered indices)

Join Indices allow replacing joins by index lookups

Materialized Views allow replacing subqueries by index lookups

Problem 1

how does the query optimizer know if/where to use such indices/views?

Problem 2

balance between cost of rematerialization and savings for queries.

Index Selection and Tools Example

rees$ db2advis -d cs -s "select name from author,wrote where aid=author" Calculating initial cost (without recommmended indexes) [25.390385] Initial set of proposed indexes is ready. Found maximum set of [2] recommended indexes Cost of workload with all indexes included [0.364030] timerons total disk space needed for initial set [ 0.014] MB total disk space constrained to [ -1.000] MB 2 indexes in current solution [ 25.3904] timerons (without indexes) [ 0.3640] timerons (with current solution) [%98.57] improvement Trying variations of the solution set. -- -- execution finished at timestamp 2006-11-23-12.25.24. -- -- LIST OF RECOMMENDED INDEXES -- =========================== -- index[1], 0.009MB CREATE INDEX WIZ8 ON "DAVID "."AUTHOR" ("AID" ASC, "NAME" ASC) ; -- index[2], 0.005MB CREATE INDEX AW ON "DAVID "."WROTE" ("AUTHOR" ASC) ; -- =========================== Index Advisor tool is finished.

) can be fed a workload instead of a single query

Schema Tuning and Normal Forms

So far we onlyadded data structures to improve performance. what to do if this isn’t enough?

Changes to the conceptual design

Goals:

(^) avoid expensive operations in query execution (joins)
retrieve related data in fewer operations

Techniques:

alternative normalization/weaker normal form
(^) co-clustering of relations (if available)/denormalization
vertical/horizontal partitioning of data (and views)
(^) avoiding concurrency hot-spots

Database Tuning: Query Optimization, Index Selection, and Schema Design, Slides of Introduction to Database Management Systems

Related documents

Partial preview of the text

Download Database Tuning: Query Optimization, Index Selection, and Schema Design and more Slides Introduction to Database Management Systems in PDF only on Docsity!

Database Tuning and Physical Design:

Query Optimization, Index Selection, and

Schema, Query, and Transaction Tuning

Database Tuning

Goal

How does it Search for Plans?

Cost Models for Query Optimization

Strategy 1: Use CourseInd

Strategy 2: Use StudnumInd

Query Plan Tools (EXPLAIN)

(without index)

(index on wrote(author))

More complex Designs

Problem 1

Problem 2

Index Selection and Tools Example

Schema Tuning and Normal Forms