
Database Systems: Teaching Slides

Overview of File Organizations and Indexing
Jianlin Feng
School of Software, SUN YAT-SEN UNIVERSITY
(courtesy of Joe Hellerstein for some slides)

Context
[Figure: DBMS layers, top to bottom: Query Optimization and Execution; Relational Operators; Files and Access Methods; Buffer Management; Disk Space Management; DB]

Goal for Today
- Big picture of overheads for data access
  - We'll simplify things to get focused
  - Still, a bit of discipline:
    - Clearly identify assumptions
    - Then estimate cost in a principled way
- Foundation for query optimization
  - Can't choose the fastest scheme without an estimate of speed!

Alternative File Organizations
Many alternatives exist, each good for some situations, and not so good in others:
- Heap files: Suitable when typical access is a file scan retrieving all records.
- Sorted Files: Best for retrieval in search key order, or when only a "range" of records is needed.
- Clustered Files (with Indexes): Coming soon

Cost Model for Analysis
- B: The number of data blocks (pages)
- R: Number of records per block
- D: (Average) time to read or write a disk block
- Average-case analyses for uniform random workloads
- We will ignore:
  - Sequential vs. Random I/O
  - Pre-fetching
  - Any in-memory costs
- Good enough to show the overall trends!

More Assumptions
- Single record insert and delete.
- Equality selection
  - exactly one match
- For Heap Files:
  - Insert always appends to end of file.
- For Sorted Files:
  - Files compacted after deletions.
  - Selections on search key.

Cost of Operations
B: The number of data pages
R: Number of records per page
D: (Average) time to read or write disk page

Operation          Heap File     Sorted File
Scan all records   BD            BD
Equality Search    0.5 BD        (log2 B) * D
Range Search       BD            ((log2 B) + #match pg) * D
Insert             2D            ((log2 B) + B) * D
Delete             0.5 BD + D    ((log2 B) + B) * D

Indexes
- Allow record retrieval by value in one or more fields
  - Find all students in the "CS" department
  - Find all students with a gpa > 3
- Index: disk-based data structure for fast lookup by value
  - Search key: any subset of columns in the relation.
  - Search key need not be a key of the relation
    - Can have multiple items matching a lookup
- Index contains a collection of data entries
  - Items associated with each search key value k
  - Data entries come in various forms, as we'll see

1st Question to Ask About Indexes
- What kinds of selections (lookups) do they support?
  - Selection: <key> <op> <constant>
  - Equality selections (op is =)?
  - Range selections (op is one of <, >, <=, >=, BETWEEN)?
  - More exotic selections?
    - 2-dimensional ranges ("east of Berkeley and west of Truckee and North of Fresno and South of Eureka")
      - Or n-dimensional
    - 2-dimensional radii ("within 2 miles of Soda Hall")
      - Or n-dimensional
    - Ranking queries ("10 restaurants closest to Berkeley")
    - Regular expression matches, genome string matches, etc.
    - One common n-dimensional index: R-tree

Index Breakdown
- What selections does the index support
- Representation of data entries in index
  - i.e., what kind of info is the index actually storing?
  - 3 alternatives here
- Clustered vs. Unclustered Indexes
- Single Key vs. Composite Indexes
- Tree-based, hash-based, other

Alternatives for Data Entry k* in Index
- Three alternatives:
  1. Actual data record (with key value k)
  2. <k, rid> (rid: record id)
  3. <k, list of rids>
- Choice is orthogonal to the indexing technique.
  - B+ trees, hash-based structures, R trees, GiSTs, ...
- Can have multiple (different) indexes per file.
  - E.g. file sorted by age, with a hash index on salary, and a B+ tree index on name.

Alternatives for Data Entries (Contd.)
Alternative 1: Actual data record (with key value k)
- Index as a file organization for records
  - Alongside Heap files or sorted files
- At most one Alternative 1 index per relation
- No "pointer lookups" to get data records

Alternatives for Data Entries (Contd.)
Alternative 2 and Alternative 3
- Must use Alternatives 2 or 3 to support more than one index per relation.
- Alternative 3 is more compact than Alternative 2, but its data entries are variable sized
  - even if search keys are of fixed length.
- For large rid lists, a data entry spans multiple blocks!

Index Classification
- Clustered vs. Unclustered:
- Clustered index:
  - order of data records is the same as, or close to, order of index data entries
  - A file can be clustered on at most one search key.
  - Cost of retrieving data records through index varies greatly based on whether index is clustered or not!
  - Alternative 1 implies clustered, but not vice-versa.
- Note: there is another definition of "clustering"
  - Data mining, AI, statistics

Clustered vs. Unclustered Index
- Alternative 2 data entries, data records in a Heap file.
  - To build a clustered index, first sort the Heap file
    - with some free space on each block for future inserts
  - Overflow blocks may be needed for inserts.
    - Thus, order of data records is close to, but not identical to, the sort order.
[Figure: CLUSTERED vs. UNCLUSTERED panels. In each, index entries direct the search for data entries (index file), and data entries point to data records (data file).]

Unclustered vs. Clustered Indexes
- Clustered Pros
  - Efficient for range searches
  - Supports some types of compression
    - More soon
  - Possible locality benefits
    - Disk scheduling, prefetching, etc.
- Clustered Cons
  - More expensive to maintain
    - on the fly or "sloppily" via reorganizations
    - Heap file usually only packed to 2/3 to accommodate inserts

Cost of Operations
B: The number of data pages
R: Number of records per page
D: (Average) time to read or write disk page

Operation          Heap File     Sorted File                   Clustered File
Scan all records   BD            BD                            1.5 BD
Equality Search    0.5 BD        (log2 B) * D                  ((logF 1.5B) + 1) * D
Range Search       BD            ((log2 B) + #match pg) * D    ((logF 1.5B) + #match pg) * D
Insert             2D            ((log2 B) + B) * D            ((logF 1.5B) + 2) * D
Delete             0.5 BD + D    ((log2 B) + B) * D            ((logF 1.5B) + 2) * D

Notes: The B term in the Sorted File entry (marked "because R, W 0.5") arises because, on average, half of the pages are read and half are written back. The Clustered File column uses 1.5B pages because pages are packed to only about 2/3.

Composite Search Keys
- Search on a combination of fields.
  - Equality query: Every field value is equal to a constant value. E.g. wrt <age, sal> index:
    - age=20 and sal=75
  - Range query: Some field value is not a constant. E.g.:
    - age > 20; or age=20 and sal > 10
- Data entries in index can be sorted by search key to support range queries.
  - Lexicographic order
  - Like the dictionary, but on fields, not letters!

Examples of composite key indexes using lexicographic order.

Data records sorted by name:
name  age  sal
bob   12   10
cal   11   80
joe   12   20
sue   13   75

Data entries in index sorted by <age, sal>: 11,80; 12,10; 12,20; 13,75
Data entries sorted by <sal, age>: 10,12; 20,12; 75,13; 80,11
Data entries sorted by <age>: 11; 12; 12; 13
Data entries sorted by <sal>: 10; 20; 75; 80

Summary
- File Layer manages access to records in pages.
  - Record and page formats depend on fixed vs. variable-length.
  - Free space management is an important issue.
  - Slotted page format supports variable length records and allows records to move on page.
- Many alternative file organizations exist, each appropriate in some situation.
- If selection queries are frequent, sorting the file or building an index is important.
  - Hash-based indexes only good for equality search.
  - Sorted files and tree-based indexes best for range search; also good for equality search. (Files rarely kept sorted in practice; B+ tree index is better.)
- Index is a collection of data entries plus a way to quickly find entries with given key values.

Summary (Contd.)
- Data entries in index can be one of 3 alternatives: (1) actual data records, (2) <key, rid> pairs, or (3) <key, rid-list> pairs.
  - Choice orthogonal to indexing structure (i.e., tree, hash, etc.).
- Usually have several indexes on a given file of data records, each with a different search key.
- Indexes can be classified as clustered vs. unclustered
  - Differences have important consequences for utility/performance.
- Catalog relations store information about relations, indexes and views.
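
The formulas in the Cost of Operations table can be evaluated directly for concrete values of B and D. The following is a minimal Python sketch, not part of the original slides. The class and method names are hypothetical, and F is assumed here to be the index fan-out (the slides do not define F in this section), with an arbitrary default of 100.

```python
import math
from dataclasses import dataclass


@dataclass
class CostModel:
    """I/O cost estimates under the slides' simplified cost model."""
    B: int          # number of data pages
    D: float        # average time to read or write one page
    F: int = 100    # ASSUMPTION: index fan-out; not defined in the slides

    def heap(self) -> dict:
        B, D = self.B, self.D
        return {
            "scan": B * D,
            "equality": 0.5 * B * D,      # on average, scan half the file
            "range": B * D,               # matches may be anywhere
            "insert": 2 * D,              # read last page, write it back
            "delete": 0.5 * B * D + D,    # find the record, write page back
        }

    def sorted_file(self, match_pages: int = 1) -> dict:
        B, D = self.B, self.D
        search = math.log2(B)             # binary search over pages
        return {
            "scan": B * D,
            "equality": search * D,
            "range": (search + match_pages) * D,
            "insert": (search + B) * D,   # search, then shift half the file
            "delete": (search + B) * D,
        }

    def clustered(self, match_pages: int = 1) -> dict:
        B, D, F = self.B, self.D, self.F
        descent = math.log(1.5 * B, F)    # pages 2/3 full, so 1.5B pages
        return {
            "scan": 1.5 * B * D,
            "equality": (descent + 1) * D,
            "range": (descent + match_pages) * D,
            "insert": (descent + 2) * D,
            "delete": (descent + 2) * D,
        }


if __name__ == "__main__":
    model = CostModel(B=1024, D=1.0)
    for name, costs in [("heap", model.heap()),
                        ("sorted", model.sorted_file(match_pages=5)),
                        ("clustered", model.clustered(match_pages=5))]:
        print(name, {op: round(c, 2) for op, c in costs.items()})
```

Keeping the three organizations behind the same dictionary keys makes it easy to compare them operation by operation, which is the kind of estimate a query optimizer needs before it can choose an access path.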

Database Systems teaching slides (files in the archive)
- lec 1 overview.ppt
- lec 10 norm.ppt
- lec 11 transaction.ppt
- lec 12 recovery.ppt
- lec 14 warehouse.ppt
- lec 15 final review.ppt
- lec 2 The Entity-Relationship Model.ppt
- lec 2 The Relational Model.ppt
- lec 3 File and Indexing.ppt
- lec 3 Storing Data.ppt
- lec 4 algebra.ppt
- lec 4 calculus.ppt
- lec 4 SQL.ppt
- lec 5 TreeInds.ppt
- lec 6 External Sorting.ppt
- lec 7 Hash-Based Inds.ppt
- lec 8 Query Processing.ppt
- lec 9 Physical Design.ppt
- lec 9 Query Optimization.ppt


ID: 48634128 Type: shared resource Size: 6.17MB Format: ZIP Upload date: 2022-01-12
