裝配圖網(wǎng) > 圖紙專(zhuān)區(qū) > 課件教案 > 《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件

《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件

《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件,數(shù)據(jù)庫(kù)系統(tǒng),數(shù)據(jù)庫(kù),系統(tǒng),教學(xué),課件 Tree-Structured IndexesJianlin FengSchool of SoftwareSUN YAT-SEN UNIVERSITYcourtesy of Joe Hellerstein for some slidesReview:Files,Pages,RecordsnAbstraction of stored data is“files”with“pages”of“records”.qRecords live on pagesqPhysical Record ID(RID)=qRecords can have fixed length or variable length.nFiles can be unordered(heap),sorted,or kind of sorted(i.e.,“clustered”)on a search key.nIndexes can be used to speed up many kinds of accesses.(i.e.,“access paths”)Tree-Structured Indexes:IntroductionnSelections of form:field constantnEquality selections(op is=)qEither“tree”or“hash”indexes help here.nRange selections(op is one of,=,BETWEEN)q“Hash”indexes dont work for these.nMore complex selections(e.g.spatial containment)qThere are fancier trees that can do thisnTree-structured indexing techniques support both range selections and equality selections.qISAM:static structure;early index technology.qB+tree:dynamic,adjusts gracefully under inserts and deletes.Range SearchesnFind all students with gpa 3.0qIf data is in sorted file,do binary search to find first such student,then scan to find others.qCost of binary search in a database can be quite high.nWhy?nSimple idea:Create an index file.*Can do binary search on(smaller)index file!Page 1Page 2Page NPage 3Data Filek2kNk1Index FileISAMnIndex file may still be quite large.But we can apply the idea repeatedly!*Leaf pages contain data entries.index entryNon-leafPagesPagesPrimary pagesLeafP0K1P1K2P2KmPmOverflow pageExample ISAM TreenIndex entries:,they direct search for data entries in leaves.nExample where each node can hold 2 entries;10*15*20*27*33*37*40*46*51*55*63*97*2033516340RootISAM is a STATIC StructurenFile creation:qLeaf(data)pages allocated sequentially,sorted by search keyqthen index pagesqthen overflow pgs.nSearch:Start at root;use key comparisons to go to leaf.nCost=log F N qF=#entries/page(i.e.,fanout)qN=#leaf pgsq no need for next-leaf-page pointers.(Why?)nInsert:Find leaf that data entry belongs to,and put it there.Overflow page if necessary.nDelete:Seek and destroy!If deleting a tuple empties an overflow page,de-allocate it and remove from linked-list.Static tree structure:inserts/deletes affect only leaf pages.Data PagesIndex PagesOverflow pagesPage Number48*Example:Insert 23*,48*,41*,42*10*15*20*27*33*37*40*46*51*55*63*97*2033516340RootOverflowPagesLeafIndexPagesPagesPrimary23*41*42*48*10*15*20*27*33*37*40*46*51*55*63*97*2033516340RootOverflowPagesLeafIndexPagesPagesPrimary23*41*42*.then Deleting 42*,51*,97*Note that 51*appears in index levels,but not in leaf!B+Tree StructurenEach node contains d=m =2d entries(index or data)qThe parameter d is called the order of the tree.qEach internal node contains m index entries:.qEach leaf node contains m data entries:nThe ROOT node contains between 1 and 2d index entries.qIt is a leaf or has at least two children.nEach path from the ROOT to any leaf has the same length.nSupports equality and range-searches efficiently.Index EntriesData Entries(Sequence set)(Direct search)B+Tree Equality SearchnSearch begins at root,and key comparisons direct it to a leaf.nSearch for 15*Based on the search for 15*,we know it is not in the tree!Root1724302*3*5*7*14*16*19*20*22*24*27*29*33*34*38*39*13B+Tree Range SearchnSearch all records whose ages are in 15,28.qEquality search 15*.qFollow sibling pointers.Root1724302*3*5*7*14*16*19*20*22*24*27*29*33*34*38*39*13B+Trees in PracticenTypical order:100.Typical fill-factor:67%.qaverage fanout=133nCan often hold top levels in buffer pool:qLevel 1=1 page =8 KBqLevel 2=133 pages=1 MBqLevel 3=17,689 pages=145 MB qLevel 4=2,352,637 pages=19 GB vWith 1 MB buffer,can locate one record in 19 GB(or 0.3 billion records)in two I/Os!Inserting a Data Entry into a B+TreenFind correct leaf L.nPut data entry onto L.qIf L has enough space,done!qElse,must split L(into L and a new node L2)nRedistribute entries evenly,copy up middle key.nInsert index entry pointing to L2 into parent of L.nThis can happen recursivelyqTo split index node,redistribute entries evenly,but push up middle key.(Contrast with leaf splits.)nSplits“grow”tree;root split increases height.qTree growth:gets wider or one level taller at top.Example B+Tree Inserting 8*Root1724302*3*5*7*14*16*19*20*22*24*27*29*33*34*38*39*13Animation:Insert 8*Root1724302*3*5*7*14*16*19*20*22*24*27*29*33*34*38*39*138*145Final B+Tree-Inserting 8*v Notice that root was split,leading to increase in height.v In this example,we can avoid split by re-distributing entries;however,this is usually not done in practice.Root1724302*3*5*7*14*16*19*20*22*24*27*29*33*34*38*39*132*3*Root17243014*16*19*20*22*24*27*29*33*34*38*39*1357*5*8*Data vs.Index Page Split(from previous example of inserting“8*”)nObserve how minimum occupancy is guaranteed in both leaf and index pg splits.nNote difference between copy-up and push-up;be sure you understand the reasons for this.2*3*5*7*5Entry to be inserted in parent node.(Note that 5 iscontinues to appear in the leaf.)s copied up and2*3*5*7*8*Data Page Split8*5243013appears once in the index.Contrast17Entry to be inserted in parent node.(Note that 17 is pushed up and onlythis with a leaf split.)17243013Index Page Split5Deleting a Data Entry from a B+TreenStart at root,find leaf L where entry belongs.nRemove the entry.qIf L is at least half-full,done!qIf L has only d-1 entries,nTry to re-distribute,borrowing from sibling(adjacent node with same parent as L).nIf re-distribution fails,merge L and sibling.nIf merge occurred,must delete entry(pointing to L or sibling)from parent of L.nMerge could propagate to root,decreasing height.Root1724302*3*5*7*14*16*19*20*22*24*27*29*33*34*38*39*132*3*Root17243014*16*19*20*22*24*27*29*33*34*38*39*1357*5*8*Example Tree(including 8*)Delete 19*and 20*.Example Tree(including 8*)Delete 19*and 20*.nDeleting 19*is easy.nDeleting 20*is done with re-distribution.Notice how middle key is copied up.2*3*Root17243014*16*19*20*22*24*27*29*33*34*38*39*1357*5*8*2*3*Root173014*16*33*34*38*39*1357*5*8*22*24*2727*29*.And Then Deleting 24*nMust merge.nObserve toss of index entry(on right),and pull down of index entry(below).3022*27*29*33*34*38*39*2*3*7*14*16*22*27*29*33*34*38*39*5*8*Root3013517Example of Non-leaf Re-distributionnTree is shown below during deletion of 24*.(What could be a possible initial tree?)nIn contrast to previous example,can re-distribute entry from left child of root to right child.Root1351720223014*16*17*18*20*33*34*38*39*22*27*29*21*7*5*8*3*2*After Re-distributionnIntuitively,entries are re-distributed by pushing through the splitting entry in the parent node.nIt suffices to re-distribute index entry with key 20;weve re-distributed 17 as well for illustration.14*16*33*34*38*39*22*27*29*17*18*20*21*7*5*8*2*3*Root13517302022Bulk Loading of a B+TreenGiven:large collection of recordsnDesire:B+tree on some fieldnBad idea:repeatedly insert recordsqSlow,and poor leaf space utilization.Why?nBulk Loading can be done much more efficiently.nInitialization:Sort all data entries,insert pointer to first(leaf)page in a new(root)page.3*4*6*9*10*11*12*13*20*22*23*31*35*36*38*41*44*Sorted pages of data entries;not yet in B+treeRootBulk Loading(Contd.)nIndex entries for leaf pages always entered into right-most index page just above leaf level.When this fills up,it splits.(Split may go up right-most path to the root.)nMuch faster than repeated inserts.3*4*6*9*10*11*12*13*20*22*23*31*35*36*38*41*44*RootData entry pages not yet in B+tree352312610203*4*6*9*10*11*12*13*20*22*23*31*35*36*38*41*44*6Root101223203538not yet in B+treeData entry pages Summary of Bulk LoadingnOption 1:multiple inserts.qSlow.qDoes not give sequential storage of leaves.nOption 2:Bulk Loading qFewer I/Os during build.qLeaves will be stored sequentially(and linked,of course).qCan control“fill factor”on pages.A Note on OrdernOrder(d)makes little sense with variable-length entriesnUse a physical criterion in practice(at least half-full).qIndex pages often hold many more entries than leaf pages.qVariable sized records and search keys:ndifferent nodes have different numbers of entries.qEven with fixed length fields,Alternative(3)gives variable lengthnMany real systems are even sloppier than this-only reclaim space when a page is completely empty.SummarynTree-structured indexes are ideal for range-searches,also good for equality searches.nISAM is a static structure.qOnly leaf pages modified;overflow pages needed.qOverflow chains can degrade performance unless size of data set and data distribution stay constant.nB+tree is a dynamic structure.qInserts/deletes leave tree height-balanced;log F N cost.qHigh fanout(F)means depth rarely more than 3 or 4.qTypically,67%occupancy on average.qUsually preferable to ISAM;adjusts to growth gracefully.qIf data entries are data records,splits can change rids!Summary(Contd.)nKey compression increases fanout,reduces height.nBulk loading can be much faster than repeated inserts for creating a B+tree on a large data set.nB+tree widely used because of its versatility.qOne of the most optimized components of a DBMS.

數(shù)據(jù)庫(kù)系統(tǒng)教學(xué)課件.zip

《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件

lec 9 Query Optimization.ppt---(點(diǎn)擊預(yù)覽)

lec 9 Physical Design.ppt---(點(diǎn)擊預(yù)覽)

lec 8 Query Processing.ppt---(點(diǎn)擊預(yù)覽)

lec 7 Hash-Based Inds.ppt---(點(diǎn)擊預(yù)覽)

lec 6 External Sorting.ppt---(點(diǎn)擊預(yù)覽)

lec 5 TreeInds.ppt---(點(diǎn)擊預(yù)覽)

lec 4 SQL.ppt---(點(diǎn)擊預(yù)覽)

lec 4 calculus.ppt---(點(diǎn)擊預(yù)覽)

lec 4 algebra.ppt---(點(diǎn)擊預(yù)覽)

lec 3 Storing Data.ppt---(點(diǎn)擊預(yù)覽)

lec 3 File and Indexing.ppt---(點(diǎn)擊預(yù)覽)

lec 2 The Relational Model.ppt---(點(diǎn)擊預(yù)覽)

lec 2 The Entity-Relationship Model.ppt---(點(diǎn)擊預(yù)覽)

lec 15 final review.ppt---(點(diǎn)擊預(yù)覽)

lec 14 warehouse.ppt---(點(diǎn)擊預(yù)覽)

lec 12 recovery.ppt---(點(diǎn)擊預(yù)覽)

lec 11 transaction.ppt---(點(diǎn)擊預(yù)覽)

lec 10 norm.ppt---(點(diǎn)擊預(yù)覽)

lec 1 overview.ppt---(點(diǎn)擊預(yù)覽)

壓縮包目錄

預(yù)覽區(qū)

《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件
- lec 1 overview.ppt--點(diǎn)擊預(yù)覽
- lec 10 norm.ppt--點(diǎn)擊預(yù)覽
- lec 11 transaction.ppt--點(diǎn)擊預(yù)覽
- lec 12 recovery.ppt--點(diǎn)擊預(yù)覽
- lec 14 warehouse.ppt--點(diǎn)擊預(yù)覽
- lec 15 final review.ppt--點(diǎn)擊預(yù)覽
- lec 2 The Entity-Relationship Model.ppt--點(diǎn)擊預(yù)覽
- lec 2 The Relational Model.ppt--點(diǎn)擊預(yù)覽
- lec 3 File and Indexing.ppt--點(diǎn)擊預(yù)覽
- lec 3 Storing Data.ppt--點(diǎn)擊預(yù)覽
- lec 4 algebra.ppt--點(diǎn)擊預(yù)覽
- lec 4 calculus.ppt--點(diǎn)擊預(yù)覽
- lec 4 SQL.ppt--點(diǎn)擊預(yù)覽
- lec 5 TreeInds.ppt--點(diǎn)擊預(yù)覽
- lec 6 External Sorting.ppt--點(diǎn)擊預(yù)覽
- lec 7 Hash-Based Inds.ppt--點(diǎn)擊預(yù)覽
- lec 8 Query Processing.ppt--點(diǎn)擊預(yù)覽
- lec 9 Physical Design.ppt--點(diǎn)擊預(yù)覽
- lec 9 Query Optimization.ppt--點(diǎn)擊預(yù)覽

請(qǐng)點(diǎn)擊導(dǎo)航文件預(yù)覽

編號(hào)：48634128 類(lèi)型：共享資源大?。?span id="ml9yse3" class="font-tahoma">6.17MB 格式：ZIP 上傳時(shí)間：2022-01-12

30
積分

舉報(bào)

版權(quán)申訴 word格式文檔無(wú)特別注明外均可編輯修改；預(yù)覽文檔經(jīng)過(guò)壓縮，下載后原文更清晰！ 立即下載

關(guān) 鍵詞：: 數(shù)據(jù)庫(kù)系統(tǒng) 數(shù)據(jù)庫(kù) 系統(tǒng) 教學(xué) 課件

資源描述：: 《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件,數(shù)據(jù)庫(kù)系統(tǒng),數(shù)據(jù)庫(kù),系統(tǒng),教學(xué),課件

展開(kāi)閱讀全文

溫馨提示:
1: 本站所有資源如無(wú)特殊說(shuō)明，都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
2: 本站的文檔不包含任何第三方提供的附件圖紙等，如果需要附件，請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶(hù)所有。
3.本站RAR壓縮包中若帶圖紙，網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽，若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
5. 裝配圖網(wǎng)僅提供信息存儲(chǔ)空間，僅對(duì)用戶(hù)上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理，對(duì)用戶(hù)上傳分享的文檔內(nèi)容本身不做任何修改或編輯，并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容，請(qǐng)與我們聯(lián)系，我們立即糾正。
7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶(hù)因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。

裝配圖網(wǎng)所有資源均是用戶(hù)自行上傳分享，僅供網(wǎng)友學(xué)習(xí)交流，未經(jīng)上傳用戶(hù)書(shū)面授權(quán)，請(qǐng)勿作他用。

關(guān)于本文

本文標(biāo)題：《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件
鏈接地址：http://m.appdesigncorp.com/article/48634128.html

點(diǎn)擊下載此資源

《數(shù)據(jù)庫(kù)系統(tǒng)》教學(xué)課件

最新文檔

相關(guān)資源

相關(guān)搜索