Environmental Erosion Research
پژوهش هاي فرسايش محيطي
E.E.R.
Literature & Humanities
http://magazine.hormozgan.ac.ir
1
admin
2251-7812
2717-3968
10.52547/jeer
6561
8888
45855/11/3/90
fa
jalali
1395
11
1
gregorian
2017
2
1
6
4
online
1
fulltext
fa
انتخاب بهترین تکنیک خوشهبندی سلسله مراتبی بر مبنای تحلیل مؤلفههای اصلی در برآورد رسوب معلق
Choosing the Best Hierarchical Clustering Technique Based on Principal Components Analysis for Suspended Sediment Load Estimation
مدلسازی و تحلیل زمانی و مکانی رخداد انواع مختلف فرسایش محیطی
پژوهشي
Research
<p dir="RTL"><strong><span style="font-family:b compset;">تحلیل منطقه­ای توزیع رسوب معلق از موارد اساسی در اجرای پروژههای حفاظت آب­و­خاک است که امکان برآورد رسوب را در آبخیزهای فاقد آمار فراهم میسازد. برای این منظور تعیین مناطق همگن برای ارائه­ی نتایج نزدیک به واقعیت، نیازمند خوشهبندی صحیح آبخیزها در واحدهای همگن است. در همین راستا، هدف پژوهش حاضر مقایسه­ی تکنیکهای مختلف روش سلسله مراتبی اعم از </span></strong><strong><span dir="LTR"><span style="font-size:11.0pt;">Single</span></span></strong> <strong><span dir="LTR"><span style="font-size:11.0pt;">linkage</span></span></strong><strong><span style="font-family:b compset;">، </span></strong><strong><span dir="LTR"><span style="font-size:11.0pt;">Ward</span></span></strong><strong><span style="font-family:b compset;"> و </span></strong><strong><span dir="LTR"><span style="font-size:11.0pt;">β-Flexible</span></span></strong><strong><span style="font-family:b compset;"> و انتخاب بهترین تکنیک برای تعیین مناطق همگن در حوضه­ی رودخانههای قرهسو و گرگانرود استان گلستان است. در هر کدام از روشهای خوشهبندی آبخیزها، بهعنوان پیشفرض </span></strong><strong><span style="font-family:b compset;"><span style="font-size:11.0pt;">2</span></span></strong><strong><span style="font-family:b compset;">، </span></strong><strong><span style="font-family:b compset;"><span style="font-size:11.0pt;">3</span></span></strong><strong><span style="font-family:b compset;">، </span></strong><strong><span style="font-family:b compset;"><span style="font-size:11.0pt;">4</span></span></strong><strong><span style="font-family:b compset;"> و </span></strong><strong><span style="font-family:b compset;"><span style="font-size:11.0pt;">5</span></span></strong><strong><span style="font-family:b compset;"> خوشه در نظر گرفته و با شاخصهای اعتبارسنجی </span></strong><strong><span dir="LTR"><span style="font-size:11.0pt;">Pseudo-F</span></span></strong><strong><span style="font-family:b compset;"> و </span></strong><strong><span dir="LTR"><span style="font-size:11.0pt;">Dunn</span></span></strong><strong><span style="font-family:b compset;"> بررسی شد. نتایج نشان داد که از بین تکنیکهای مختلف مورد استفاده و براساس شاخصهای سنجش کیفیت خوشهبندی، روش </span></strong><strong><span dir="LTR"><span style="font-size:11.0pt;">Single linkage</span></span></strong><strong><span style="font-family:b compset;"> عملکرد بهتری را ارائه داد. با توجه به اینکه مقادیر رسوب معلق از دادههای اندازهگیریشده­ی دبی و منحنیهای سنجه رسوب بهدست آمده­، خوشهبندی دارای صحت بیشتری است؛ لذا روش مناسب خوشهبندی میتواند در اتخاذ روشهای صحیح مدیریت در حوضههای آبخیز، بهخصوص در مسائل رسوبگذاری و فرسایش مؤثر باشد.</span></strong></p>
<p><strong>1- INTRODUCTION</strong></p>
<p>The assessment of watershed sediment load is necessary for controling soil erosion and reducing the potential of sediment production. Different estimates of sediment amounts along with the lack of long-term measurements limits the accessibility to reliable data series of erosion rate and sediment yield. Therefore, the observed data of suspended sediment load could be used to estimate soil loss in the catchment upstream. Hence, one of the valid methods to estimate soil erosion is using of the recorded data of hydrometery stations in combination with catchment characteristics that will provide accurate predictions. For this purpose, recognition of similar sub-watersheds according to climatic, physiographic, geologic land use could be useful in the erosion control operations.</p>
<p><strong>2- THEORETICAL FRAMEWORK</strong></p>
<p>To estimate the exact amount of sediment in the ungauged areas, clustering is introduced as a key step. Various methods and techniques have been used to determine the best number of clusters. However, application of different clustering methods and selection of the best one is rarely found. To this aim, the objective of present study is to determine the most important variables in sediment production using Single linkage, Ward and β-Flexible methods for the clustering of sub-watersheds of Gorganroud and Qareh-Sou river basins in Golestan Province.</p>
<p><strong>3- METHODOLOGY</strong></p>
<pre>
The Gorganroud and Qareh Sou Watersheds are located at the North-Eastern part of Iran. The seventeen hydrometric stations were selected with a 24-year (1986–2010) recorded data of discharge and suspended sediment load. The Grubbs and Beck method was used to perform the verity in order to verify the outlier discharge measured data. The correlation method was used to fill the missing data in time series. The normality of discharge and suspended sediment data were tested using Kolmogrov-Smirnov test and verified for choosing the well-set trend analyses method. The linear regression and Mann-Kendal Taw methods were used for the data with normal and non-normal distribution in trend analysis, respectively. Auto Correlation Function (ACF) test method was used to determine the internal consistency between the data series.</pre>
<p>A set of 38 factors from the five main groups of categories were investigated to determine the sediment yield controlling independent variables. Principal Component Analysis (PCA) was used to determine the most effective variables. In order to detect the best classification method, three classification techniques (Single linkage, Ward’s, and β-flexible methods) were examined in the study area. The Single Linkage also called nearest neighbor is a simple clustering method. The object pairs forms clusters hierarchically starting from the most similar pairs according to the similarity in a descending order. Ward’s algorithm is one of the frequently used techniques for the regionalization studies of hydrology and climatology factors. A generalized hierarchical method, β-Flexible, formed the group calculating the external object. The distance from a point to the group was computed in this method.</p>
<p>Many indices have been developed to examine the validity of clustering techniques based on finding an optimal partitioning. In the present study, Pseudo F and Dunn’s Indices were used to assess the accuracy of clustering algorithms. Accurate clustering means having non-overlapping partitions. One of the most commonly used criteria for the selection of group number is the maximization of pseudo-F statistics. This statistics is based on multivariate normal distribution of data.</p>
<p><strong>4- RESULTS </strong></p>
<p>All data series of 17 sub-watersheds in Gorganroud and Qareh Sou basins were tested with different clustering alghorithms. Two data series showed autocorrelation, detected by the ACF test. Two data sets had trends according to the Kendal’s test. Therefore, 13 sub-watersheds remained for the final classification. Some 38 independent variables were calculated and screened with PCA. The variables with similar effects on sediment yield, were grouped in 7 components. The selected components were chosen according to the amount of variance. The results of PCA and the selected representative variables in each component have been given in Table 1.</p>
<p></p>
<p align="center"><strong>Table 1: Result of Principal Component Analysis of effective variables on sediment yield in Gorganroud and Qareh Sou Watersheds, Iran</strong></p>
<table align="left" border="1" cellpadding="0" cellspacing="0" style="width:100.0%;" width="100%">
<tbody>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><strong><span dir="LTR">Components</span></strong></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><strong><span dir="LTR">Spatial Amount</span></strong></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><strong><span dir="LTR">Variance (%)</span></strong></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><strong><span dir="LTR">Cumulative Variance (%)</span></strong></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><strong><span dir="LTR">Representative variable</span></strong></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">1</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">7.99</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">21.60</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">21.60</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Main stream length</span></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">2</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">6.82</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">18.43</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">40.03</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Flow discharge with 10 years of return period</span></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">3</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">5.97</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">16.12</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">56.16</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Percent of forest area</span></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">4</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">5.25</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">14.18</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">70.33</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Percent of agricultural lands area</span></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">5</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">4.98</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">13.47</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">83.81</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Drainage density</span></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">6</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">2.56</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">6.92</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">90.73</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Percentage of permeable formations area</span></p>
</td>
</tr>
<tr>
<td style="width:12.44%;">
<p align="center" dir="RTL"><span dir="LTR">7</span></p>
</td>
<td style="width:11.98%;">
<p align="center" dir="RTL"><span dir="LTR">1.95</span></p>
</td>
<td style="width:18.34%;">
<p align="center" dir="RTL"><span dir="LTR">5.28</span></p>
</td>
<td style="width:15.3%;">
<p align="center" dir="RTL"><span dir="LTR">96.01</span></p>
</td>
<td style="width:41.94%;">
<p align="center" dir="RTL"><span dir="LTR">Concentration time</span></p>
</td>
</tr>
</tbody>
</table>
<p></p>
<p>Results of Ward’s, Single linkage and β-flexible methods as hierarchical techniques have been summarized in Table 2.</p>
<p></p><p></p><p></p><p></p><p></p><p></p><p></p><p></p><p></p><p></p><p></p>
<p align="center" dir="RTL"><strong><span dir="LTR">Table 2 Results of the hierarchical clustering technique in Gorganroud and Qareh Sou Watersheds, Iran.</span></strong></p>
<table align="center" border="1" cellpadding="0" cellspacing="0">
<tbody>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">Method</span></strong></p>
</td>
<td style="height:2px;"><p align="center" dir="RTL"></p></td>
<td style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">Clusters Number</span></strong></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">Dunn Coefficient</span></strong></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">Psedue-F</span></strong></p>
</td>
</tr>
<tr>
<td colspan="2" rowspan="4" style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">Single Linkage</span></strong></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">2</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.29</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">2.12</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">3</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.45</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">3.50</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">4</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.32</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">2.89</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">5</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.43</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">3.30</span></p>
</td>
</tr>
<tr>
<td colspan="2" rowspan="4" style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">Ward</span></strong></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">2</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.29</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">4.06</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">3</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.19</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">2.73</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">4</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">5</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
</tr>
<tr>
<td colspan="2" rowspan="4" style="height:2px;">
<p align="center" dir="RTL"><strong><span dir="LTR">β-Flexible</span></strong></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">2</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.29</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">3.57</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">3</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">4</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">0.37</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">4.06</span></p>
</td>
</tr>
<tr>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">5</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
<td style="height:2px;">
<p align="center" dir="RTL"><span dir="LTR">-</span></p>
</td>
</tr>
</tbody>
</table>
<div style="clear:both;"></div><p></p>
<p><strong>5- CONCLUSIONS & SUGGESTIONS</strong></p>
<p>The results showed that the Single linkage method presented a better performance considering the accuracy criterion. The suspended sediment values were determined using measured discharge and available Sediment Rating Curves; therefore, the identified clusters as the reliable and appropriate watershed grouping methods which could be regarded as a useful tool in the management of watersheds particularly in the context of erosion and sedimentation.</p>
<p></p>
تحلیل منطقهای, خوشهبندی, رسوب معلق, مناطق همگن, گرگانرود و قرهسو
Clustering, Gorganroud and Qareh-Sou, Homogenous Region, Regional Analysis, Suspunded Sediment Load
47
67
http://magazine.hormozgan.ac.ir/browse.php?a_code=A-10-263-1&slc_lang=fa&sid=1
Maryam
Adhami
مریم
ادهمی
m.adhami66@yahoo.com
10031947532846004035
10031947532846004035
No
Tarbiat Modares University
دانشگاه تربیت مدرس
Mohsen
Zabihi
محسن
ذبیحی
Mohsen_Zabihi69@yahoo.com
10031947532846004036
10031947532846004036
No
Tarbiat Modares University
دانشگاه تربیت مدرس
Saieed
Zare Naghadeh
سعید
زارع نقده
saieed_zare@yahoo.com
10031947532846004037
10031947532846004037
No
Dokuz Eylul University
دانشگاه Dokuz Eylul
Raoof
Mostafazadeh
رئوف
مصطفی زاده
Raoofmostafazadeh@yahoo.com
10031947532846004038
10031947532846004038
Yes
Mohaghegh Ardabili University
دانشگاه محقق اردبیلی