Finite Difference Method

Finite Difference Method

<p> Finite Difference Method</p><p>We consider the elliptic partial differential equation, knows as Poisson’s equation,</p><p> uxx  u yy  f where(x, y)  [a,b][c,d] </p><p>To solve Poisson’s equation by difference method, the region  is partitioned into a grid consisting of n x m rectangles with sides h and k. The mesh point are given by</p><p> xi  a  ih , y j  c  jk , i  0,1,,n , j  0,1,,m</p><p>By using central difference approximations for the special derivatives, the finite</p><p> u  2u  u u  2u  u difference equation is i1, j i, j i1, j  i, j1 i, j i, j1  f h2 k 2 i, j Often it is desirable to set h=k, the equation becomes</p><p> 4ui, j  ui, j1  ui, j1  ui1, j  ui1, j  hf i, j</p><p>(i+1,j)</p><p>(i,j-1) (i,j) (i,j+1)</p><p>(i-1,j)</p><p>We can also solve this equation by parallel computing. Suppose first that the parallel</p><p> system consists of a mesh connected array of p processors Pi, j arranged in a two- dimensional lattice. Suppose that N 2  mp . Then it is natural to assign m unknowns to each processor. Each processor will proceed to compute iterates for the unknowns it</p><p>28 holds. On a local memory system, at the end of each iteration the new iterates at certain grid points will need to be transmitted to adjacent processors.</p><p>We denote by ” internal boundary values “ those values of uij that are needed by other processors at the next iteration and must be transmitted.</p><p> xxxxxxxxxxxxxxxxxxxxxxx x x x x </p><p> x Pij x </p><p> x x x x. xxxxxxxxxxxxxxxxxxxxxxxx </p><p> u41 u42 u43 u44</p><p> u31 u32 u33 u34</p><p>P21 P22</p><p> u21 u22 u23 u24</p><p> u11 u12 u13 u14</p><p>P11 P12</p><p>For example, for the situation the computation in processor P22 would be</p><p> k1 k1 compute u33 ,u34 ; send to P12</p><p>29 k1 k1 k1 compute u43 , send u33 ,u43 to P21</p><p>⋮ From the above, we know that there are different types of connection between the processors. We choose the type of connection depend on the structure of the matrix.</p><p>Example:</p><p>A star A ring A cube</p><p>Example 9</p><p>We have poisson equation</p><p> uxx  u yy 1 where (x, y)  [0,1][0,1]</p><p>The region  is partitioned into a grid consisting of 240x240 rectangles.</p><p>Result:</p><p>No of processor 1 4 9 16 Time used in each processor calculating x 17.64 12.17 5.57 3.27 Total time used in this function 21.08 13.4 7.42 4.51</p><p>(1) In the Scientific Computing Lab, the result shows that the total time used in this method is decreasing if more processors are used. E.g. When 9 computers are used, the total time used is 7.42s. It accelerates about triple times comparing with that when single processor is used. The time needed to calculate x becomes shortened</p><p>5.24+0.23=5.57s. It is because each computer just needs to calculate 1/9 part of x. The</p><p>30 time used in ‘Send’ and ‘Receive’ is very small and the time mainly used in calculation. So we find that this method can apply parallel algorithm.</p><p>The speedup of parallel algorithm is </p><p>S p =execution time for a single processor / execution time using p processors</p><p>S E  p p p</p><p> p 1 4 9 16 1 1.5731 2.841 4.6741 Speed up, S p </p><p>1 0.3933 0.3157 0.2921 Efficiency, E p</p><p>We find that the efficiency of the algorithm is decreasing slowly. Since the number of processor is limited in our network of work-stations, we need to use PC cluster to see the development of curve when more processors are used.</p><p>31 The upper line shows the ideal case where Sp=1. (2) In TDG Cluster, the result is similar. The number of processor is increased and we can use up to 100 CPUs.</p><p>No of processor, p 1 4 9 16 25 36 64 100 Total time used in this function 39.68 16.17 7.8 4.25 2.86 2.28 1.92 1.85 1 2.4539 5.0872 9.3365 13.874 17.4035 20.0667 21.4486 Speed up, S p</p><p>1 0.6135 0.5652 0.5835 0.555 0.4834 0.3135 0.2145 Efficiency, E p</p><p>From the table, we know that the efficiency is decreasing gradually. It means that the</p><p> development of Sp is increased slowly.</p><p>32 We notice that the curve of speed up becomes a horizontal line at the end. It is because when we use more than 70 processors, the time used in calculation is also around 2 seconds and the time used in message passing is no changed, so the total time is almost the same. Conclusion In conclusion, we know that the jacobi method can apply parallel algorithm. We can connect several processors to calculate the x at the same time, it will decrease the time used. But the Gauss-Seidel method need the latest information as new as possible, so the processors can not run the program at the same time and it can not apply parallel algorithm.</p><p>For jacobi method, we need to notice the time used in message passing. It is because the time need to broadcast or gather the x is very long. The solution is we can apply the property of the matrix’s structure. Ex. For the sparse matrix, we just need to send</p><p>33 the necessary data to the adjacent processor, it will decrease the time used in transferring data. In addition, we just use this parallel algorithm if the program need extensive time to solve in one processor, i.e. the matrix’s size is very large or it need many iterates to convergent, then the efficiency will be more evident.</p><p>The main factors that cause degradation from perfect speedup are:</p><p>(i) Lack of a perfect degree of parallelism in the algorithm and/or lack of perfect</p><p> load balance;</p><p>(ii) Communication, contention, and synchronization time</p><p>By load balancing we mean the assignment of tasks to the processors of the system so as to keep each processor doing useful work as much as possible.</p><p>We expect to need a parallel machine only for those problems too large to run in a feasible time on a single processor.</p><p>Bibliography</p><p>[1] K. A. Gallivan, Michael T. Heath, Esmond Ng, James M. Ortega, Barry W. Peyton, R. J. Plemmons, Charles H. Romine, A. H. Sameh, Robert G. Voigt, Parallel Algorithms for Matrix Computations, Society for Industrial and Applied Mathematics, 1990</p><p>[2] U. Schendel, Introduction to Numerical Methods for Parallel Computers, Ellis Horwood Limited, 1984</p><p>[3] Marc Snir, MPI--the complete reference, Cambridge, Mass, 1998</p><p>[4] C. T. Kelley, Iterative methods for linear and nonlinear equations, Society for</p><p>34 Industrial and Applied Mathematics, 1995</p><p>[5] Anne Greenbaum, Iterative Method for Solving Linear Systems, Society of Industrial and Applied Mathematics, 1997</p><p>[6] Wolfgang Hackbusch, Iterative Solution of Large Sparse Systems of Equations, Springer-Verlag, 1994</p><p>Appendix</p><p>Total time used in different method</p><p>Example 1 Jacobi method (matrix size:6x6) Gauss-Seidel method (matrix size:6x6)</p><p>Example 2: Jacobi method (matrix size:6x6) Comparing the cases when 1,3 clusters are used.</p><p>Example 3: Jacobi method (Sparse matrix:57600x57600) Comparing the cases when 1,2,4,8 clusters are used.</p><p>35 Example 4: Jacobi method (Sparse matrix:921600x921600) Comparing the cases when 1,2,4,8 clusters are used.</p><p>Example 5: Gauss-Seidel method (matrix size:6x6) Comparing the cases when 1,3 clusters are used.</p><p>Example 6: Gauss-Seidel method (Sparse matrix:57600x57600) Comparing the cases when 1,5 clusters are used.</p><p>Example 7: Conjugate gradient method (Sparse matrix) (diagonal=5) Comparing the cases when 1,2,4 clusters are used.</p><p>Example 8: Conjugate gradient method (Sparse matrix) (diagonal=4) Comparing the cases when 1,2,4,8 clusters are used.</p><p>Example 9: Finite different method (Poisson’s matrix) Comparing the cases when 1,4,9,16,25,36,64,100 clusters are used.</p><p>Profile Report (1) Network of work-stations (2) PC Cluster</p><p>36</p>

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    9 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us