{"id":442,"date":"2010-10-29T12:53:37","date_gmt":"2010-10-29T03:53:37","guid":{"rendered":"http:\/\/jungwonkim.com\/?p=442"},"modified":"2018-03-06T12:59:35","modified_gmt":"2018-03-06T17:59:35","slug":"massimiliano-fatica","status":"publish","type":"post","link":"https:\/\/blog.jungwon.kim\/?p=442","title":{"rendered":"Massimiliano Fatica"},"content":{"rendered":"<p><strong>Accelerating linpack with CUDA on heterogenous clusters<\/strong><\/p>\n<p><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=1513895.1513901\">http:\/\/portal.acm.org\/citation.cfm?id=1513895.1513901<\/a><\/p>\n<p>The author calculates the bandwidth of PCIe and the peak GFlops of a CPU and a GPU. Then calculate the execution time with the measurement and the data input size, and get the optimal split fraction. The author does not overlap the execution with data transfer. Because the memory system cannot supply data to both the PCIe and the CPU at maximum speed on Intel systems using Front Side Bus (FSB). However, on the new Intel systems with Quick Path Interconnect (QPI), this may not be the case.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Accelerating linpack with CUDA on heterogenous clusters http:\/\/portal.acm.org\/citation.cfm?id=1513895.1513901 The author calculates the bandwidth of PCIe and the peak GFlops of a CPU and a GPU. Then calculate the execution time with the measurement and the data input size, and get<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-442","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=\/wp\/v2\/posts\/442","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=442"}],"version-history":[{"count":1,"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=\/wp\/v2\/posts\/442\/revisions"}],"predecessor-version":[{"id":1904,"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=\/wp\/v2\/posts\/442\/revisions\/1904"}],"wp:attachment":[{"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=442"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=442"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.jungwon.kim\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=442"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}