In work that could be considered a continuation of the architecture-specific optimization analysis of GEM, Konstantinos Krommydas and I evaluate programmability/performance tradeoffs across three architectures: an Intel CPU, an Intel Xeon Phi, and an NVIDIA Kepler GPU. Some of the results were surprising, not least that the fully optimized GPU code ended up more readable than the highly optimized CPU code.
Balaji Subramaniam took point on our annual analysis of the Green500 this year, and reached out to Winston Saunders to include the Exascalar metric and draw some new conclusions based on the list.
The most recent installment of our annual analysis of the Green500 list is appearing at ISC this year instead of HPPAC. As we collect more data, we gain more and more insight not only into the progress made in green computing, but also into the trends we are tracking toward future goals. This paper examines the trajectory from today toward the exascale goals set for 2018.
The final camera-ready version of our paper “Heterogeneous Task Scheduling for Accelerated OpenMP” is finally in. This paper was a turning point for me: the first paper I feel I really drove from start to finish and am happy with. This work, and the work that follows from it, will form the basis of my thesis; it is interesting and fun work.
Our first publication discussing the OpenCL and the 13 Dwarfs benchmark suite; I’m glad to have a tangible artifact from this work now. Keep an eye out for the official release of the benchmark sometime around June 2012! Update: The official release has come! If you’re interested, go here for the code.
I’m rather fond of this work. It’s in direct opposition to the claim made in the original Mars paper that their two-pass method was the only way to handle MapReduce on GPUs that cannot use atomics. While StreamMR is now compared against versions that can use atomics, it works on GPUs with or without them and does not require a second pass.
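To give a flavor of what “no atomics, no second pass” can look like, here is a minimal sketch of one common way to emit map output on a GPU without global atomic operations: each thread writes into its own preallocated slice of the output buffer and records how many pairs it produced, leaving a later prefix-sum/compaction step to pack the results. This is not the actual StreamMR design (which is an OpenCL framework with its own buffering scheme); the names, slot sizes, and toy map function below are assumptions for illustration only.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative sketch only, not StreamMR itself.
struct KeyVal { int key; int val; };

#define SLOTS_PER_THREAD 4   // assumed upper bound on emits per input element

__global__ void map_no_atomics(const int* input, int n,
                               KeyVal* out, int* counts)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid >= n) return;

    KeyVal* my_slice = out + tid * SLOTS_PER_THREAD;  // private region: no races
    int emitted = 0;

    int x = input[tid];
    if (x % 2 == 0 && emitted < SLOTS_PER_THREAD) {   // toy "map" function
        KeyVal kv; kv.key = x % 16; kv.val = 1;
        my_slice[emitted++] = kv;
    }
    counts[tid] = emitted;   // consumed later by a scan + compaction pass
}

int main() {
    const int n = 256;
    int h_in[n]; for (int i = 0; i < n; ++i) h_in[i] = i;

    int *d_in, *d_counts; KeyVal* d_out;
    cudaMalloc(&d_in, n * sizeof(int));
    cudaMalloc(&d_counts, n * sizeof(int));
    cudaMalloc(&d_out, n * SLOTS_PER_THREAD * sizeof(KeyVal));
    cudaMemcpy(d_in, h_in, n * sizeof(int), cudaMemcpyHostToDevice);

    map_no_atomics<<<(n + 127) / 128, 128>>>(d_in, n, d_out, d_counts);
    cudaDeviceSynchronize();

    int h_counts[n];
    cudaMemcpy(h_counts, d_counts, n * sizeof(int), cudaMemcpyDeviceToHost);
    printf("thread 0 emitted %d pairs\n", h_counts[0]);

    cudaFree(d_in); cudaFree(d_counts); cudaFree(d_out);
    return 0;
}
```

The design tradeoff is the usual one: you pay for over-allocated output space and an extra compaction pass, but every write goes to a thread-private location, so no atomic contention and no wasted first pass to count output sizes.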
The third Green500 paper at HPPAC. I was uninvolved in the second one, working on other projects. Coming back, Wu, Balaji, and I found some interesting new ways to analyze the data and draw new conclusions from the list. This work covers more ground than any other Green500 review to date.
This paper explores the benefits of multi-level Hierarchical Charge Partitioning (HCP) as applied to a CUDA version of the GEM application, originally explored on GPUs in “Accelerating electrostatic surface potential calculation with multi-scale approximation on graphics processing units.” The final speedup over the serial version without HCP is staggering: tens of thousands of times faster.
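The core idea behind HCP is easy to sketch: the molecule is partitioned into components, and for a given surface vertex any component beyond a distance threshold is replaced by a small set of effective charges, while nearby components are expanded into their individual atoms. The kernel below is a minimal two-level illustration of that idea, not the actual GEM/HCP code; the struct names, the single effective charge per component, the cutoff parameter, and the omission of GEM’s dielectric terms and host-side setup are all simplifying assumptions.

```cuda
#include <cuda_runtime.h>

// Two-level HCP-style sketch: one thread per surface vertex, one effective
// charge for far components, per-atom charges for near ones.
struct Atom      { float x, y, z, q; };
struct Component { float cx, cy, cz, qeff;     // center and effective charge
                   int first_atom, num_atoms; };

__global__ void potential_hcp(const float3* verts, int nverts,
                              const Component* comps, int ncomps,
                              const Atom* atoms, float cutoff, float* phi)
{
    int v = blockIdx.x * blockDim.x + threadIdx.x;
    if (v >= nverts) return;

    float3 p = verts[v];
    float sum = 0.0f;

    for (int c = 0; c < ncomps; ++c) {
        Component comp = comps[c];
        float dx = p.x - comp.cx, dy = p.y - comp.cy, dz = p.z - comp.cz;
        float dist = sqrtf(dx * dx + dy * dy + dz * dz);

        if (dist > cutoff) {
            // Far component: one effective charge stands in for all its atoms.
            sum += comp.qeff / dist;
        } else {
            // Near component: fall back to the exact per-atom sum.
            for (int a = comp.first_atom; a < comp.first_atom + comp.num_atoms; ++a) {
                float ax = p.x - atoms[a].x;
                float ay = p.y - atoms[a].y;
                float az = p.z - atoms[a].z;
                sum += atoms[a].q / sqrtf(ax * ax + ay * ay + az * az);
            }
        }
    }
    phi[v] = sum;   // arbitrary units; real GEM includes dielectric screening
}
```

The enormous speedups come from combining the two effects: the GPU parallelizes over vertices, and HCP turns the per-vertex work from a sum over every atom into a sum over a handful of nearby atoms plus a few effective charges.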
This paper provides an overview of some of the architecture-specific optimizations we identified for AMD Radeon GPUs. Each is characterized in terms of the GEM GPU application described in “Accelerating electrostatic surface potential calculation with multi-scale approximation on graphics processing units.” While some of these optimizations are less necessary on current hardware, many can still be applied and give benefits not only on AMD GPUs but on a variety of other platforms as well.
My first foray into GPU research, at least in terms of publication. We opened a big can of worms with this paper, asking where some of these anomalies came from, and the follow-up work explaining them never really happened. If nothing else, this paper serves as a reminder that just because we think we know how something works doesn’t mean we always know how it will behave.
A retrospective on the first year of the Green500. I have done a few of these now, and am in the interesting position of being the only student member of the team who has been around since the first release.
My first publication, and my first presentation at a conference. To our great surprise, this work spawned stories on GCN (Government Computer News) and Slashdot. Evidently it was quite a surprise to some that the cores in a multicore system wouldn’t all behave the same.