Probabilistic diagnosis of performance faults in large-scale parallel applications

I Laguna, DH Ahn, BR De Supinski, S Bagchi… - Proceedings of the 21st …, 2012 - dl.acm.org
Debugging large-scale parallel applications is challenging. Most existing techniques
provide mechanisms for process control but little information about the causes of failures …

Data races vs. data race bugs: telling the difference with portend

B Kasikci, C Zamfir, G Candea - ACM SIGPLAN Notices, 2012 - dl.acm.org
Even though most data races are harmless, the harmful ones are at the heart of some of the
worst concurrency bugs. Alas, spotting just the harmful data races in programs is like finding …

Triage: diagnosing production run failures at the user's site

J Tucek, S Lu, C Huang, S Xanthos… - ACM SIGOPS Operating …, 2007 - dl.acm.org
Diagnosing production run failures is a challenging yet important task. Most previous work
focuses on offsite diagnosis, i.e., development site diagnosis with the programmers present …

mSWAT: Low-cost hardware fault detection and diagnosis for multicore systems

SK Sastry Hari, ML Li, P Ramachandran… - Proceedings of the …, 2009 - dl.acm.org
Continued technology scaling is resulting in systems with billions of devices. Unfortunately,
these devices are prone to failures from various sources, resulting in even commodity …

Log2: A Cost-Aware logging mechanism for performance diagnosis

R Ding, H Zhou, JG Lou, H Zhang, Q Lin, Q Fu… - 2015 USENIX annual …, 2015 - usenix.org
Logging has been a common practice for monitoring and diagnosing performance issues.
However, logging comes at a cost, especially for large-scale online service systems. First …

Breadcrumbs: efficient context sensitivity for dynamic bug detection analyses

MD Bond, GZ Baker, SZ Guyer - Proceedings of the 31st ACM SIGPLAN …, 2010 - dl.acm.org
Calling context--the set of active methods on the stack--is critical for understanding the
dynamic behavior of large programs. Dynamic program analysis tools, however, are almost …

SOBER: statistical model-based bug localization

C Liu, X Yan, L Fei, J Han, SP Midkiff - ACM SIGSOFT Software …, 2005 - dl.acm.org
Automated localization of software bugs is one of the essential issues in debugging aids.
Previous studies indicated that the evaluation history of program predicates may disclose …

On automatic detection of performance bugs

S Tsakiltsidis, A Miranskyy… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
Context: Software performance is a critical non-functional requirement, appearing in many
fields such as mission critical applications, financial, and real time systems. In this work we …

Comparing static bug finders and statistical prediction

F Rahman, S Khatri, ET Barr, P Devanbu - Proceedings of the 36th …, 2014 - dl.acm.org
The all-important goal of delivering better software at lower cost has led to a vital, enduring
quest for ways to find and remove defects efficiently and accurately. To this end, two parallel …

Quantifying performance bottleneck cost through differential analysis

S Koliaï, Z Bendifallah, M Tribalat, C Valensi… - Proceedings of the 27th …, 2013 - dl.acm.org
Accurate performance analysis is critical for understanding application efficiency and then
driving software or hardware optimizations. Although most of static and dynamic …