Hongyu Zhang  

Contact Email: hongyujohn@gmail.com

 

I am currently an Associate Professor at The University of Newcastle, Australia. Previously, I was a Lead Researcher at Microsoft Research. Before joining Microsoft, I was an Associate Professor at Tsinghua University, China (2006-2014). I received my PhD degree in Computer Science from the School of Computing, National University of Singapore, in 2003.

 

My research is in the area of software engineering, in particular software analytics, software quality, software maintenance, and software reuse. The main theme of my research is to improve software quality and productivity by utilizing knowledge mined from software repositories. Over the years, a software organization can accumulate a large amount of data, including source code, bug reports, execution logs, changes, metrics, and documents. Data mining, machine learning, and information retrieval techniques can be applied to extract knowledge from this data and solve software engineering problems. Together with my students and collaborators, I have published more than 150 research papers in international journals and conferences. More details about the papers can be found on my Google Scholar page.

 

Outside work, I like reading, hiking, spending time with friends, and playing with my two kids.

I am always open to collaboration!

 

Research Area:

My research area is software engineering, in particular:

§  software analytics, mining software repositories, data-driven software engineering

§  software measurement and empirical software engineering

§  software quality assurance, testing, debugging

§  software reuse (generative programming and software product lines)

§  software maintenance

 

My DBLP and Google Scholar pages

Research Grants:

§  Australian Research Council (ARC) Discovery Project, Data-driven Approach to Resilient Online Service Systems, Grant No. DP200102940, 2020-2022. (Lead CI)

§  NSF China, Project “Software Crash Analysis”, Grant No. 61272089. (PI)

§  NSF China, Project “Software Defect Prediction Models and Applications”, Grant No. 61073006, 2011 – 2013. (PI)

§  NSF China, Project "Software Customization Techniques", Grant No. 60703060, 2008-2011. (PI)

§  NSF China, Project "Software Defect and Failure Prediction Techniques", Grant No. 90718022, 2008-2011. (PI)

§  National High-tech 863 Project No. 2007AA01Z122, 2008-2010. (Co-PI)

§  National High-tech 863 Project No. 2007AA01Z480, 2008-2010. (Co-PI)

§  The 6th Key Researcher Support Program, Tsinghua University, 2007-2009. (PI)

 

Tool Development:

§  I am involved in the Microsoft project Developer Assistant, which puts millions of code snippets at your fingertips while you are coding in Visual Studio. 
News: Visual Studio Blog, Microsoft Blog, PC Advisor, MSDN

§  BugLocator: locating buggy source code files based on bug reports.

§  ReLink: recovering missing links between fixed bugs and committed changes.

§  XVCL: an XML-based variant configuration language.

 

Publications:

Refereed conference and journal papers:

§  Ru Zhang, Wencong Xiao, Hongyu Zhang, Yu Liu, Haoxiang Lin, Mao Yang, An Empirical Study on Program Failures of Deep Learning Jobs, Proc. the 42nd International Conference on Software Engineering (ICSE 2020), Oct 2020, Seoul, South Korea.

§  Jian Zhang, Xu Wang, Hongyu Zhang, Hailong Sun, and Xudong Liu, Retrieval-based Neural Source Code Summarization, Proc. the 42nd International Conference on Software Engineering (ICSE 2020), Oct 2020, Seoul, South Korea.

§  Xiaodong Gu, Hongyu Zhang, Sunghun Kim, CodeKernel: A Graph Kernel based Approach to the Selection of API Usage Examples, Proc. The 34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019), San Diego, California, USA, November 2019.

§  Junjie Chen, Guancheng Wang, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, History-Guided Configuration Diversification for Compiler Test-Program Generation, Proc. The 34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019), San Diego, California, USA, November 2019.

§  Junjie Chen, Xiaoting He, Qingwei Lin, Hongyu Zhang, Dan Hao, Feng Gao, Zhangwei Xu, Yingnong Dang, Dongmei Zhang, Continuous Incident Triage for Large-Scale Online Service Systems. In Proc. The 34th IEEE/ACM International Conference on Automated Software Engineering, November 2019, San Diego, CA, USA.

§  Zhenyu Zhang, Hailong Sun, Hongyu Zhang, Developer recommendation for Topcoder through a meta-learning based policy model, Empirical Software Engineering, 1-31, Springer, to appear, 2020.

§  Junjie Chen, Jibesh Patra, Michael Pradel, Yingfei Xiong, Hongyu Zhang, Dan Hao, Lu Zhang, A Survey of Compiler Testing, ACM Computing Surveys, to appear, 2020.

 

§  Huong Ha and Hongyu Zhang, DeepPerf: Performance Prediction for Configurable Software with Deep Sparse Neural Network, In Proc. the 41st International Conference on Software Engineering (ICSE 2019), Research Track, Montreal, Canada, May 2019.

§  Huong Ha and Hongyu Zhang, Performance-Influence Model for Highly Configurable Software with Fourier Learning and Lasso Regression, In Proc. 35th IEEE International Conference on Software Maintenance and Evolution (ICSME 2019), Cleveland, USA, 2019.

§  Bo Zhang and Hongyu Zhang and Junjie Chen and Dan Hao and Pablo Moscato. Automatic Discovery and Cleansing of Numerical Metamorphic Relations. In Proc. 35th IEEE International Conference on Software Maintenance and Evolution (ICSME 2019), Cleveland, USA, 2019.

§  Jian Zhang, Xu Wang, Hongyu Zhang, Hailong Sun, Kaixuan Wang and Xudong Liu, A Novel Neural Source Code Representation based on Abstract Syntax Tree, In Proc. the 41st International Conference on Software Engineering (ICSE 2019), Research Track, Montreal, Canada, May 2019.

§  Junjie Chen, Xiaoting He, Qingwei Lin, Yong Xu, Hongyu Zhang, Dan Hao, Feng Gao, Zhangwei Xu, Yingnong Dang and Dongmei Zhang, An Empirical Investigation of Incident Triage for Online Service Systems, In Proc. the 41st International Conference on Software Engineering (ICSE 2019), SEIP Track, Montreal, Canada, May 2019.

§  Chuan Luo, Holger H. Hoos, Shaowei Cai, Qingwei Lin, Hongyu Zhang, Dongmei Zhang, Local Search with Efficient Automatic Configuration for Minimum Vertex Cover, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI'2019), 1297-1304, August 2019.

§  Xu Zhang, Yong Xu, Qingwei Lin, Bo Qiao, Hongyu Zhang, Yingnong Dang, Chunyu Xie, Xinsheng Yang, Qian Cheng, Ze Li, Junjie Chen, Xiaoting He, Randolph Yao, Jian-Guang Lou, Murali Chintalapati, Furao Shen, and Dongmei Zhang. 2019. Robust log-based anomaly detection on unstable log data. In Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2019), 807-817.

§  Jinkun Lin, Shaowei Cai, Chuan Luo, Qingwei Lin, and Hongyu Zhang. 2019. Towards more efficient meta-heuristic algorithms for combinatorial test generation. In Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2019), 212-222.

§  Xu Zhang, Qingwei Lin, Yong Xu, Si Qin, Hongyu Zhang, Bo Qiao, Yingnong Dang, Xinsheng Yang, Qian Cheng, Murali Chintalapati, Youjiang Wu, Ken Hsieh, Kaixin Sui, Xin Meng, Yaohai Xu, Wenchi Zhang, Furao Shen, and Dongmei Zhang. 2019. Cross-dataset time series anomaly detection for cloud systems. In Proceedings of the 2019 USENIX Conference on Annual Technical Conference (USENIX ATC '19). USENIX Association, Berkeley, CA, USA, 1063-1076.

§  Yujun Chen, Xian Yang, Qingwei Lin, Hongyu Zhang, Feng Gao, Zhangwei Xu, Yingnong Dang, Dongmei Zhang, Hang Dong, Yong Xu, Hao Li and Yu Kang, Outage Prediction and Diagnosis for Cloud Service Systems, Proc. The Web Conference 2019 (formerly WWW), San Francisco, May 2019.

§  Yongfeng Gu, Jifeng Xuan, Hongyu Zhang, Lanxin Zhang, Qingna Fan, Xiaoyuan Xie and Tieyun Qian. Does the Fault Reside in a Stack Trace? Assisting Crash Localization by Predicting Crashing Fault Residence, Journal of Systems and Software, vol. 148, 2019, 88-104.

§  Junjie Chen, Guancheng Wang, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, Bing Xie, Coverage Prediction for Accelerating Compiler Testing, IEEE Transactions on Software Engineering, 2019.

 

§  Shilin He, Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Michael R. Lyu and Dongmei Zhang, Identifying Impactful Service System Problems via Log Analysis, Proc. The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018), November 2018, Lake Buena Vista, Florida, USA.

§  Qingwei Lin, Ken Hsieh, Yingnong Dang, Hongyu Zhang, Kaixin Sui, Yong Xu, Jian-Guang Lou, Chenggang Li, Youjiang Wu, Randolph Yao, Murali Chintalapati and Dongmei Zhang, Predicting Node Failure in Cloud Service Systems, Proc. The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018), November 2018, Lake Buena Vista, Florida, USA.

§  Qingwei Lin, Weichen Ke, Jian-Guang Lou, Hongyu Zhang, Kaixin Sui, Yong Xu, Ziyi Zhou, Bo Qiao and Dongmei Zhang, BigIN4: Instant, Interactive Insight Identification for Multi-Dimensional Big Data, Proc. The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2018), Applied Data Science track, London, UK, August 2018.

§  Yong Xu, Kaixin Sui, Randolph Yao, Hongyu Zhang, Qingwei Lin, Yingnong Dang, Peng Li, Keceng Jiang, Wenchi Zhang, Jian-Guang Lou, Murali Chintalapati, Dongmei Zhang, Improving Service Availability of Cloud Systems by Predicting Disk Error, Proc. 2018 USENIX Annual Technical Conference (USENIX ATC'18), July 2018, Boston, MA, USA.

§  Jiajun Jiang, Yingfei Xiong, Hongyu Zhang, Qing Gao, and Xiangqun Chen, Shaping Program Repair Space with Existing Patches and Similar Code, Proc. ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2018), July 2018, Amsterdam, Netherlands.

§  Mohamad Barbar, Yulei Sui, Hongyu Zhang, Shiping Chen and Jingling Xue, Live Path Control Flow Integrity, In the 40th International Conference on Software Engineering (ICSE 2018), Gothenburg, Sweden, May 2018. (Poster)

§  Xiaodong Gu, Hongyu Zhang, Sunghun Kim, Deep Code Search, In Proc. the 40th International Conference on Software Engineering (ICSE 2018), Gothenburg, Sweden, May 2018.

 

§  Rongxin Wu, Ming Wen, Shing-Chi Cheung, Hongyu Zhang, ChangeLocator: Locate Crash-Inducing Changes Based on Crash Reports, Empirical Software Engineering, Springer, Nov 2017, pp. 1-35.

§  Zhiqiang Li, Xiao-Yuan Jing, Xiaoke Zhu, Hongyu Zhang, Baowen Xu, Shi Ying, On the Multiple Sources and Privacy Preservation Issues for Heterogeneous Defect Prediction, IEEE Transactions on Software Engineering, Dec 2017.

§  Xiaodong Gu, Hongyu Zhang, Dongmei Zhang, Sunghun Kim, DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning, In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI 2017), Melbourne, Australia, August 19-25, 2017

§  Chengxun Shu and Hongyu Zhang, Neural Programming by Example, Proc. the 31st AAAI Conference on Artificial Intelligence (AAAI 2017), February 2017, San Francisco, California, USA.

§  Junjie Chen, Yanwei Bai, Dan Hao, Yingfei Xiong, Hongyu Zhang, and Bing Xie. Learning to Prioritize Test Programs for Compiler Testing. Proc. the 39th International Conference on Software Engineering (ICSE 2017), May 2017, Buenos Aires, Argentina.

§  Zhiqiang Li, Xiao-Yuan Jing, Xiaoke Zhu and Hongyu Zhang. Heterogeneous Defect Prediction through Multiple Kernel Learning and Ensemble Learning, Proc. The 33rd International Conference on Software Maintenance and Evolution (ICSME 2017), Shanghai, China, Sep 2017.

 

§  Hongyu Zhang, Anuj Jain, Gaurav Khandelwal, Chandrashekhar Kaushik, Scott Ge, and Wenxiang Hu, Bing Developer Assistant: Improving Developer Productivity by Recommending Sample Code, In the 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2016), industry track, Seattle, WA, USA, November 2016.

§  Xiaodong Gu, Hongyu Zhang, Dongmei Zhang, Sunghun Kim. Deep API Learning. In Proceedings of the 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2016), Seattle, WA, USA, November 13–18, 2016.

§  Qingwei Lin, Hongyu Zhang, Jian-Guang Lou, Yu Zhang, Xuewei Chen, Log Clustering based Problem Identification for Online Service Systems, Proc. the 38th International Conference on Software Engineering (ICSE 2016), SEIP Track, May 14-22, 2016, Austin, TX.

§  Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Dongmei Zhang, iDice: Problem Identification for Emerging Issues, Proc. the 38th International Conference on Software Engineering (ICSE 2016), May 14-22, 2016, Austin, TX.

§  Junjie Chen, Wenxiang Hu, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, Bing Xie, An Empirical Comparison of Compiler Testing Techniques, Proc. the 38th International Conference on Software Engineering (ICSE 2016), May 14-22, 2016, Austin, TX.

§  Junjie Chen, Yanwei Bai, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, Bing Xie, Test Case Prioritization for Compilers: A Text-Vector Based Approach, Proc. the International Conference on Software Testing, Verification and Validation (ICST 2016), April 10-15, 2016, Chicago, IL, USA.

§  Rongxin Wu, Xiao Xiao, S.C. Cheung, Hongyu Zhang, and Charles Zhang, Casper: An Efficient Approach to Call Trace Collection, in Proc. POPL’16, St. Petersburg, Florida, USA, Jan 2016.  

§  Xia, X, Gong, L, Le, TDB, Lo, D, Jiang, L & Zhang, H 2016, ‘Diversity maximization speedup for localizing faults in single-fault and multi-fault programs’, Automated Software Engineering, vol. 23, pp. 43–75, doi:10.1007/s10515-014-0165-z
 

§  Fei Lv, Hongyu Zhang, Jian-guang Lou, Shaowei Wang, Dongmei Zhang, and Jianjun Zhao, "CodeHow: Effective Code Search based on API Understanding and Extended Boolean Model", in Proc. ASE 2015, Lincoln, Nebraska, Nov 2015.

§  Jieming Zhu, Pinjia He, Qiang Fu, Hongyu Zhang, Michael R. Lyu, Dongmei Zhang, Learning to Log: Helping Developers Make Informed Logging Decisions, Proc. ICSE 2015, Florence, Italy, May 2015.

§  Hucheng Zhou, Jian-Guang Lou, Hongyu Zhang, Haibo Lin, Haoxiang Lin and Tingting Qin, An Empirical Study on Quality Issues of Production Big Data Platform, Proc. ICSE 2015, SEIP Track, Florence, Italy, May 2015.

§  Rui Ding, Hucheng Zhou, Jian-Guang Lou, Hongyu Zhang, Qingwei Lin, Qiang Fu, Dongmei Zhang, Tao Xie, Log2: A Cost-Aware Logging Mechanism for Performance Diagnosis, The 2015 USENIX Annual Technical Conference (USENIX ATC'15), Santa Clara, CA, July 2015.

 

§  Meng-Hui Lim, Jian-Guang Lou, Hongyu Zhang, Qiang Fu, Andrew Teoh, Qingwei Lin, Rui Ding, Dongmei Zhang, Identifying Recurrent and Unknown Performance Issues, Proc. ICDM 2014, Shenzhen, China, Dec 2014.

§  Yu Cao, Hongyu Zhang, and Sun Ding, SymCrash: Selective Recording for Reproducing Crashes, Proc. ASE 2014, Västerås, Sweden, September 2014. (ACM Distinguished Paper Award)

§  Chengnian Sun, Haidong Zhang, Jian-Guang Lou, Hongyu Zhang, Qiang Wang, Siau-Cheng Khoo, and Dongmei Zhang, Querying Sequential Software Engineering Data, Proc. FSE 2014, Hong Kong, November 2014.

§  Chu-Pan Wong, Yingfei Xiong, Hongyu Zhang, Dan Hao, Lu Zhang and Hong Mei, Boosting Bug-Report-Oriented Fault Localization with Segmentation and Stack-Trace Analysis, Proc. ICSM 2014, Victoria, British Columbia, Canada, September 20 - 26, 2014.

§  Rongxin Wu, Hongyu Zhang, Shing-Chi Cheung and Sunghun Kim, CrashLocator: Locating Crashing Faults based on Crash Stacks, Proc. International Symposium on Software Testing and Analysis (ISSTA'14), San Jose, CA, July 2014. (ACM Distinguished Paper Award)

§  Hao Hu, Hongyu Zhang, Jifeng Xuan and Weigang Sun. Effective Bug Triage based on Historical Bug-Fix Information, Proc. ISSRE 2014, Naples, Italy, Nov 2014, pp. 122-132.

§  Sun Ding, Hee Beng Kuan Tan,

§  Sun Ding, Hongyu Zhang, H. B. K. Tan, Detecting infeasible branches based on code patterns, In Proc. Conference on Software Maintenance, Reengineering and Reverse Engineering (CSMR-WCRE’14), February 2014, Antwerp, Belgium, pp.74-83.

 

§  Hongyu Zhang, Liang Gong, Steve Versteeg, Predicting Bug-Fixing Time: An Empirical Study of Commercial Software Projects, in Proc. 35th International Conference on Software Engineering (ICSE'13), May 2013, San Francisco, CA, USA, pp. 1042-1051. (full industry track paper, 20.4% acceptance rate).

§  Hongyu Zhang and S. C. Cheung, A Cost-Effectiveness Criterion for Applying Software Defect Prediction Models, in Proc. ESEC/FSE 2013, Saint Petersburg, Russia, Aug 2013.

§  Jiangtao Gong and Hongyu Zhang, BugMap: A Topographic Map of Bugs, in Proc. ESEC/FSE 2013, Saint Petersburg, Russia, Aug 2013.

§  K. Liu, H. B. K. Tan and H. Zhang, Has this Bug Been Reported?, in Proc. WCRE 2013, Koblenz, Germany, October 2013.

§  Dan Hao, Tian Lan, Hongyu Zhang, Chao Guo, Lu Zhang, Is This a Bug or an Obsolete Test?, in Proc. The European Conference on Object-Oriented Programming (ECOOP 2013), Montpellier, France, July 2013.

§  Jue Wang, Yingnong Dang, Hongyu Zhang, Kai Chen, Tao Xie and Dongmei Zhang, Mining Succinct and High-Coverage API Usage Patterns from Source Code, in Proc. MSR 2013, May 2013, San Francisco, CA, USA.

§  Giulio Concas, Maria Ilaria Lunesu, Michele Marchesi,

§  Fayola Peters, Tim Menzies, Liang Gong, Hongyu Zhang, Balancing Privacy and Utility in Cross-Company Defect Prediction, IEEE Trans. on Software Eng., 39(8), 2013, 1054-1068.

 

§  Jian Zhou, Hongyu Zhang, Learning to Rank Duplicate Bug Reports, in Proc. 21st ACM Conference on Information and Knowledge Management (CIKM 2012), Maui, Hawaii, Oct 2012. (13.4% acceptance rate)

§  Ming Li, Hongyu Zhang, Rongxin Wu, and Zhi-Hua Zhou, Sample-based Software Defect Prediction with Active and Semi-supervised Learning, Journal of Automated Software Engineering, Springer, Jan 2012, pp.1-30.

§  Jian Zhou, Hongyu Zhang, and David Lo, Where Should the Bugs be Fixed? in Proc. 34th IEEE/ACM International Conference on Software Engineering (ICSE'12), Zurich, Switzerland, June 2012. (full research track paper, 21% acceptance rate).

§  Yingnong Dang, Rongxin Wu, Hongyu Zhang, Dongmei Zhang, and Peter Nobel, ReBucket – A Method for Clustering Duplicate Crash Reports based on Call Stack Similarity, in Proc. 34th IEEE/ACM International Conference on Software Engineering (ICSE'12), Zurich, Switzerland, June 2012. (full industry track paper, 18% acceptance rate).

§  Jue Wang and Hongyu Zhang, Predicting Defect Numbers based on Defect State Transition Models, Proc. 6th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM 2012), Lund, Sweden, Sep 2012.

§  Liang Gong, David Lo, Lingxiao Jiang and Hongyu Zhang: Diversity Maximization Speedup for Fault Localization. Automated Software Engineering (ASE 2012), Essen, Germany, Sep 2012. (full paper, 15% acceptance rate)

§  Liang Gong, David Lo, Lingxiao Jiang and Hongyu Zhang: Interactive Fault Localization Leveraging Simple User Feedback. International Conference on Software Maintenance (ICSM 2012), Riva del Garda, Trento, Italy, Sep 2012. (full paper, 25% acceptance rate)

 

§  Rongxin Wu, Hongyu Zhang, Sunghun Kim, and S.C.Cheung, ReLink: Recovering Links between Bugs and Changes, in Proc. The joint meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE'11), Szeged, Hungary, Sep 5-9 2011. (full paper, 17% acceptance rate).

§  Sunghun Kim, Hongyu Zhang, Rongxin Wu and Liang Gong, Dealing with Noise in Defect Prediction, in Proc. of 33rd IEEE/ACM International Conference on Software Engineering (ICSE'11), Honolulu, Hawaii, USA, May 21-28 2011.

§  Yuan-Fang Li, Hongyu Zhang, Integrating software engineering data using semantic web technologies, Proc. MSR 2011: 211-214.

§  Stan Jarzabek, Ulf Pettersson, Hongyu Zhang, University-Industry Collaboration Journey towards Product Lines. ICSR 2011: 223-237

§  Hongyu Zhang, Hee Beng Kuan Tan, Lu Zhang, Xi Lin, Xiaoyin Wang, Chun Zhang and Hong Mei, Checking Enforcement of Integrity Constraints in Database Applications Based on Code Patterns, Journal of Systems and Software, 2011.

 

§  Hongyu Zhang and Sunghun Kim, Monitoring the Evolution of Software Quality with Respect to Defect, IEEE Software, July/August, 2010.

§  Hongyu Zhang and Rongxin Wu, Sampling Program Quality, Proc. 26th IEEE International Conference on Software Maintenance (ICSM 2010), Timisoara, Romania, September 2010.

§  Hongyu Zhang, Adam Nelson, Tim Menzies, On the Value of Learning From Defect Dense Components for Software Defect Prediction, Proc. International Conference on Predictor Models in Software Engineering (PROMISE10), Timisoara, Romania, Sep 12-13, 2010.

§  Hongyu Zhang, Bei Shi, Lu Zhang 2010, ‘Automatic checking of license compliance’, 2010 IEEE International Conference on Software Maintenance, doi:10.1109/icsm.2010.5609557

§  Hongyu Zhang and Stan Jarzabek, A Hybrid Approach to Feature-Oriented Programming in XVCL, Proc. 14th International Software Product Line Conference (SPLC 2010), Jeju Island, South Korea, Sep. 2010, pp. 440-445.

 

§  Hongyu Zhang, An Investigation of the Relationships between Lines of Code and Defects, Proc. 25th IEEE International Conference on Software Maintenance (ICSM 2009), Edmonton, Canada, September 2009.

§  Stan Jarzabek, Yinxing Xue, Hongyu Zhang, Youpeng Lee, Avoiding Some Common Preprocessing Pitfalls with Feature Queries. APSEC 2009: 283-290

§  Lin Liu, Hongyu Zhang, Fei Peng, Wenting Ma, et al, Understanding Chinese Characteristics of Requirements Engineering, Proc. of 17th International Requirements Engineering Conference (RE’09), August 2009, Atlanta, USA, pp. 261-266.

§  Hongyu Zhang, Yuan-Fang Li and Hee Beng Kuan Tan, Measuring Design Complexity of Semantic Web Ontologies, Journal of Systems and Software, 83(5), 2009.

§  Hongyu Zhang, Discovering Power Laws in Computer Programs, Information Processing & Management, 45(4): 477-483, Elsevier, 2009.

§  Hongyu Zhang, Hee Beng Kuan Tan, The Distribution of Program Sizes and Its Implications: An Eclipse Case Study. CoRR abs/0905.2288

§  Stan Jarzabek, Hongyu Zhang, Youpeng Lee, Yinxing Xue and Naveed Shaikh, Increasing Usability of Preprocessing for Feature Management in Product Lines with Queries, 31st International Conference on Software Engineering (ICSE 2009), Vancouver, Canada, May 2009, pp. 215-218.

 

§  Hongyu Zhang, On the Distribution of Software Faults, IEEE Transactions on Software Engineering, vol. 34(2), March/April 2008. IEEE Press.

§  Hongyu Zhang, An initial study of the growth of eclipse defects. MSR 2008: 141-144.

§  Hongyu Zhang, The scale-free nature of semantic web ontology. WWW 2008: 1047-1048

§  Hongyu Zhang, Exploring Regularity in Source Code: Software Science and Zipf's Law, Proc. 15th Working Conference on Reverse Engineering (WCRE 2008), Antwerp, Belgium, October 2008.

§  Tan, H. B. K., Yuan Zhao, and Hongyu Zhang, Conceptual Data Model Based Software Size Estimation for Information System, ACM Transactions on Software Engineering and Methodology (TOSEM), 2008.

§  Hongyu Zhang and Xiuzhen Zhang, Comments on “Data Mining Static Code Attributes to Learn Defect Predictors”, IEEE Transactions on Software Engineering, IEEE Press, vol. 33(9), Sep 2007.

§  Hai Wang, Yuan Fang Li, Jing Sun, Hongyu Zhang and Jeff Pan, Verifying Feature Models using OWL, Journal of Web Semantics, Vol 5(2), June 2007, Elsevier, pp. 117-129.

§  Hongyu Zhang and Hee Beng Kuan Tan, An Empirical Study of Class Sizes for Large Java Systems, Proc. of 14th Asia-Pacific Software Engineering Conference (APSEC 2007), Nagoya, Japan, December 2007. IEEE Press, pp. 230-237.

§  Hongyu Zhang, Xiuzhen Zhang, Ming Gu, Predicting Defective Software Components from Code Complexity Measures, Proc. 13th IEEE Pacific Rim International Symposium on Dependable Computing Conference (PRDC 2007), Melbourne, Australia, December 2007, IEEE Press.

 

§  Tan, H B K, Yuan Zhao, and Hongyu Zhang, Estimating LOC for information systems from their conceptual models, Proc. of International Conference on Software Engineering (ICSE 2006), May 2006, Shanghai, China, pp. 321-333.

§  Stan Jarzabek, Hongyu Zhang, Shen Ru, Vu Tung Lam, and Sun Zhenxin, Analysis of Meta-Programs: An Example, International Journal of Software Engineering and Knowledge Engineering, 16(1), World Scientific, 2006.

§  Hongyu Zhang and Stan Jarzabek, A Bayesian Network Approach to Rational Architectural Design, International Journal of Software Engineering and Knowledge Engineering, vol. 15 (4), World Scientific, August 2005, pp. 695-717.

§  Jing Sun, Hongyu Zhang, Yuan Fang Li and Hai Wang. Formal Semantics and Verification for Feature Modeling. Proc. of 10th IEEE International Conference on Engineering of Complex Computer Systems (ICECCS'05), IEEE Press, Shanghai, China, June 2005. pp. 303-312.

§  Hongyu Zhang and Stan Jarzabek, XVCL: A Mechanism for Handling Variants in Software Product Lines, Science of Computer Programming, vol. 53 (3), Elsevier, Dec 2004, pp. 381-407.

§  Stan Jarzabek, Wai Chun Ong and Hongyu Zhang, Handling Variant Requirements in Domain Modeling, Journal of Systems and Software, vol. 68 (3), Elsevier, Dec 2003, pp. 171-182.

§  Hongyu Zhang and Stan Jarzabek, An XVCL Approach to Handling Variants: A KWIC Product Line Example, Proc. of 10th Asia-Pacific Software Engineering Conference (APSEC 2003), Chiangmai, Thailand, December 2003. IEEE Press.

§  Hongyu Zhang, Stan Jarzabek and Bo Yang, Quality Prediction and Assessment for Product Lines, Proc. of the 15th International Conference On Advanced Information Systems Engineering (CAiSE'03), Klagenfurt/Velden, Austria, June 2003. Lecture Notes in Computer Science (LNCS) 2681, Springer-Verlag, pp. 681-695.

§  Hongyu Zhang and Stan Jarzabek, An XVCL-based Approach to Software Product Line Development, Proc. of 15th International Conference on Software Engineering and Knowledge Engineering (SEKE'03), San Francisco, USA, July 2003, pp. 267-275.

§  Stan Jarzabek, Paul Bassett, Hongyu Zhang, and Weishan Zhang, XVCL: XML-based Variant Configuration Language. Proc. of 25th International Conference on Software Engineering (ICSE 2003), pp. 810-811.

§  Soe Myat Swe, Hongyu Zhang and Stan Jarzabek, XVCL: A Tutorial, Proc. of 14th International Conference on Software Engineering and Knowledge Engineering (SEKE'02), Ischia, Italy, 2002. ACM Press, pp. 341-349.

§  Hongyu Zhang, Stan Jarzabek and Soe Myat Swe, XVCL Approach to Separating Concerns in Product Family Assets, Proc. Third International Symposium on Generative and Component-based Software Engineering (GCSE 2001), Erfurt, Germany, September 2001. Lecture Notes in Computer Science (LNCS) 2186, Springer-Verlag, pp. 36-47.

§  Stan Jarzabek and Hongyu Zhang, XML-based Method and Tool for Handling Variant Requirements in Domain Models, Proc. Fifth IEEE International Symposium on Requirements Engineering (RE'01), Toronto, Canada, August 2001. IEEE Press, pp. 166-173.

Scholarly book chapters:

§  Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Dongmei Zhang, How to Tame Your Online Services, Chapter 12 of the book Perspectives on Data Science for Software Engineering (editors: T. Menzies, L. Williams, T. Zimmermann), Morgan Kaufmann, 2016, ISBN 978-01280420.

§  Zhitao Hou, Hongyu Zhang, Haidong Zhang, Dongmei Zhang, Visual Analytics for Software Engineering Data, Chapter 15 of the book Perspectives on Data Science for Software Engineering (editors: T. Menzies, L. Williams, T. Zimmermann), Morgan Kaufmann, 2016, ISBN 978-0128042069.

§  Hai Wang, Yuan Fang Li, Jing Sun, Hongyu Zhang, Jeff Z. Pan, Towards a Consistent Feature Model using OWL, a chapter of the book Semantic Web Enabled Software Engineering (editors: J. Pan, Y. Zhao), IOS Press, 2014. ISBN 978-1-61499-369-8.

§  Weishan Zhang, Stan Jarzabek, Hongyu Zhang, Neil Loughran, Awais Rashid, Software evolution with XVCL, Chapter VI of the book Software Evolution with UML and XML (editor H. Yang), Idea Group Publishing, 2005. ISBN 9781591404620.

 

 

Research Program Committee:

§  The 43rd International Conference on Software Engineering ICSE 2021, Technical Track

§  The thirty-seventh International Conference on Machine Learning (ICML 2020)

§  The 30th/33rd/34th/35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2015, ASE 2018, ASE 2019, ASE 2020).

§  The Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering ESEC/FSE 2015 (Industrial Track), FSE 2016, ESEC/FSE 2017, FSE 2018 (artifacts)

§  The 34th/35th/36th International Conference on Software Engineering ICSE 2012, ICSE 2013 (Formal Demonstration Track), ICSE 2014 (Tutorials and Technical Briefings)

§  The IEEE International Conference on Software Maintenance and Evolution (ICSM 2013, ICSME 2014, ICSME’16, ICSME'17, ICSME'18, ICSME'19).

§  The International Conference on Software Analysis, Evolution and Reengineering (SANER 2015, SANER 2016, SANER 2017, SANER 2018 (industrial), SANER 2020 (industrial))

§  The 10th/11th/12th Working Conference on Mining Software Repositories (MSR 2013, MSR 2014, MSR 2015, MSR 2017, MSR 2020)

§  The 15th/16th/17th European Conference on Software Maintenance and Reengineering (CSMR 2011, CSMR 2012, CSMR 2013).

§  The 5th/6th/7th/8th International Conference on Predictive Models in Software Engineering (PROMISE 2010, PROMISE 2011, PROMISE 2012, PROMISE 2013, PROMISE 2014, PROMISE 2015, PROMISE 2016)

§  The 24th/25th/26th International Conference on Software Engineering and Knowledge Engineering (SEKE 2012, SEKE 2013, SEKE 2014)

§  The joint 10th International Workshop on Principles of Software Evolution and the 5th ERCIM Workshop on Software Evolution (IWPSE/EVOL'09), 24-25 August (co-located with ESEC/FSE 2009), Amsterdam.

§  The 10th International Conference on Agile Processes and eXtreme Programming in Software Engineering (XP 2009), May 26-30, 2009, Sardinia, Italy

§  The 3rd/6th/7th/8th IEEE International Symposium on Theoretical Aspects of Software Engineering (TASE 2009, TASE2012, TASE 2013, TASE 2014)

§  The 15th/16th/17th/18th/19th/20th/21st/22nd Asia-Pacific Software Engineering Conference (APSEC 2016, APSEC 2015, APSEC 2014, APSEC 2013, APSEC 2012, APSEC 2011, APSEC 2010, APSEC 2009, APSEC 2008)

§  The 17th/18th/19th/20th/22nd/23rd/24th Australian Software Engineering Conference (ASWEC 2015, ASWEC 2014, ASWEC 2013, ASWEC 2010, ASWEC 2009, ASWEC 2008, ASWEC 2007, ASWEC 2006)

§  The 2nd/3rd SEMAT Workshop on a General Theory of Software Engineering (GTSE 2013, GTSE 2014)

§  The 13th/14th International Conference on Quality Software (QSIC 2013, QSIC 2014)

§  The 38th Annual International Computers, Software & Applications Conference (COMPSAC 2014)

§  The 9th International Conference on Global Software Engineering (ICGSE 2014)

§  The 24th International Conference on Program Comprehension (ICPC 2016)

§  The 10th International Symposium on Empirical Software Engineering and Measurement (ESEM 2016, ESEM 2017, ESEM 2018, ESEM 2019)

§  The 18th International Conference on Software and Systems Reuse (ICSR 2019)

§  The 24th International Systems and Software Product Line Conference (SPLC 2020)

 

Program Organizations:

§  General co-chair, The 36th International Conference on Software Maintenance and Evolution (ICSME 2020)

§  Program co-chair, The 18th IEEE International Conference on Software Quality, Reliability, and Security (QRS 2018)

§  Program co-chair, The 25th Asia-Pacific Software Engineering Conference (APSEC 2018)

§  Tool Demonstration co-chair: The International Symposium on Software Testing and Analysis (ISSTA 2019)

§  Short Paper chair: 2018 Australian Software Engineering Conference (ASWEC 2018)

§  Co-organizer: Dagstuhl Seminar 17502 on "Testing and Verification of Compilers", Dec 2017, Germany.

§  Program co-chair, Early Research Achievements (ERA) track, ICSME'16.

§  Program co-chair, PROMISE'16.

§  The International Conference on Predictive Models in Software Engineering (PROMISE), 2014-Now. (Steering Committee Member)

§  The Second International Workshop on Software Mining (SoftMine-2013, co-located with ASE'13),  Silicon Valley, CA, November 2013.  (co-organizers)

§  The 8th International Workshop on Advanced Modularization Techniques (AOAsia/Pacific 2013), a workshop at AOSD 2013, March 2013.

§  The First International Workshop on Software Mining (SoftMine-2012, co-located with KDD'12),  Beijing, China, May 2012.  (co-organizers)

§  The 12th International Conference on Quality Software (QSIC 2012), August 2012, Xi'an, China. (industry track co-chairs)

§  The 26th European Conference on Object-Oriented Programming (ECOOP 2012), June 2012, Beijing, China. (local organisation co-chairs)

§  ICSE 2014 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2014), India, June 2014. (co-organizers)

§  ICSE 2012 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2012), Zurich, Switzerland, June 2012. (co-organizers)

§  ICSE 2011 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2011), May, 2011, Honolulu, Hawaii, USA. (co-organizers)

§  ICSE 2010 Workshop on Emerging Trends in Software Metrics (WETSoM @ ICSE 2010), May 4, 2010, Cape Town, South Africa. (co-organizers)

§  The 1st International Symposium on Emerging Trends in Software Metrics (ETSM 2009), 26 May, 2009, Pula, Sardinia, Italy. (co-organizers)

§  15th Asia-Pacific Software Engineering Conference (APSEC 2008), Beijing, China, Dec 2008 (publicity chair). 

 

I am on the Editorial Boards of the Journal of Systems and Software and Elsevier Array.
I am a frequent reviewer for the following international journals: IEEE Transactions on Software Engineering, IEEE Software, IEEE Transactions on Knowledge and Data Engineering, Empirical Software Engineering, International Journal of Software Engineering and Knowledge Engineering, Science of Computer Programming, Software Quality Journal, Software Practice & Experience, Journal of Software Maintenance and Evolution...

I am also invited to review proposals for the National Natural Science Foundation of China (NSFC), the Natural Sciences and Engineering Research Council of Canada (NSERC), the European Research Council (ERC), the Singaporean National Satellite of Excellence (NSoE), the Hong Kong Research Grants Council (RGC), and the Australian Research Council (ARC).

Recent Invited Talks/Seminars:

§  Keynote: Prediction Models in Software Engineering, The 2nd Forum on Mining Software Repository, Hangzhou, China, Nov 2019.

§  Keynote: Intelligent Fault Diagnosis and Prediction through Data Analytics, The 6th International Workshop on Quantitative Approaches to Software Quality (QuASoQ 2018).

§  Invited: Towards Effective Code Reuse by Searching, Workshop on Machine Learning and Software Engineering, National Institute of Informatics (NII), Tokyo, Dec 3, 2018.

§  Invited: AI-Enabled Software and Service Engineering, The 2018 Computing in the 21st Century Conference & Asia Faculty Summit, Microsoft Research Asia, Nov 2018.

§  Invited: Log-based Fault Diagnosis for Large-Scale Software Systems, Asian-Pacific Workshop of Advanced Software Engineering, Gold Coast, Australia, Nov 2018.

§  Invited: Towards Intelligent Software Development, The First Yanqi Meeting on Automatic Software Engineering, Beijing, China, Oct 2018.

§  Invited: Towards Intelligent Code Reuse, 2017 China Software Engineering Research and Industry Summit, Sep 2017, Shanghai, China.

§  Keynote: Improving Software Quality through Data Analytics, The 7th International Workshop on Emerging Trends in Software Metrics, May 15, 2016, Austin, Texas, USA (co-located with ICSE 2016)

§  Keynote: Software Analytics: Data-Driven Software Engineering, The Fourth International Workshop on Software Mining, Nov 2015, Lincoln, Nebraska, USA (co-located with ASE 2015)

§  Invited: Code Search: Research and Practice, The 3rd Chinese Forum of Software Engineering Research and Practice (SERP 2016), July 20, 2016, Beijing, China.

§  Invited: Towards a Theory of Software Engineering, The 5th International Workshop on Theory-Oriented Software Engineering, May 15, 2016, Austin, Texas, USA (co-located with ICSE 2016)

§  Invited: Effective Bug Management via Software Analytics, 4th International Symposium on High Confidence Software (ISHCS 2015), Jan 2015, Beijing, China.

§  Invited: Monte Verita Symposium on Developer Support, Switzerland, March 2012.

§  Invited: MSR (Mining Software Repository) Vision 2020, Canada, August 2012.

§  Invited: Symposium on Advanced Software Engineering Techniques, Shanghai Jiaotong University, 2012.

§  Invited: Symposium on Software Quality and Analysis, Nanjing University, 2012.

§  Seminar: at University of Science and Technology Beijing, April 2016.

§  Seminar: at Chinese Academy of Sciences, April 2016.

§  Seminar: at Tsinghua University, May 2014.

 

Visiting Positions:

I was a visiting professor/researcher at the following organizations:

§  University of Cagliari, Italy (1/2011 – 3/2011)

§  Microsoft Research Asia (7/2012 – 8/2012)

§  Swinburne University of Technology, Australia (8/2012 – 9/2012)

§  The Hong Kong University of Science and Technology (10/2012 – 3/2013)

Teaching:

I taught the following courses to postgraduate and undergraduate students: 

§  Software Verification and Validation

§  Software Engineering Final Year Project

§  Software Analysis (a joint course with Peking University) 

§  A MOOC course on Big Data  

§  Software Measurement and Process Improvement (first semester)

§  Software Quality Engineering (second semester; this course was rated in the top 15% of all postgraduate courses offered at Tsinghua University in 2011)

§  ISYS1081/2368 "Software Reuse"  

§  COSC1073 "Programming Principles 1A" (a first-year Java programming course)

§  COSC1095/1295 "Programming Principles 2J"/"Java For Programmers"   

§  COSC1252/1264 "Object-Oriented Programming"

Students:

I am grateful to have had the privilege of advising the following brilliant students/interns:

Liya Chakma, Rongxin Wu (now at HKUST), Jian Zhou (now at Baidu), Liang Gong (now at UC Berkeley), Jianxun Yang, Shuijin Lu, Jue Wang (now at Postal Bank), Shuai Chen (now at Facebook), Wei Li (now at Google), Jiangtao Gong (now at Tsinghua), Ke Ma, Bei Shi (now at CUHK), Lu Zhang (now at Virginia Tech), Zeqi Shen, Yu Cao...

Fei Lv (now at Alibaba), Galina Meyer (now at Stanford), Qing Ren (now at UCLA), Pinjia He, Sheng Tian, Wenhao Song, Senlan Yao (now at Google), Bonan Dong (now at Cornell), Xutong Chen, Wangsheng Hu, Hong Wu, Jinbo Pan, Xiaodong Gu, Wenxiang Hu, Chengxun Shu, Xingzhao Yue... 

Note: If I have accidentally missed any of you, please do email me (and forgive me). Please also let me know your latest status.

 

My Erdős number is 4: Hongyu Zhang - Stanislaw Jarzabek - Tomasz Krawczyk - William T. Trotter, Jr. - Paul Erdős

I am a senior member of IEEE.

 

(Last updated: June 2019)

Psalm 67:1-3: May God be gracious to us and bless us, and make his face shine on us, so that your ways may be known on earth, your salvation among all nations.