image001.jpg

黄宜华 博士, 教 授,博导

Yihua Huang, Ph.D., Professor

bw必威西汉姆联官网 计算机科学与技术系

Department of Computer Science & Technology Nanjing University

PASA大数据技术研究组

PASA Big Data Research Lab

中国计算机学会大数据专家委员会

副主任

江苏省计算机学会大数据专家委员会

主任

江苏省数字经济学会

副理事长

 

主要研究兴趣

主要学习工作经历

近期承担的研究项目

教学工作

 

获奖

书籍与研究论文

代表性研究成果与评价

会议报告

个人爱好

    联系信息                                                                                         Contact Information

邮件:

黄宜华

Mail:

Yihua Huang

 

bw必威西汉姆联官网计算机科学与技术系

 

Department of Computer Science & Technology, Nanjing University

南京市栖霞区仙林大道163

163 Xianlin Road, Nanjing

中国南京 210023

Nanjing 210023, China

办公室:

计算机系大楼408

Office:

408, Computer Department Building

bw必威西汉姆联官网仙林校区

Xianlin Campus of Nanjing University

电话:

025-8968-6517

Tel:

025-8968-6517

邮箱:

yhuang@nju.edu.cn

Email:

yhuang@nju.edu.cn

 

    主要研究兴趣                               Research Interest

         大数据智能化分析应用

Analytic Applications for Big Data

         大数据分布并行处理

Big Data Distributed & Parallel Processing

         大数据机器学习算法与系统

Machine Learning Algorithms & Systems for Big Data

         文本语义分析

Text semantic analysis

         Web数据挖掘

Web Data Mining

 

               承担/完成研究项目                        Research Projects

21.   视图公共安全应用体系建设科技示范

21. Applications for Public Safety for Large-scale Media Data

江苏省科技厅重点项目课题(项目号BE2021729

Jiangsu Province Science & Tech Research ProgramBE2021729))

2021-2023,课题负责人

      2021-2023PI

20.   中药分子标识研究及中药智慧云信息平台建设

20. Study on Herb Molecular Markers & Chinese Herb Cloud Information Platform

国家重点研发计划项目(2019YFC1711000

  National Key R&D Program of China2019YFC1711000

2020-2021,课题负责人

  2020-2021PI for sub-project

19.  大数据计算的混合编程环境与大数据分析处理系统支撑平台

19. Hybrid Programming Environment & Platform for Big Data Analytics & Processing

国家自然基金重点课题(项目号U181461

  China NSF Research Program#U181461

2019-2022,课题负责人

  2019-2022PI

18.  跨平台统一大数据分析处理与可视化编程系统平台

18. Cross-platform Big Data Analytic & Processing & Virtual Programming Platform

江苏省科技厅重点项目(项目号BE2017155

  Jiangsu Province Science & Tech Research Program# BE2017155

2017-2020,项目负责人

  2017-2020PI

17.  大数据OLAP分析引擎及Flink实时计算技术

17. OLAP Analytic Engine & Flink Real-time Computation for Big Data

华为合作项目,2020

  Huawei, 2020

16.  AutoML算法平台及其应用

16. AutoML AlgorithmsPlatform & Applications

华为合作项目,2018-2019

Huawei2018-2019

15.  证券行情数据回放系统与统一大数据分析平台

15. Securities Market Data Replay System & Unified Big Data Analytic Platform

     华泰证券,2017-2018

  Huatai2017-2018

14.  基于Alluxio的多HDFS NameNode路由选择和热数据缓存    

14. Large Scale Text Analysis & Deep Recommendation Algorithm & System

苏宁云商,2017

  Suning2017

13.  大规模文本分析与深度推荐算法与系统   

13. Large Scale Text Analysis & Deep Recommendation Algorithm & System

微软亚洲研究院合作项目,2016-2017,项目负责人 

  Microsoft Asia Research Lab Program 2016-2017PI

12.  大数据分层式存储系统缓存调度策略与框架研究  

12. Cache Schedule Policy & Framework for Hierarchical Big Data Storage System

Intel公司合作项目,2016-2017,项目负责人 

  Intel Research Program2016-2017PI

11.  大数据机器学习与数据分析统一编程计算模型与关键技术研究 

11. Unified Programming Model & Key Techniques for Big Data Machine Learning & Data Analysis

国家自然基金面上项目 (项目号61572250

  China NSF Research Program #61572250

2016-2019,项目负责人

  2016-2019PI

10. 大数据并行化分析计算统一编程框架与软件平台 

10. Unified Programming Framework & Platform for Big Data Analysis

江苏省科技支撑计划项目(项目号BE2014131

  Jiangsu Province Science & Tech. Support ProgramBE2014131

2014-2017,项目负责人

  2014-2017PI

9. 大规模软件结构智能化分析算法与系统平台 

  9. Algorithms & Platform for Large Scale Software Structure Analysis

华为公司合作项目,2015-2016,项目负责人 

  Huawei2015-2016PI

8. Apache Alluxio优化与功能增强

  8. Optimization and Enhancement for Apache Alluxio

Apache Alluxio开源社区合作研究, 2014-现在

  Apache Alluxio Open Source Research, 2014-Present

7. Apache Spark优化与功能增强

  7. Optimization and Enhancement for Apache Spark

Apache Spark开源社区合作研究, 2014-2015

  Apache Spark Open Source Research, 2014-2015

6. 面向大数据的媒体内容分析与关联语义挖掘研究

  6. Research on Big Media Data Content Analysis and Associated Semantic Mining

国家自然科学基金专项基金项目(项目号61223003

      China National Science Foundation Special Research Grant(#61223003)

       2013.1-2016.12,项目主要参与者

  1/2013-12/2016, Co-PI

5. Gradient Boosting决策树Spark并行化训练算法研究

  5. Gradient Boosting Decision Tree Parallel Training Algorithm with Spark

   百度主题研究项目, 2014,项目负责人

      Baidu Research Project, 2014, PI

4. HBase二级索引与查询技术研究

  4. Secondary Index and Query for HBase

    中兴通讯,项目负责人,2013-2014

      ZTE, China. 2013-2014, PI

    3. 大规模中文文本语义分析与医疗文本挖掘

  3. Large Scale Chinese Text Semantic Analysis and Medical Record Mining  

        美国Intel Labs研究项目,  2013.4-2014.3,项目负责人

      USA Intel Labs URO Funding, 4/2013-3/2014, PI

2. 面向复杂结构的精确Web信息抽取集成模型与关键技术研究

  2. Research on Model and Techniques for Web Info Extraction & Integration

        国家自然科学基金面上项目(项目号61072152

  China National Science Foundation Research Grant(#61072152)

2011.1-2013.12,项目负责人

  1/2011-12/2013, PI

    1. 精确信息定制服务Web信息抽取集成通用引擎与服务软件平台

  1. Accurate Web Info Extraction and Integration Engine and Service Platform

        江苏省科技支撑计划项目(项目号BE2011172

      Jiangsu Province Science & Technology Research Grant (#BE2011172)

        2011.4-2013.12,项目负责人

      4/2011-12/2013, PI

 

 

 

    主要学习和工作经历                                         

       2008-现在   bw必威西汉姆联官网计算机科学与技术系  教授

 

       2002-2008  美国佐治亚医学院生物技术与基因药物研究中心 研究员

 

       1998-2001  美国佛罗里达大学数据库研究中心 访问学者

 

       1998-2001  bw必威西汉姆联官网计算机科学与技术系  教授

 

       1993-1997  bw必威西汉姆联官网计算机科学与技术系  副教授

 

       1988-1993  bw必威西汉姆联官网计算机科学与技术系  讲师

 

       1986-1988  bw必威西汉姆联官网计算机科学与技术系  助教

 

       1994-1997  bw必威西汉姆联官网计算机科学与技术系  博士

 

       1983-1986  bw必威西汉姆联官网计算机科学与技术系  研究生

 

       1979-1983  bw必威西汉姆联官网计算机科学与技术系  本科

 

 

    教学工作                                         

 

 

讲授课程:

大规模数据并行处理(本科与研究生)
(Google大学合作部网站课件下载)

曾开设课程:

Web技术与应用开发

 

 

计算机原理

课程建设:

计算机硬件类课程群建设与实验教学研究

 

微机原理与接口 

 

 

 

程序设计语言

年级导师:

第一讲:如何尽快适应大学学习和生活(PDF

 

中文信息处理

 

第二讲:欲立业先立人-大学时代个人品德和综合素质的培养(PDF

 

数字电路设计 

 

第三讲:计算机学科专业、课程和知识体系(PDF

 

 

研究生培养:

研究生学习培养要求与指南(课题组内使用)(PDF

 

 

    获奖                                         

2020年研究生团队荣获 KDD Cup AutoML国际大赛第二名

 

2019年研究生团队荣获第五届中国互联网+大学生创新创业大赛国赛金奖

 

2019年研究生团队荣获 NeurIPS AutoSpeech 国际大赛第一名

 

2019年研究生团队荣获 NeurIPS AutoDL 国际大赛第三名

 

2019年研究生团队荣获 KDD Cup AutoML 国际大赛TOP10优胜奖

 

2019年研究生团队荣获 ACML AutoSpeech 国际大赛第一名

 

2019年研究生团队荣获 ACML AutoWSL 国际大赛第四名

 

2019年研究生团队荣获 WAIC AutoNLP 国际大赛第七名

 

2018年研究生团队荣获 NeurIPS AutoML 国际大赛第三名

 

2020年研究生团队荣获 2018 PAKDD AutoML国际大赛第三名

 

2016年研究生团队荣获SortBenchmark国际排序大赛CloudSort国际冠军

2016年研究生团队荣获教育部第二届全国高校云计算应用创新大赛大数据技能赛冠军

2015年研究生团队荣获教育部第一届全国高校云计算应用创新大赛大数据技能赛冠军

2012Google奖教金

2012年课程研究生组队参赛第一届“中国云/移动互联网创新大奖赛”,获得9项奖

2000年江苏省科技进步二等奖

1993年江苏省科技进步二等奖

1997年第三届中国PC应用软件设计大赛优胜奖

1997/1996/1995年bw必威西汉姆联官网优秀青年教师

1995年 江苏省八五先进科技工作者

1995国家教委教材二等奖,bw必威西汉姆联官网优秀教材一等奖

1992年江苏省优秀软件一等奖

1991年bw必威西汉姆联官网科技开发特别贡献奖

 

 

  兴趣爱好                                         

 

    个人爱好                                         

 

 

   乒乓球,阅读,哲学,中国传统文化,中医保健

散文:远走高飞的小鸟

 

 

 

    书籍与发表论文                                          Publications

 

书籍《深入理解大数据大数据处理与编程实践》,机械工业出版社,2014,国家教委计算机教指委计算机类专业系统能力培养系列教材。

研究论文:

1.      Rong Gu, Han Yin, Weichang Zhong, Chunfeng Yuan, Yihua Huang. Meces: Latency-effcient Rescaling via Prioritized State Migration for Stateful Distributed Stream Processing Systems. accepted by USENIX Annual Technical Conference USENIX ATC 2022CCF-A类会议), to appear.

2.      Jingfan Chen, Wenqi Fan, Guanghui Zhu, Xiangyu Zhao, Chunfeng Yuan, Qing Li, and Yihua Huang. Knowledge-enhanced Black-box Attacks for Recommendations. accepted by the 28rd SIGKDD conference on Knowledge Discovery and Data Mining (SIG KDD 2022,CCF A), to appear.

3.      Guanghui Zhu, Zhuoer Xu, Chunfeng Yuan, and Yihua Huang. DIFER: Differentiable Automated Feature Engineering. accepted by the 1st International Conference on Automated Machine Learning (AutoML-Conf 2022),  to appear.

4.      Rong Gu, Kai Zhang, Zhihao Xu, Yang Che, Bin Fan, Haojun Hou, Haipeng Dai, Li Yi, Yu Ding, Guihai Chen and Yihua Huang. Fluid: Dataset Abstraction and Elastic Acceleration for Cloud-native Deep Learning Training Jobs. Accpted by (IEEE ICDE 2022, CCF-A), to appear.

5.      Rong Gu, Yuquan Chen, Shuai Liu, Haipeng Dai, Guihai Chen, Kai Zhang, Yang Che, and Yihua Huang. Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters. Accpted by (IEEE TPDS, CCF-A), to appear.

6.      Rong Gu, Jun Shi, Xiaofei Chen, Zhaokang Wang, Yang Che, Kai Zhang, Yihua Huang. Octopus-DF: Unified DataFrame-based Cross-platform Data Analytic System. Accpted by (PARCO, CCF-B), to appear.

7.      Guanghui Zhu, Feng Cheng, Defu Lian, Chunfeng Yuan, and Yihua Huang. NAS-CTR: Efficient Neural Architecture Search for Click-Through Rate Prediction. Proc. of the ACM 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022, CCF A), accepted, 2022.

8.      Jingfan Chen, Guanghui Zhu, Haojun Hou, Chunfeng Yuan, and Yihua Huang. AutoGSR: Neural Architecture Search for Graph-based Session Recommendation. Proc. of the ACM 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022, CCF A), accepted, 2022.

9.      Guanghui Zhu, Wenjie Wang, Zhuoer Xu, Feng Cheng, Mengchuan Qiu, Chunfeng Yuan, and Yihua Huang. PSP: Progressive Space Pruning for Efficient Graph Neural Architecture Search. Proc. of the IEEE 38th International Conference on Data Engineering (ICDE 2022, CCF A), accepted, 2022.

10.    Chengcheng Mai, Mengchuan Qiu, Kaiwen Luo, Ziyan Peng, Jian Liu, Chunfeng Yuan, Yihua Huang. Pretraining Multi-modal Representations for Chinese NER Task with Cross-Modality Attention. Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining (WSDM, CCF-B), pp. 726�734, 2022.

11.    Rong Gu, Zhiqiang Zuo, Xi Jiang, Han Yin, Zhaokang Wang, Linzhang Wang, Xuandong Li, and Yihua Huang. Towards Efficient Large-scale Interprocedural Program Static Analysis on Distributed Data-Parallel Computation. IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS, CCF-A). Vol.32(4), 2021, pp. 867-883.

12.    Zhaokang Wang, Weiwei Hu, Guowang Chen, Chunfeng Yuan, Rong Gu, Yihua Huang. Towards Efficient Distributed SubgraphEnumeration via Backtracking-based Framework. IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS, CCF-A). Vol.32(12), 2021, pp. 2953-2969.

13.    Rong Gu, Yang Qi, Tongyu Wu, Zhaokang Wang, Xiaolong Xu, Chunfeng Yuan, Yihua Huang. SparkDQ: Efficient Generic Big Data Quality Management on Distributed Data-Parallel Computation. Journal of Parallel and Distributed Computing (JPDC, CCF-B). Vol.156(1), 2021, pp. 132-147. Rong Gu, Zhixiang Zhang, Zhihao Xu, Zhaokang Wang, Kai Zhang, Chunfeng Yuan, Yihua Huang. Alchemy: Distributed Financial Quantitative Analysis System with High-level Programming Model. Software: Practice and Experience (SPE, CCF-B). Vol.51(8), 2021, pp. 1676-1699.

14.    Rong Gu, Chongjie Li, Haipeng Dai, Yili Luo, Xiaolong Xu, Shaohua Wan, Yihua Huang. Improving In-Memory File System Reading Performance by Fine-Grained User-Space Cache Mechanisms. Journal of Systems Architecture (JSA, CCF-B). Vol.115(1), 2021, pp. 1-15.

15.    Zhaokang Wang, Shen Wang, Junhong Li, Chunfeng Yuan, Rong Gu and Yihua Huang. Distributed Local Structural Vertex Similarity Calculation on Big Graphs. Journal of Parallel and Distributed Computing (JPDC, CCF-B). Vol.158(1), 2021, pp. 29-46.

16.    Zhaokang Wang, Yunpan Wang, Chunfeng Yuan, Rong Gu, Yihua Huang. Empirical Analysis of Performance Bottlenecks in Graph Neural Network Training and Inference with GPUs. NeuroComputing (CCF-C). Vol.446(1), 2021, pp. 165-191.

17.    Zhuoer Xu, Guanghui Zhu, Chunfeng Yuan, and Yihua Huang. One-Stage Tree: End-to-End Tree Builder and Pruner. Machine Learning Journal (MLJ, CCF B), pp.1-27, 2021.

18.    Zhaokang Wang, Junhong Li, Yifan Qi, Guanghui Zhu, Chunfeng Yuan, and Yihua Huang. UniGPS: A Unified Programming Framework for Distributed Graph Processing. Proc. of the 27th International Conference on Parallel and Distributed Systems (ICPADS, CCF C), accepted, 2021.

19.    Guanghui Zhu, Feng Cheng, Mengchuan Qiu, Zhuoer Xu, Wenjie Wang, Chunfeng Yuan, and Yihua Huang. Progressive AutoSpeech: An Efficient and General Framework for Automatic Speech Classification. Proc. of the 25th Pacific-Asia Conference on Knowledge Discovery andData Mining (PAKDD, CCF C), pp. 168-180, India, 2021.

20.    Chengcheng Mai, Xueming Qiu, Kaiwen Luo, Min Chen, Bo Zhao, Yihua Huang. TSSE-DMM: Topic Modeling for Short Texts Based on Topic Subdivision and Semantic Enhancement. Proceedings of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD, CCF-C), pp. 640�651, India, 2021.

21.    麦丞程、陈玉婷、仇学明、刘健、赵博、袁春风、黄宜华. 公共服务热线中基于地域自适应的突发事件实时检测方法. 《计算机学报》 (CCF A)2020Vol. 43 (12) : 2259-2275.

22.    Guanghui Zhu and Ruancheng Zhu. Accelerating Hyperparameter Optimization of Deep Neural Network via Progressive Multi-Fidelity Evaluation. Proc. of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD, CCF C), pp. 752-763, Singapore, 2020.

23.    Zhaokang Wang, Rong Gu, Weiwei Hu, Chunfeng Yuan, Yihua Huang. BENU: Distributed Subgraph Enumeration With Backtracking-based Framework. Proc. of the IEEE International Conference on Data Engineering (ICDE 2019), 136-147, 2019, DOI 10.1109/ICDE.2019.00021.

24.    Guanghui Zhu Xiaoqi WuChunfeng Yuan Yihua Huang. HyMJ: A Hybrid Structure-Aware Approach to Distributed Multi-Way Join Query. IEEE International Conference on Data Engineering (ICDE 2019)short paper

25.    Guanghui Zhu, Qian Wang, Qiwei Tang, Rong Gu, Chunfeng Yuan and Yihua Huang. Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms. IEEE Transactions on Parallel and Distributed Systems (TPDS'2019). 30(2): 2663-2676 (2019)  DOI: 10.1109/TPDS.2019.292501414.

26.    Rong Gu, Yufa Zhou, Zhaokang Wang, Chunfeng Yuan, and Yihua Huang. Penguin: Efficient Query-based Framework for Replaying Large Scale Historical Data. IEEE Transactions on Parallel and Distributed Systems (TPDS'18).2018, DOI: 10.1109/TPDS.2018.2829759

27.    Rong Gu, Yun Tang, Chen Tian, Hucheng Zhou, Guanru Li, Xudong Zheng, and Yihua Huang. Improving Execution Concurrency of Large-Scale Matrix Multiplication on Distributed Data-Parallel Platforms. IEEE Transactions on Parallel and Distributed Systems (TPDS'17). Vol.28(9), 2017, pp. 2539-2552.

28.    Guanghui Zhu, QiuHu,Rong Gu, Chunfeng Yuan, and Yihua Huang. ForestLayer: Efficient training of deep forests on distributed task-parallel platforms. Journal of Parallel and Distributed Computing (JPDC'2019) . Vol(132):113-126.

29.    Guanghui Zhu, Chen Guo, Le Lu, Zhi Huang, Chunfeng Yuan, Rong Gu* and Yihua Huang*. DGST: Efficient and Scalable Suffix Tree Construction on Distributed Data-Parallel Platforms. Parallel Computing (PC'2019), Vol(87):87-102.

30.    Rong Gu, Shiqing Fan, Qiu Hu, Chunfeng Yuan and Yihua Huang. Parallelizing Machine Learning Optimization Algorithms on Distributed Data-Parallel Platforms with Parameter Server. Proc. of the 24th International Conference on Parallel and Distributed Systems (ICPADS 2018), pp. 126-133, Sentosa, Singapore, Dec.11 - 13, 2018.

31.    Rong Gu, Chongjie Li, Peng Shu, Chunfeng Yuan, Yihua Huang. Adaptive Cache Policy Scheduling for Big Data Applications on Distributed Tiered Storage System. Concurrency and Computation: Practice and Experience. 31(15):1-25 (2019) DOI:10.1002/cpe.5138 .

32.    Rong Gu, Min Chen, Wenjia Yang, Chunfeng Yuan and Yihua Huang. Seal: Efficient Training Large Scale Statistical Machine Translation Models on Spark. Proc. of the 24th International Conference on Parallel and Distributed Systems (IEEE ICPADS 2018), pp. 118-125, Sentosa, Singapore, Dec.11 - 13, 2018.

33.    Rong Gu, Kaixuan Huang, Zhixiang Zhang, Chunfeng Yuan and Yihua Huang. Push-based Network-efficient Hadoop YARN Scheduling Mechanism for In-memory Computing. Proc. of the 25th International Conference on Parallel and Distributed Systems (IEEE ICPADS 2019),133-140, 2019. DOI 10.1109/ICPADS.2019.00026

34.    Guanghui Zhu, Xiaoqi Wu, RongGu, Chunfeng Yuan, Yihua Huang. AutoMJ: Towards Efficient Multi-way Join Query on Distributed Data-Parallel Platform. in Proceedings of the 23rd International Conference on Parallel and Distributed Systems (ICPADS 2017), pp. 161-169, Shenzhen, China, 15-17 Dec., 2017.

35.    Wei Ge Xianxian LiChunfeng Yuan Yihua Huang. Correlation-aware partitioning for skewed range query optimization. World Wide Web(WWW'18), 2018, pp 1-27.

36.    Bo Zhao, Hucheng Zhou, Guoqiang Li, and Yihua Huang . ZenLDA: Large-Scale Topic Model Training on Distributed Data-Parallel Platform. Big Data Mining and Analytics, March 2018, 1(1): 57-74

37.    Peng Shu, RongGu, Qianhao Dong, Chunfeng Yuan, Yihua Huang. Accelerating Big Data Applications on Tiered Storage System with Various Eviction Policies. Proc. of the IEEE International Symposium on Parallel and Distributed Processing with Applications (IEEE ISPA 2016), pp. 1350 - 1357, Tianjin, China, 23-26 August, 2016.

38.    Stock Market Prediction Exploiting Microblog Sentiment Analysis. Bo Zhao, Yongji He, Chunfeng Yuan and Yihua Huang. International Joint Conference on Neural Networks (IJCNN 2016), 24-29 July, p4482-4488, Vancouver, Canada.

39.    PTR: Phrase-Based Topical Ranking for Automatic Keyphrase Extraction in Scientific Publications. Minmei WangBo ZhaoChunfeng Yuan Yihua Huang. International Conference on Neural Information Processing(ICONIP2016)2016.10.16-21. Tokyo, Japan

40.    Goldfish:基于矩阵分解的大规模RDF数据存储与查询系统. 顾荣, 仇红剑, 杨文家, 胡伟, 袁春风, 黄宜华. 《计算机学报》, 201710期,p2212-2230

41.    SCoS:基于Spark的并行谱聚类算法设计与实现. 朱光辉,黄圣彬,袁春风,黄宜华. 《计算机学报》,20176

42.    Rong Gu, Shanyong Wang, FangFang Wang, Chufeng Yuan, Yihua Huang. Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark. 2015 IEEE International Parallel & Distributed Processing Symposium (IPDPS 2015), India, May 25-29, 2015

43.    Rong Gu, Xiaoliang Yang, Jinshuang Yan, Yuanhao Sun, Bing Wang, Chunfeng Yuan, and Yihua Huang. SHadoop: Improving MapReduce Performance By Optimizing Job Execution Mechanism in Hadoop Clusters. Journal of Parallel and Distributed Computing(JPDC'14). Vol.74(3), 2014, pp. 2166-2179.

44.    Rong Gu, Wei Hu, Yihua Huang. Rainbow: A Distributed and Hierarchical RDF Triple Store with Dynamic Scalability. Proc. of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), p 561-566, Oct. 27-30, 2014. Washington, USA.

45.    Shengsheng Shi, Chengfei Liu, Chunfeng Yuan, Yihua Huang. Multi-Feature and DAG-Based Multi-Tree Matching Algorithm for Automatic Web Data Mining. The 2014 Web Intelligence Congress(WI 2014), Aug. 11-14, 2014. Warsaw, Poland.

46.    Hongjian Qiu, Rong Gu, Chunfeng Yuan and Yihua Huang. YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark. The 3rd International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics(ParLearning 2014), conjunction with IPDPS 2014, May 23, 2014. Phoenix, USA

47.    Lei Jin, Rong Gu, Chunfeng Yuan and Yihua Huang. Large Scale Deep Learning On Xeon Phi Many-core Coprocessor. The 3rd International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics(ParLearning 2014), conjunction with IPDPS 2014, May 23, 2014. Phoenix, USA

48.    Ge, Wei; Huang, Yihua; Zhao, Di; Luo, Shengmei; Yuan, Chunfeng; Zhou, Wenhui; Tang, Yun; Zhou, Juan. CinHBa: A secondary index with hotscore caching policy on key-value data store. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v 8933, p 602-615, 2014.

49.    顾荣, 王芳芳, 袁春风, 黄宜华. YARM:基于MapReduce的高效可扩展的语义推理引擎. 《计算机学报》,01期,pp 74-852014/8.

50.    顾荣,严金双, 杨晓亮, 袁春风, 黄宜华. Hadoop MapReduce短作业执行性能优化. 《计算机研究与发展》,2014Vol. 51 (6): 1270-1280.

51.    赵博, 黄书剑, 戴新宇, 袁春风, 黄宜华. 基于分布内存数据库的并行化层次短语机器翻译算法.《计算机研究与发展》,2014Vol. 51 (12): 2724-2732.

52.    Rong Gu, Furao Shen, and Yihua Huang. A Parallel Computing Platform for Training Large Scale Neural Networks. Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2013), pp. 376 - 384, Santa Clara, CA, USA, Oct. 6-9, 2013.

53.    Shengsheng Shi, Wu Wei, Yulong Liu, Haitao Wang, Lei Luo, Chunfeng Yuan, and Yihua Huang. NEXIR: A Novel Web Extraction Rule Language toward a Three-Stage Web Data Extraction Model. The 14th International Conference on Web Information System Engineering (WISE2013), Nanjing, China, 13-15 Oct. 2013. WISE 2013, Part I, “Lecture Notes in Computer Science” Proceedings 8180, p29-42, Springer-Verlag Berlin Heidelberg, 2013.

54.    Wu Wei, Shengsheng Shi, Yulong Liu, Haitao Wang, Chunfeng Yuan and Yihua Huang. Extraction Rule Language for Web Information Extraction and Integration. The 10th Web Information System and Application Conference(ISA2013), p65-70, Nov. 1-3, Yangzhou, China, 2013.

55.    Shengsheng Shi, FuliangQuan,Tao Xie,Chunfeng Yuan and Yihua Huang. Layered and Weighted Tree Matching Algorithm for Automatic Web Data Records Recognition, The 10th Web Information System and Application Conference(WISA 2013)p55-60, Nov. 1-3, Yangzhou, China, 2013.

56.    Yi Shen, Shengsheng Shi, Haitao Wang, Wu Wei, Chunfeng Yuan, and Yihua Huang. Parallel Approach and Platform for Large-scale Web Data Extraction. 2013 The First International Conference on Advanced Cloud and Big Data(CBD 2013), Nanjing, Dec. 13-15, 2013.

57.    Wenhui Zhou, Chunfeng Yuan, Rong Gu, Yihua Huang. Large Scale Nearest Neighbors Search Based on Neighborhood Graph. 2013 The First International Conference on Advanced Cloud and Big Data(CBD 2013), Nanjing, Dec. 13-15, 2013.

58.    Jinshuang Yan, Xiaoliang Yang, Rong Gu, Chunfeng Yuan, and Yihua Huang. Performance Optimization for Short MapReduce Job Execution in Hadoop. Proceedings of 2nd International Conference on Cloud and Green Computing and 2nd International Conference on Social Computing and Its Applications(CGC/SCA 2012), p 688-694, 2012

59.    Tao Xie, Shengsheng Shi, Fuliang Quan, Chunfeng Yuan, and Yihua Huang. Research on Complex Structure-Oriented  Accurate Web Information Extraction Rules. Proceedings of the 2010 IEEE International Conference on Progress in Informatics and Computing(PIC 2010), p 312-316, 2010

60.    Xiaoliang Yang, Chunfeng Yuan, Yihua Huang. Parallization of BLAST with MapReduce for long sequence alignment. Proceedings - The 4th International Symposium on Parallel Architectures, Algorithms and Programming(PAAP 2011), p 241-246, 2011

61.    Tao Xiao, Shuai Wang, Chunfeng Yuan, Yihua Huang. PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets. The 4th International Symposium on Parallel Architectures, Algorithms and Programming(PAAP 2011), p 252-257, 2011

62.    Yongzhuang Wei, Shuai Wang, Chunfeng Yuan, and Yihua Huang. Parallelized Near-Duplicate Document Detection Algorithm for Large Scale Chinese Web Pages. Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies(PDCAT 2012), p 523-529, 2012.

63.    Jian Zhang, Chunfeng Yuan, and Yihua Huang. Parallelized Similarity Flooding Algorithm for Processing Large Scale Graph Datasets with MapReduce. Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies(PDCAT 2012), p 184-188, 2012.

64.    Yulong Liu, Shengsheng Shi, Chunfeng Yuan and Yihua Huang, Automated Text Data Extraction based on Unsupervised Small Sample Learning. The 7th Intellegent System and Knowledge Engineering (ISKE 2012), Dec. 15-17, 2012, Beijing. Chapter in book “Foundations and Applications of Intelligent Systems”, Advances in Intelligent Systems and Computing 213, p133-150, Springer-Verlag Berlin Heidelberg, 2013.

65.    Chunfeng Yuan, Yihua Huang, Zhesheng Zhang, Guihai Chen, Wanchun Dou. Improvements on Teaching Methods and Contents for the “Computer Organization and Architecture” Curriculum. Proceedings of International Conference on Scalable Computing and Communications - The 8th International Conference on Embedded Computing, ScalCom-EmbeddedCom 2009, p 560-565, 2009

66.    Jin Yu, Jianxin Yu, Yihua Huang. Design and Implementation of Embedded Networked Intelligent Chinese Checkers Game Software. International Conference on Automatic Control and Artificial Intelligence (ACAI2012), March 24-26,2012, Xiamen, China

67.    Yihua Huang, Tianyun Ni, Lei Zhou and Stanley Su. JXP4BIGI: a generalized, Java XML-based approach for biological information gathering and integration. Bioinformatics. Vol. 19 no. 18. 2003.

68.    Stanley Su, Chunbo Huang, Joachim Hammer, Yihua Huang, Haifei Li, Liu Wang, Youzhong Liu, Charnyote P., Minsso Lee, Herman Lam. An Internet-Based Negotiation Server for E-Commerce. The VLDB (Very Large Data Bases) Journal, Special Issue on E-Services. Vol. 10, 2001.