Paper Presentation Schedule

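September 25, Afternoon

Conference Outstanding Paper Presentations

Chair: 侯鑫, Institute of Computing Technology, Chinese Academy of Sciences. 20 minutes per talk.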
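14:00-14:20	Compression and Storage Method for PB-Scale Cryo-Electron Microscopy Data	杨涛, Tsinghua University
14:20-14:40	FUS: FPGA-based Universal Sketch with Homogeneous and Heterogeneous Memory Architectures	廖云坤, University of Chinese Academy of Sciences
14:40-15:00	alphaSPARSELib: A Cross-Platform Sparse BLAS Library for Domestic Processors	郭诚欣, Institute of Computing Technology, Chinese Academy of Sciences
15:00-15:20	Discontinuous Galerkin Hartree-Fock: Predicting Accurate Electronic Structures of Complex Metallic Systems with Millions of Atoms on Exascale Sunway Supercomputer	秦新明, University of Science and Technology of China
15:20-15:40	Learning Global Land Cover Mapping Through a Highly-Scalable Weakly-Supervised Method	郑珏鹏, Sun Yat-sen University
15:40-16:00	Research on a Supercomputing User Job Level Index and Development of an Evaluation Tool	高亦沁, Shanghai Jiao Tong University
16:00-16:20	Multi-Head Attention Optimization on the MT-3000 Processor	路瑶, Beihang University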

September 26, Morning

Paper Presentations: Applications

Chair: 吕志强, Qingdao University. 15 minutes per talk.
9:00-9:15	Investigation on Load balancing strategies for Lattice Boltzmann Method with local grid refinement	Huang Chenxi
9:15-9:30	Flow field prediction based on semi-supervised learning	Wang Xiao
9:30-9:45	Accelerating Molecular Dynamics Simulation with Random Batch Ewald on GPU	Lyu Qizheng
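9:45-10:00	Chat2Matgen: An LLM Service Platform for Materials Science Based on the RAG Framework	邱意
10:00-10:15	A Lagrangian Particle Flow Predictor Based on Equivariant Graph Neural Networks	蒋权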
10:15-10:30	Survey on Deep Learning-based Meteorological Forecasting Models	Wang Yuan
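10:30-10:45	Research on the Current Status and Trends of Supercomputer-Based High-Performance Computing Applications	刘扬
10:45-11:00	Construction and Scheduling Algorithms for Hybrid Workflows in the Cloud	Zhao Ran
11:00-11:15	A Fault-Tolerant Multi-Workflow Scheduling Strategy for WaaS Platforms	Zhi Wentao
11:15-11:30	Impact and Analysis of Optimizers on the Performance of Neural Network Force Fields	李恩吉
11:30-11:45	Implementation and Optimization of a Basecalling Algorithm on the Hygon DCU	薄凯彬
11:45-12:00	A Survey of FaaS Applications in Bioinformatics	Wang Xiaoguang
12:00-12:15	Evaluation of Time-Advancement Methods for Unstructured Heterogeneous CFD	Dai Zhe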
12:15-12:30	GreenB+Tree: An Energy-Efficient B+Tree for MIMD Architectures	Muchun Peng

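Paper Presentations: SoP and Quantum Computing

Chair: 张儒戈, Institute of Computing Technology, Chinese Academy of Sciences. 15 minutes per talk.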
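9:00-9:15	Parallel Domain Decomposition Techniques for Massive Reactor Core Meshes	Dong Lingyu
9:15-9:30	High-Performance Cholesky Factorization on New GPU Architectures with Tensor Cores	ShiLu
9:30-9:45	Predicting Resource Usage on High-Performance Computing Platforms Using ARIMA and LSTM	Li Siqi
9:45-10:00	Porting and Heterogeneous Parallel Optimization of AztecOO for the SW26010-Pro Processor	Xu Jiwei
10:00-10:15	A Job Failure Identification Method for Heterogeneous Supercomputing Platforms Based on Semantic Analysis of Multi-Source Logs	He Hu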
10:15-10:30	The Design and Implementation of AI for Science Computing Platform	Sun Xiang
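10:30-10:45	OpenLM: A Multi-Platform, High-Performance Inference Framework for Large Models	Liu Gao
10:45-11:00	TS3: Energy-Efficiency-First Search for the Optimal Thread Count with Specific Starting-Point Classification	Ma Zhaoyang
11:00-11:15	An Efficient RISC-V Memory Consistency Testing Method Based on Loop Unrolling	胡津涛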
11:15-11:30	TxCocket: An Innovative Solution for Efficient Cross-Node Data Transmission Enabled by CXL-Based Shared Memory	Huang Tao
11:30-11:45	Redex: An Adaptive Learning Index Scheme for Distributed File Systems	Wang Zhenfei
11:45-12:00	PZRT: A high-performance path tracer based on MIMD architecture PEZY-SC3s	Yang Shun
12:00-12:15	Research on the construction and application of knowledge graph in materials science	Yuan Yang
12:15-12:30	An Empirical Study of Error-Free Transformations for Enhancing Mathematical Function Precision	Jie Shen

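September 26, Afternoon

Paper Presentations: Algorithms

Chair: 王子轩, Institute of Computing Technology, Chinese Academy of Sciences. 15 minutes per talk.
14:00-14:15	JRPL: A Locality-Aware Job Execution Time Prediction Algorithm	闫家晨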
14:15-14:30	FLS and ILS: Two New Element-Based Level Scheduling Algorithms to Accelerate Sparse Triangular Solves in the DICCG Method	Ding Haoran
14:30-14:45	A Parallel Acceleration Method for Automatic Color Equalization Algorithm Based on Graphics Processing Units	Zuo Xianyu
14:45-15:00	A Performance Tracing Method for MPI Parallel Programs Based on Spatial and Temporal Similarity Sampling	宣智博
15:00-15:15	GPU Acceleration for DNA Sequence Alignment Algorithm and its Application	Zhong Heming
15:15-15:30	Mixed Precision SpMV on GPUs for Irregular Data with Hierarchical Precision Selection	Xu Jianfei
15:30-15:45	Characteristics Analysis and Runtime Prediction of Jobs in SuperComputer	Hongzhen Yang
15:45-16:00	Optimization of The ParILUT-GPU Algorithm	Yang Shaofeng
16:00-16:15	Optimizing 2D convolution for DCUs	樊文龙
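16:15-16:30	A Novel Hybrid Routing Strategy for Networks-on-Chip	Xiao Canwen
16:30-16:45	A Processor Accuracy Improvement Method Based on Reconstruction of Static and Dynamic Sample Points	Zhong Jiaqing
16:45-17:00	An Optimization Method for Particle Programs on the Domestic Sunway Architecture	陶冉冉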

Paper Presentations: Systems

Chair: 蒋贤蒙, Institute of Computing Technology, Chinese Academy of Sciences. 15 minutes per talk.
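14:00-14:15	GPP: A Parallel and Distributed Programming Framework for Seismic Exploration Applications	Wang Mingjian
14:15-14:30	A High-Throughput, QoS-Guaranteed Inference System for Intel GPUs Based on Latency Prediction	Zhou Yueyuan
14:30-14:45	AdaptiveLLM: Large Language Model Inference Optimization Based on Adaptive Tensor Swapping and Tensor Recomputation	Liang Xuning
14:45-15:00	OpenArray2.0: An Automatic Parallel Computing Framework for Earth System Models	Wang Dong
15:00-15:15	OBCC: A Measurement Method for the Exascale Computing Programming Wall in the Post-Moore Era	张晓哲
15:15-15:30	I/O Performance Analysis and Tools for Large-Scale Parallel Programs on Supercomputing Systems	郭山川
15:30-15:45	AI+HPC: A Survey of Supercomputing System Software and Application Technology Development Driven by “Intelligence+”	Tan Zhengyuan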
15:45-16:00	Agile Verification of Directory-based Cache Coherency for Multi-core Processors in Chiplets Era	Luo Li
16:00-16:15	Optimization of a Multi-Start Algorithm Based on Rejection Sampling and Its Application to Global Minimum Search	Li Rui
16:15-16:30	Optimization of General matrix multiplication on ARMv8 for end-to-side large model	Liu Wenjuan
16:30-16:45	A High-Performance Matrix Transposition for MIMD Architecture Processor	Liang Yaling
16:45-17:00	Implementation and Optimization of a High-Performance GPU-Based FFT Library	Zhenpeng Du