面向大规模地震数据并行处理高速可扩展通信技术应用研究

2022年 61卷 第No. 5期
阅读:87
查看详情
Application of high-speed extensible communication technology for large scale seismic data processing
(中国石油化工股份有限公司石油物探技术研究院,江苏南京211103)
(Sinopec Geophysical Research Institute,Nanjing 211103,China)
随着地震数据规模的不断增大,在进行数据并行处理时,并行计算通信框架因使用TCP(Transmission Control Protocol)协议存在网络吞吐量低、高时延等性能问题,以及主从并行模式下存在主节点网卡性能瓶颈问题,制约了数据并行处理效率的线性提升,集群节点规模扩展性下降明显。为此,提出采用RoCE(RDMA over Converged Ethernet)协议替换TCP协议、主节点配置高性能100GE网卡的高效并行计算通信优化方案,解决了并行计算框架存在的数据网络传输性能问题及主节点同时收集多个计算节点计算结果数据的网络性能瓶颈问题,实现了高速可扩展技术的大规模地震数据处理通信应用方案,计算节点可快速完成数据通信,提升了大规模地震数据处理计算效率;另外采用UCX(Unified Communication X)技术还提升了应用系统的可移植性与使用的便捷性。逆时偏移处理数据的测试结果表明,对于本次大规模数据的处理,逆时偏移并行计算效率提升了32.8%,效果显著,可缩短大规模地震数据逆时偏移计算的时间,并减少计算能源消耗,因此具有很高的实用价值和经济效益。
The amount of seismic data is continuously increasing.Due to Transmission Control Protocol (TCP),a parallel computing communication framework has performance limitations during data processing,such as low network throughput and high latency.Additionally,the performance bottleneck of the master node network card in the master-slave parallel mode restricts the linear improvement of the efficiency of data parallel processing,and the scalability of cluster nodes decreases significantly.Therefore,a high-efficiency parallel computing communication optimization solution using the RDMA over Converged Ethernet protocol(RoCE protocol)instead of the TCP protocol was proposed,using a master node configured with a high-performance 100GE network card.This solves the data network transmission performance problem,which exists in the parallel computing framework,and the network performance bottleneck problem of the master node,which occurs due to simultaneous operation of multiple calculations nodes.Thus,the proposed solution achieves the application of a large-scale seismic data processing communication scheme with high-speed scalable technology.The calculation node can complete the data communication quickly,which improves the calculation efficiency of large-scale seismic data processing.In addition,the adoption of Unified Communication X(UCX)technology also improves the portability and convenience of the application system.In this study,the reverse time migration processing system was selected to complete the production application.The results show that for large-scale data the parallel calculation efficiency of reverse time migration increased by 32.8%.It can shorten the calculation time of reverse time migration of large-scale seismic data and reduce the calculation energy consumption.Therefore,this method has a high practical value as well as economic benefits.
大规模地震数据处理; 逆时偏移处理; 并行计算; MPI; 通信框架; 计算效率; RoCE; 100GE网卡;
large-scale seismic data processing;; reverse time migration processing;; parallel computing;; MPI;; communication ramework;; calculation efficiency;; RoCE;; 100Gb network card;

基金项目:国家企业创新发展联合基金“海相深层复杂构造成像与多类型储层预测方法”(U19B6003-04)资助。

10.3969/j.issn.1000-1441.2022.05.004