
Kunyi (昆艺) Teachers and Students Care for "the Old and the Young," Safeguarding "Happiness from Dawn to Dusk"

Published 2025-03-05 05:21:02  Source: 龙神马壮网

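According to Guangxi's internet illegal and harmful information reporting channel, on November 14, 2024, messages appeared online alleging that the director of the Beiliu City Education Bureau had driven drunk and had AIDS, along with other rumors.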

[root@server3 AIGC]# nvidia-smi
Mon Jun  3 11:59:36 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.67                 Driver Version: 550.67         CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 4060 Ti     Off |   00000000:02:00.0 Off |                  N/A |
|  0%   34C    P0             27W /  165W |       1MiB /  16380MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+
[root@server3 AIGC]#

Build and install OpenMPI

[root@server3 AIGC]# tar xvf openmpi-4.1.6.tar.gz
[root@server3 openmpi-4.1.6]#
[root@server3 openmpi-4.1.6]# mkdir -p /home/lichao/lib/openmpi
[root@server3 openmpi-4.1.6]# ./configure --prefix=/home/lichao/lib/openmpi -with-cuda=/usr/local/cuda-12.4 -with-nccl=/usr/lib64

Open MPI configuration:
-----------------------
Version: 4.1.6
Build MPI C bindings: yes
Build MPI C++ bindings (deprecated): no
Build MPI Fortran bindings: mpif.h, use mpi
MPI Build Java bindings (experimental): no
Build Open SHMEM support: yes
Debug build: no
Platform file: (none)

Miscellaneous
-----------------------
CUDA support: yes
HWLOC support: internal
Libevent support: internal
Open UCC: no
PMIx support: Internal

Transports
-----------------------
Cisco usNIC: no
Cray uGNI (Gemini/Aries): no
Intel Omnipath (PSM2): no
Intel TrueScale (PSM): no
Mellanox MXM: no
Open UCX: yes
OpenFabrics OFI Libfabric: no
OpenFabrics Verbs: yes
Portals4: no
Shared memory/copy in+copy out: yes
Shared memory/Linux CMA: yes
Shared memory/Linux KNEM: no
Shared memory/XPMEM: no
TCP: yes

Resource Managers
-----------------------
Cray Alps: no
Grid Engine: no
LSF: no
Moab: no
Slurm: yes
ssh/rsh: yes
Torque: no

OMPIO File Systems
-----------------------
DDN Infinite Memory Engine: no
Generic Unix FS: yes
IBM Spectrum Scale/GPFS: no
Lustre: no
PVFS2/OrangeFS: no

[root@server3 openmpi-4.1.6]#

Build and install NCCL-Test

[root@server3 lichao]# cd AIGC/
[root@server3 AIGC]# git clone https://github.com/NVIDIA/nccl-tests.git
[root@server3 AIGC]# cd nccl-tests/
[root@server3 nccl-tests]# make clean
[root@server3 nccl-tests]# make MPI=1 MPI_HOME=/home/lichao/opt/openmpi/ CUDA_HOME=/usr/local/cuda-12.4/ NCCL_HOME=/usr/lib64/

Collective communication performance test method (all_reduce)

[root@server1 lichao]# cat run_nccl-test.sh
/home/lichao/opt/openmpi/bin/mpirun --allow-run-as-root -np 3 -host server1,server2,server3 -mca btl ^openib -x NCCL_DEBUG=INFO -x NCCL_ALGO=ring -x NCCL_IB_DISABLE=0 -x NCCL_IB_GID_INDEX=3 -x NCCL_SOCKET_IFNAME=ens11f1 -x NCCL_IB_HCA=mlx5_1:1 /home/lichao/AIGC/nccl-tests/build/all_reduce_perf -b 128 -e 8G -f 2 -g 1
[root@server1 lichao]# ./run_nccl-test.sh
# nThread 1 nGpus 1 minBytes 128 maxBytes 8589934592 step: 2(factor) warmup iters: 5 iters: 20 agg iters: 1 validation: 1 graph: 0
#
# Using devices
#  Rank  0 Group  0 Pid  18697 on    server1 device  0 [0x02] NVIDIA GeForce RTX 4060 Ti
#  Rank  1 Group  0 Pid  20893 on    server2 device  0 [0x02] NVIDIA GeForce RTX 4060 Ti
#  Rank  2 Group  0 Pid   2458 on    server3 device  0 [0x02] NVIDIA GeForce RTX 4060 Ti
#
# Reducing maxBytes to 5261099008 due to memory limitation
server1:18697:18697 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens11f1
server1:18697:18697 [0] NCCL INFO Bootstrap : Using ens11f1:172.16.0.11
server1:18697:18697 [0] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so)
server1:18697:18697 [0] NCCL INFO NET/Plugin : Plugin load returned 2 : libnccl-net.so: cannot open shared object file: No such file or directory : when loading libnccl-net.so
server1:18697:18697 [0] NCCL INFO NET/Plugin : Using internal network plugin.
server2:20893:20893 [0] NCCL INFO cudaDriverVersion 12040
server2:20893:20893 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens11f1
server2:20893:20893 [0] NCCL INFO Bootstrap : Using ens11f1:172.16.0.12
server2:20893:20893 [0] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so)
server2:20893:20893 [0] NCCL INFO NET/Plugin : Plugin load returned 2 : libnccl-net.so: cannot open shared object file: No such file or directory : when loading libnccl-net.so
server2:20893:20893 [0] NCCL INFO NET/Plugin : Using internal network plugin.
server1:18697:18697 [0] NCCL INFO cudaDriverVersion 12040
NCCL version 2.21.5+cuda12.4
server3:2458:2458 [0] NCCL INFO cudaDriverVersion 12040
server3:2458:2458 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens11f1
server3:2458:2458 [0] NCCL INFO Bootstrap : Using ens11f1:172.16.0.13
server3:2458:2458 [0] NCCL INFO NET/Plugin : No plugin found (libnccl-net.so)
server3:2458:2458 [0] NCCL INFO NET/Plugin : Plugin load returned 2 : libnccl-net.so: cannot open shared object file: No such file or directory : when loading libnccl-net.so
server3:2458:2458 [0] NCCL INFO NET/Plugin : Using internal network plugin.
server2:20893:20907 [0] NCCL INFO NCCL_IB_DISABLE set by environment to 0.
server2:20893:20907 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens11f1
server2:20893:20907 [0] NCCL INFO NCCL_IB_HCA set to mlx5_1:1
server2:20893:20907 [0] NCCL INFO NET/IB : Using [0]mlx5_1:1/RoCE [RO]; OOB ens11f1:172.16.0.12
server2:20893:20907 [0] NCCL INFO Using non-device net plugin version 0
server2:20893:20907 [0] NCCL INFO Using network IB
server3:2458:2473 [0] NCCL INFO NCCL_IB_DISABLE set by environment to 0.
server3:2458:2473 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens11f1
server3:2458:2473 [0] NCCL INFO NCCL_IB_HCA set to mlx5_1:1
server1:18697:18712 [0] NCCL INFO NCCL_IB_DISABLE set by environment to 0.
server1:18697:18712 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens11f1
server3:2458:2473 [0] NCCL INFO NET/IB : Using [0]mlx5_1:1/RoCE [RO]; OOB ens11f1:172.16.0.13
server1:18697:18712 [0] NCCL INFO NCCL_IB_HCA set to mlx5_1:1
server3:2458:2473 [0] NCCL INFO Using non-device net plugin version 0
server3:2458:2473 [0] NCCL INFO Using network IB
server1:18697:18712 [0] NCCL INFO NET/IB : Using [0]mlx5_1:1/RoCE [RO]; OOB ens11f1:172.16.0.11
server1:18697:18712 [0] NCCL INFO Using non-device net plugin version 0
server1:18697:18712 [0] NCCL INFO Using network IB
server1:18697:18712 [0] NCCL INFO ncclCommInitRank comm 0x23622c0 rank 0 nranks 3 cudaDev 0 nvmlDev 0 busId 2000 commId 0x35491327c8228dd0 - Init START
server3:2458:2473 [0] NCCL INFO ncclCommInitRank comm 0x346ffc0 rank 2 nranks 3 cudaDev 0 nvmlDev 0 busId 2000 commId 0x35491327c8228dd0 - Init START
server2:20893:20907 [0] NCCL INFO ncclCommInitRank comm 0x2a1af20 rank 1 nranks 3 cudaDev 0 nvmlDev 0 busId 2000 commId 0x35491327c8228dd0 - Init START
server3:2458:2473 [0] NCCL INFO Setting affinity for GPU 0 to 0f,ff000fff
server2:20893:20907 [0] NCCL INFO Setting affinity for GPU 0 to 0f,ff000fff
server1:18697:18712 [0] NCCL INFO Setting affinity for GPU 0 to 0f,ff000fff
server1:18697:18712 [0] NCCL INFO comm 0x23622c0 rank 0 nRanks 3 nNodes 3 localRanks 1 localRank 0 MNNVL 0
server1:18697:18712 [0] NCCL INFO Channel 00/02 :    0   1   2
server1:18697:18712 [0] NCCL INFO Channel 01/02 :    0   1   2
server1:18697:18712 [0] NCCL INFO Trees [0] 2/-1/-1->0->-1 [1] 2/-1/-1->0->1
server1:18697:18712 [0] NCCL INFO P2P Chunksize set to 131072
server3:2458:2473 [0] NCCL INFO comm 0x346ffc0 rank 2 nRanks 3 nNodes 3 localRanks 1 localRank 0 MNNVL 0
server2:20893:20907 [0] NCCL INFO comm 0x2a1af20 rank 1 nRanks 3 nNodes 3 localRanks 1 localRank 0 MNNVL 0
server3:2458:2473 [0] NCCL INFO Trees [0] 1/-1/-1->2->0 [1] -1/-1/-1->2->0
server3:2458:2473 [0] NCCL INFO P2P Chunksize set to 131072
server2:20893:20907 [0] NCCL INFO Trees [0] -1/-1/-1->1->2 [1] 0/-1/-1->1->-1
server2:20893:20907 [0] NCCL INFO P2P Chunksize set to 131072
server3:2458:2473 [0] NCCL INFO Channel 00/0 : 1[0] -> 2[0] [receive] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Channel 01/0 : 1[0] -> 2[0] [receive] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Channel 00/0 : 2[0] -> 0[0] [send] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Channel 01/0 : 2[0] -> 0[0] [send] via NET/IB/0
server2:20893:20907 [0] NCCL INFO Channel 00/0 : 0[0] -> 1[0] [receive] via NET/IB/0
server2:20893:20907 [0] NCCL INFO Channel 01/0 : 0[0] -> 1[0] [receive] via NET/IB/0
server2:20893:20907 [0] NCCL INFO Channel 00/0 : 1[0] -> 2[0] [send] via NET/IB/0
server2:20893:20907 [0] NCCL INFO Channel 01/0 : 1[0] -> 2[0] [send] via NET/IB/0
server1:18697:18712 [0] NCCL INFO Channel 00/0 : 2[0] -> 0[0] [receive] via NET/IB/0
server1:18697:18712 [0] NCCL INFO Channel 01/0 : 2[0] -> 0[0] [receive] via NET/IB/0
server1:18697:18712 [0] NCCL INFO Channel 00/0 : 0[0] -> 1[0] [send] via NET/IB/0
server1:18697:18712 [0] NCCL INFO Channel 01/0 : 0[0] -> 1[0] [send] via NET/IB/0
server3:2458:2475 [0] NCCL INFO NCCL_IB_GID_INDEX set by environment to 3.
server1:18697:18714 [0] NCCL INFO NCCL_IB_GID_INDEX set by environment to 3.
server2:20893:20909 [0] NCCL INFO NCCL_IB_GID_INDEX set by environment to 3.
server1:18697:18712 [0] NCCL INFO Connected all rings
server1:18697:18712 [0] NCCL INFO Channel 01/0 : 1[0] -> 0[0] [receive] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Connected all rings
server2:20893:20907 [0] NCCL INFO Connected all rings
server1:18697:18712 [0] NCCL INFO Channel 00/0 : 0[0] -> 2[0] [send] via NET/IB/0
server2:20893:20907 [0] NCCL INFO Channel 00/0 : 2[0] -> 1[0] [receive] via NET/IB/0
server1:18697:18712 [0] NCCL INFO Channel 01/0 : 0[0] -> 2[0] [send] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Channel 00/0 : 0[0] -> 2[0] [receive] via NET/IB/0
server2:20893:20907 [0] NCCL INFO Channel 01/0 : 1[0] -> 0[0] [send] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Channel 01/0 : 0[0] -> 2[0] [receive] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Channel 00/0 : 2[0] -> 1[0] [send] via NET/IB/0
server3:2458:2473 [0] NCCL INFO Connected all trees
server1:18697:18712 [0] NCCL INFO Connected all trees
server1:18697:18712 [0] NCCL INFO NCCL_ALGO set by environment to ring
server3:2458:2473 [0] NCCL INFO NCCL_ALGO set by environment to ring
server3:2458:2473 [0] NCCL INFO threadThresholds 8/8/64 | 24/8/64 | 512 | 512
server3:2458:2473 [0] NCCL INFO 2 coll channels, 2 collnet channels, 0 nvls channels, 2 p2p channels, 2 p2p channels per peer
server2:20893:20907 [0] NCCL INFO Connected all trees
server2:20893:20907 [0] NCCL INFO NCCL_ALGO set by environment to ring
server2:20893:20907 [0] NCCL INFO threadThresholds 8/8/64 | 24/8/64 | 512 | 512
server2:20893:20907 [0] NCCL INFO 2 coll channels, 2 collnet channels, 0 nvls channels, 2 p2p channels, 2 p2p channels per peer
server1:18697:18712 [0] NCCL INFO threadThresholds 8/8/64 | 24/8/64 | 512 | 512
server1:18697:18712 [0] NCCL INFO 2 coll channels, 2 collnet channels, 0 nvls channels, 2 p2p channels, 2 p2p channels per peer
server2:20893:20907 [0] NCCL INFO TUNER/Plugin: Plugin load returned 11 : libnccl-net.so: cannot open shared object file: No such file or directory : when loading libnccl-tuner.so
server2:20893:20907 [0] NCCL INFO TUNER/Plugin: Using internal tuner plugin.
server2:20893:20907 [0] NCCL INFO ncclCommInitRank comm 0x2a1af20 rank 1 nranks 3 cudaDev 0 nvmlDev 0 busId 2000 commId 0x35491327c8228dd0 - Init COMPLETE
server3:2458:2473 [0] NCCL INFO TUNER/Plugin: Plugin load returned 11 : libnccl-net.so: cannot open shared object file: No such file or directory : when loading libnccl-tuner.so
server3:2458:2473 [0] NCCL INFO TUNER/Plugin: Using internal tuner plugin.
server3:2458:2473 [0] NCCL INFO ncclCommInitRank comm 0x346ffc0 rank 2 nranks 3 cudaDev 0 nvmlDev 0 busId 2000 commId 0x35491327c8228dd0 - Init COMPLETE
server1:18697:18712 [0] NCCL INFO TUNER/Plugin: Plugin load returned 11 : libnccl-net.so: cannot open shared object file: No such file or directory : when loading libnccl-tuner.so
server1:18697:18712 [0] NCCL INFO TUNER/Plugin: Using internal tuner plugin.
server1:18697:18712 [0] NCCL INFO ncclCommInitRank comm 0x23622c0 rank 0 nranks 3 cudaDev 0 nvmlDev 0 busId 2000 commId 0x35491327c8228dd0 - Init COMPLETE
#
#                                                              out-of-place                      in-place
#       size         count      type   redop    root     time   algbw   busbw #wrong     time   algbw   busbw #wrong
#        (B)    (elements)                               (us)  (GB/s)  (GB/s)            (us)  (GB/s)  (GB/s)
         128            32     float     sum      -1    28.39    0.00    0.01      0    27.35    0.00    0.01      0
         256            64     float     sum      -1    29.44    0.01    0.01      0    28.54    0.01    0.01      0
         512           128     float     sum      -1    29.99    0.02    0.02      0    29.66    0.02    0.02      0
        1024           256     float     sum      -1    32.89    0.03    0.04      0    30.64    0.03    0.04      0
        2048           512     float     sum      -1    34.81    0.06    0.08      0    31.87    0.06    0.09      0
        4096          1024     float     sum      -1    37.32    0.11    0.15      0    36.09    0.11    0.15      0
        8192          2048     float     sum      -1    45.11    0.18    0.24      0    43.12    0.19    0.25      0
       16384          4096     float     sum      -1    57.92    0.28    0.38      0    56.98    0.29    0.38      0
       32768          8192     float     sum      -1    72.68    0.45    0.60      0    70.79    0.46    0.62      0
       65536         16384     float     sum      -1    95.77    0.68    0.91      0    93.73    0.70    0.93      0
      131072         32768     float     sum      -1    162.7    0.81    1.07      0    161.5    0.81    1.08      0
      262144         65536     float     sum      -1    177.3    1.48    1.97      0    177.4    1.48    1.97      0
      524288        131072     float     sum      -1    301.4    1.74    2.32      0    302.0    1.74    2.31      0
     1048576        262144     float     sum      -1    557.9    1.88    2.51      0    559.2    1.88    2.50      0
     2097152        524288     float     sum      -1   1089.8    1.92    2.57      0   1092.2    1.92    2.56      0
     4194304       1048576     float     sum      -1   2165.7    1.94    2.58      0   2166.6    1.94    2.58      0
     8388608       2097152     float     sum      -1   4315.7    1.94    2.59      0   4316.1    1.94    2.59      0
    16777216       4194304     float     sum      -1   8528.8    1.97    2.62      0   8529.3    1.97    2.62      0
    33554432       8388608     float     sum      -1    16622    2.02    2.69      0    16610    2.02    2.69      0
    67108864      16777216     float     sum      -1    32602    2.06    2.74      0    32542    2.06    2.75      0
   134217728      33554432     float     sum      -1    63946    2.10    2.80      0    63831    2.10    2.80      0
   268435456      67108864     float     sum      -1   126529    2.12    2.83      0   126412    2.12    2.83      0
   536870912     134217728     float     sum      -1   251599    2.13    2.85      0   251327    2.14    2.85      0
  1073741824     268435456     float     sum      -1   500664    2.14    2.86      0   501911    2.14    2.85      0
  2147483648     536870912     float     sum      -1  1001415    2.14    2.86      0  1000178    2.15    2.86      0
  4294967296    1073741824     float     sum      -1  1999361    2.15    2.86      0  1997380    2.15    2.87      0
server1:18697:18697 [0] NCCL INFO comm 0x23622c0 rank 0 nranks 3 cudaDev 0 busId 2000 - Destroy COMPLETE
server2:20893:20893 [0] NCCL INFO comm 0x2a1af20 rank 1 nranks 3 cudaDev 0 busId 2000 - Destroy COMPLETE
server3:2458:2458 [0] NCCL INFO comm 0x346ffc0 rank 2 nranks 3 cudaDev 0 busId 2000 - Destroy COMPLETE
# Out of bounds values : 0 OK
# Avg bus bandwidth    : 1.66163
#
[root@server1 lichao]#

Result details

- size (B): the size of the data processed by the operation, in bytes.

Likewise, once installation is complete, a few environment variables need to be configured (using the mirror site hf-mirror.com) to work around network problems.


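[root@server3 LLaMA-Factory-0.8.3]# tree -h data/
data/
├── [841K]  alpaca_en_demo.json
├── [621K]  alpaca_zh_demo.json
├── [  32]  belle_multiturn
│   └── [2.7K]  belle_multiturn.py
├── [733K]  c4_demo.json
├── [ 13K]  dataset_info.json
├── [1.5M]  dpo_en_demo.json
├── [833K]  dpo_zh_demo.json
├── [722K]  glaive_toolcall_en_demo.json
├── [665K]  glaive_toolcall_zh_demo.json
├── [  27]  hh_rlhf_en
│   └── [3.3K]  hh_rlhf_en.py
├── [ 20K]  identity.json
├── [892K]  kto_en_demo.json
├── [  45]  mllm_demo_data
│   ├── [ 12K]  1.jpg
│   ├── [ 22K]  2.jpg
│   └── [ 16K]  3.jpg
├── [3.1K]  mllm_demo.json
├── [9.8K]  README.md
├── [9.2K]  README_zh.md
├── [  27]  ultra_chat
│   └── [2.3K]  ultra_chat.py
└── [1004K]  wiki_demo.txt

4 directories, 20 files
[root@server3 LLaMA-Factory-0.8.3]#

Using the prepared model and datasets, run a training test on a single machine. LLaMA-Factory supports fine-tuning large language models through a WebUI.

Figure 7: Schematic of the experiment topology and basic configuration

Software preparation

With the steps above, the hardware selection is complete. Next comes the software-level configuration: deploying the RoCEv2 switches, configuring the GPU servers, and installing the GPU driver and the collective communication library.

GPU model: NVIDIA GeForce RTX 4060 Ti 16GB
Pretrained model: Qwen/Qwen1.5-0.5B-Chat
Datasets: identity, alpaca_zh_demo

# Make sure you have git-lfs installed (https://git-lfs.com)
git lfs install
git clone https://hf-mirror.com/Qwen/Qwen1.5-0.5B-Chat
# If you want to clone without large files - just their pointers
GIT_LFS_SKIP_SMUDGE=1 git clone https://hf-mirror.com/Qwen/Qwen1.5-0.5B-Chat

Because of network problems, downloading directly from the command line is difficult. The pretrained model data is therefore pulled from the domestic mirror of Hugging Face, with the GIT_LFS_SKIP_SMUDGE=1 variable set to skip the large files, which are then downloaded manually and uploaded afterwards.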


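Once the compute requirements have been determined from the computing tasks, and with them the GPU model and quantity required, we can go on to plan the networking of the whole GPU cluster. Of the other two management networks, the service management network interconnects the GPU servers and carries AIOS management-plane communication, while the out-of-band management network connects every device in the intelligent computing center and is used for operations and maintenance access.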


Example:

#define enqueue(queue, data) enqueue_bytes(queue, &data, sizeof(typeof(data)))

In the macro above, typeof(data) infers the type of data, and sizeof(typeof(data)) then gives the number of bytes that type occupies.

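2.2 How function overloading is implemented

Question: in languages such as C++, function overloading lets you define several functions with the same name that differ in the types or number of their parameters.

During the meeting, Wang Zhihou chaired a symposium for the people's congress deputies attending the Standing Committee session as non-voting participants, and heard their opinions and suggestions on practicing whole-process people's democracy and on the system through which leaders of the Municipal People's Congress Standing Committee keep in contact with deputies.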

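The first plenary session heard the Municipal People's Congress Legislative Affairs Committee's reports on the results of deliberation on the Changchun Regulations on the Protection of the Historic and Cultural City (draft), the Changchun Rural Water Supply Regulations (draft), and the Changchun Regulations on the Management of Social Emergency Medical Care (revised draft). It heard the report of the Standing Committee's law enforcement inspection group on the implementation of the Changchun Regulations on the Protection of Drinking Water Sources. It also heard the municipal government's reports on optimizing the city's business environment, building grain production capacity, building science and technology industrial parks, the city's ethnic affairs work, and the city's employment work, as well as the Municipal Intermediate People's Court's report on administrative trial work and the Municipal People's Procuratorate's report on administrative procuratorial work. After the first plenary session, the members of the Standing Committee deliberated in groups on the items considered at that session.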

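Just now, the Long March 7 Y9 carrier rocket carrying the Tianzhou-8 cargo spacecraft ignited and lifted off from the Wenchang Space Launch Site in China.

File photo: Yang Jian, former governor of Dali Prefecture

One day earlier, on November 14, the Yunnan Provincial Commission for Discipline Inspection and Supervision announced that Yang Jian, former governor of Dali Prefecture, had been placed under investigation, and that Yang Kun, vice governor and head of the public security bureau, had voluntarily turned himself in.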
