Research on hybrid speech compression coding algorithm based on IP packet splitting and reassembling technology

Application of Electronic Technique

Li Lingyun, Li Xiaoke, Chen Yizhao, Wang Guofa, Wang Hui

The 34th Research Institute of CETC

Abstract: In a special communication network service system, one channel of standard G.729-encoded Voice over Internet Protocol (VoIP) voice data must be transmitted over a 10 kb/s narrowband channel. For this scenario, a hybrid speech compression coding algorithm based on IP packet splitting and reassembling technology is proposed. The algorithm decompresses the G.729-compressed voice data and then compresses it a second time with Advanced Multi-Band Excitation (AMBE). Combined with IP packet splitting and reassembly, it retains the payload of the voice data, removes redundant overhead data, and reduces the bandwidth required for voice transmission. Simulation experiments verify the effectiveness of the method: when the G.729 and AMBE coding rates are 8 kb/s and 2.4 kb/s respectively, the payload length is 20 ms, and the IP packet packing cycle is 8 packets, the average sentence intelligibility is above 85% and the voice signal grade is above level 3 under every optical path state, which meets the requirements of the voice transmission system.

Key words: speech compression coding; G.729; AMBE; IP packet splitting and reassembling; narrowband communication

CLC number: TN912.3  Document code: A  DOI: 10.16157/j.issn.0258-7998.245688
Citation format: Li Lingyun, Li Xiaoke, Chen Yizhao, et al. Research on hybrid speech compression coding algorithm based on IP packet splitting and reassembling technology[J]. Application of Electronic Technique, 2025, 51(2): 70-74.

Introduction

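Speech compression coding is the technique of compressing digitally encoded speech in order to improve transmission efficiency in communication networks and to store speech efficiently. Because modern communication networks must meet the requirements of various special scenarios, such as limited transmission bandwidth and data confidentiality, low-rate speech compression coding has become an important topic in speech research owing to its low bandwidth occupancy, interference resistance, strong confidentiality, and high system capacity.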

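In a certain special communication network service system, one channel of Voice over Internet Protocol (VoIP) voice must be transmitted over a channel whose average communication rate is only 10 kb/s, and the voice coding is required to follow the G.729 standard. Conventional G.729 voice data requires a transmission bandwidth of 34.4 kb/s, so a single speech compression coding technique clearly cannot meet the requirement.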
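The 34.4 kb/s figure can be reproduced with a back-of-the-envelope calculation. The sketch below is an illustration, not the authors' derivation. It assumes one 20 ms G.729 payload per packet (the payload length quoted in the abstract), standard RTP, UDP and IPv4 headers of 12, 8 and 20 bytes, and 26 bytes of Ethernet framing. The source states only the total, so this header breakdown and the function names are assumptions.

```python
# Assumed per-packet overhead in bytes; the source does not give this breakdown.
RTP_HEADER = 12
UDP_HEADER = 8
IPV4_HEADER = 20
ETHERNET_FRAMING = 26  # 8 preamble/SFD + 14 header + 4 FCS (assumption)


def payload_bytes(codec_rate_bps: int, frame_ms: int) -> int:
    """Bytes of codec payload produced in one frame period."""
    bits = codec_rate_bps * frame_ms // 1000
    return bits // 8


def line_rate_bps(codec_rate_bps: int, frame_ms: int,
                  frames_per_packet: int = 1) -> float:
    """Bit rate on the wire when each packet carries `frames_per_packet` frames."""
    payload = payload_bytes(codec_rate_bps, frame_ms) * frames_per_packet
    overhead = RTP_HEADER + UDP_HEADER + IPV4_HEADER + ETHERNET_FRAMING
    packet_bits = (payload + overhead) * 8
    # One packet is sent every frame_ms * frames_per_packet milliseconds.
    return packet_bits * 1000 / (frame_ms * frames_per_packet)


g729_rate = line_rate_bps(8000, 20)
print(f"G.729, one 20 ms frame per packet: {g729_rate / 1000:.1f} kb/s")
```

Under these assumptions each packet carries 20 bytes of speech and 66 bytes of overhead, so more than three quarters of the bandwidth is spent on headers rather than on voice.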

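Researchers have proposed a 0.6 kb/s vocoder algorithm based on Mixed Excitation Linear Prediction (MELP), which combines several consecutive speech frames into a superframe and jointly quantizes the parameters by exploiting their inter-frame correlation; simulations verified that the algorithm yields synthesized speech with high intelligibility and good clarity and naturalness [1-5]. Chang Liang et al. proposed a 0.56 kb/s multi-frame joint multi-mode vector quantization algorithm based on Sinusoidal Excitation Linear Prediction (SELP), obtaining speech close to telephone quality [6]. Huang et al. proposed a matrix quantization scheme and a low-rate vocoder algorithm that achieve high-quality speech over low-rate communication links [7]. Ozaydin et al., considering the characteristics of speech signals in narrowband communication links, designed a low-complexity, efficient Voice Activity Detection (VAD) algorithm based on Conjugate Structure-Algebraic Code Excited Linear Prediction (CS-ACELP), which reduces the average speech communication rate to about 4 kb/s [8]. The rates of these algorithms all fall below 4.6 kb/s, and some reach 0.56 kb/s, so they offer a useful reference; however, none of them uses the G.729 speech coding standard.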

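In view of this, this paper proposes a hybrid speech compression coding algorithm based on IP packet splitting and reassembling technology. While retaining the G.729 standard, it uses Advanced Multi-Band Excitation (AMBE) speech coding to compress and decompress the voice data a second time and, combined with IP packet splitting and reassembly, brings the voice data transmission bit rate to 5.7 kb/s. This effectively prevents overhead data from consuming excessive channel bandwidth and improves the transmission efficiency and quality of the speech payload.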
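The 5.7 kb/s target is consistent with carrying eight 2.4 kb/s AMBE frames of 20 ms each under a single set of packet headers. The following is a minimal sketch of that idea, not the authors' packet format. The 66 bytes of per-packet overhead (RTP, UDP, IPv4 and Ethernet framing) are an assumption, and `bundle_frames`, `split_bundle` and `bundled_rate_bps` are hypothetical helper names introduced only for illustration.

```python
from typing import List

# Assumed overhead per transmitted packet, in bytes (not given in the source).
HEADER_OVERHEAD = 12 + 8 + 20 + 26  # RTP + UDP + IPv4 + Ethernet framing

AMBE_RATE_BPS = 2400
FRAME_MS = 20
FRAMES_PER_PACKET = 8
AMBE_FRAME_BYTES = AMBE_RATE_BPS * FRAME_MS // 1000 // 8  # 48 bits per frame


def bundle_frames(frames: List[bytes],
                  group: int = FRAMES_PER_PACKET) -> List[bytes]:
    """Concatenate every `group` codec frames into one packet payload."""
    return [b"".join(frames[i:i + group]) for i in range(0, len(frames), group)]


def split_bundle(payload: bytes,
                 frame_bytes: int = AMBE_FRAME_BYTES) -> List[bytes]:
    """Recover the individual codec frames from one bundled payload."""
    return [payload[i:i + frame_bytes]
            for i in range(0, len(payload), frame_bytes)]


def bundled_rate_bps(frame_bytes: int, group: int, frame_ms: int) -> float:
    """Bit rate on the wire when `group` frames share one set of headers."""
    packet_bits = (frame_bytes * group + HEADER_OVERHEAD) * 8
    return packet_bits * 1000 / (frame_ms * group)


# Dummy frames stand in for real AMBE output.
frames = [bytes([n]) * AMBE_FRAME_BYTES for n in range(16)]
payloads = bundle_frames(frames)
restored = [f for p in payloads for f in split_bundle(p)]
rate = bundled_rate_bps(AMBE_FRAME_BYTES, FRAMES_PER_PACKET, FRAME_MS)
print(f"{len(payloads)} packets, {rate / 1000:.1f} kb/s")
```

With these numbers the header cost is paid once every 160 ms instead of once every 20 ms, which is where most of the bandwidth saving comes from. The trade-off is that the receiver must wait for a whole bundle before it can play the first frame.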

For the full text of this paper, please download:

http://ihrv.cn/resource/share/2000006328

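Author information:

Li Lingyun, Li Xiaoke, Chen Yizhao, Wang Guofa, Wang Hui

(The 34th Research Institute of China Electronics Technology Group Corporation, Guilin 541004, Guangxi)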

Original content statement: This content is original to the AET website and may not be reproduced without authorization.
