智能论文笔记

DeepJSCC-Q: Channel Input Constrained Deep Joint Source-Channel Coding

Tze-Yang Tung , David Burth Kurka , Mikolaj Jankowski , Deniz Gündüz

分类：机器学习

2021-11-25

最近的作品表明，可以通过使用机器学习技术来学习图像的无线传输的任务。已经通过训练了自动化器，非常有前沿图像质量，优于利用源和信道编码分离的流行数字方案，以具有中间的不可培训的沟道层，优于利用源和信道编码分离。然而，这些方法假设可以通过信道传输任何复数，这可以防止硬件或协议只能承认某些信道输入的场景中的算法，例如使用数字星座的使用。这里，我们提出了DeepJSCC-Q，用于无线图像传输的端到端优化的联合源信道编码方案，其能够用固定信道输入字母操作。我们表明DeepJSCC-Q可以对使用连续值通道输入的模型来实现类似的性能。重要的是，在信道条件恶化的情况下，保留在现有工作中观察到的图像质量的正常劣化，使DeepJSCC-Q在实际系统中部署更具吸引力。

translated by 谷歌翻译

DeepJSCC-Q: Constellation Constrained Deep Joint Source-Channel Coding

Tze-Yang Tung , David Burth Kurka , Mikolaj Jankowski , Deniz Gunduz

分类：机器学习

2022-06-16

最近的作品表明，现代机器学习技术可以为长期存在的联合源通道编码（JSCC）问题提供另一种方法。非常有希望的初始结果，优于使用单独的源代码和通道代码的流行数字方案，已被证明用于使用深神经网络（DNNS）的无线图像和视频传输。但是，此类方案的端到端培训需要可区分的通道输入表示。因此，先前的工作假设可以通过通道传输任何复杂值。这可以防止在硬件或协议只能接收数字星座规定的某些频道输入集的情况下应用这些代码。本文中，我们建议使用有限通道输入字母的端到端优化的JSCC解决方案DeepJSCC-Q。我们表明，DEEPJSCC-Q可以实现与允许任何复杂的有价值通道输入的先前作品相似的性能，尤其是在可用的高调制订单时，并且在调制顺序增加的情况下，性能渐近接近无约束通道输入的情况。重要的是，DEEPJSCC-Q保留了不可预测的渠道条件下图像质量的优雅降级，这是在频道迅速变化的移动系统中部署的理想属性。

translated by 谷歌翻译

DeepWiVe: Deep-Learning-Aided Wireless Video Transmission

Tze-Yang Tung , Deniz Gündüz

分类：机器学习

2021-11-25

我们呈现深度，第一端到端联合源通道编码（JSCC）视频传输方案，其利用深神经网络（DNN）的力量直接将视频信号映射到信道符号，组合视频压缩，信道编码并且调制步骤进入单个神经变换。我们的DNN解码器预测无失真反馈的残差，这通过占闭塞/脱离和相机运动来提高视频质量。我们同时培训不同的带宽分配网络，以允许变量带宽传输。然后，我们使用强化学习（RL）训练带宽分配网络，该钢筋学习（RL）优化视频帧之间的有限可用信道带宽的分配，以最大限度地提高整体视觉质量。我们的研究结果表明，深度可以克服悬崖效应，这在传统的分离的数字通信方案中普遍存在，并在估计和实际信道质量之间取得不匹配来实现优雅的降级。 DeepWive优于H.264视频压缩，然后在所有信道条件下的低密度奇偶校验（LDPC）代码在多尺度结构相似性指数（MS-SSIM）方面平均达到0.0462，同时跳动H.265 + LDPC平均高达0.0058。我们还说明了通过显示我们的最佳带宽分配策略优于NA \“IVE统一分配来优化JSCC视频传输中的带宽分配的重要性。我们相信这是实现端到端潜力的重要一步优化的JSCC无线视频传输系统优于当前的基于分离的设计。

translated by 谷歌翻译

Nonlinear Transform Source-Channel Coding for Semantic Communications

Jincheng Dai , Sixian Wang , Kailin Tan , Zhongwei Si , Xiaoqi Qin , Kai Niu , Ping Zhang

分类：计算机视觉 | 机器学习

2021-12-21

在本文中，我们提出了一类新的高效的深源通道编码方法，可以在非线性变换下的源分布下，可以在名称非线性变换源通道编码（NTSCC）下收集。在所考虑的模型中，发射器首先了解非线性分析变换以将源数据映射到潜伏空间中，然后通过深关节源通道编码将潜在的表示发送到接收器。我们的模型在有效提取源语义特征并提供源通道编码的侧面信息之前，我们的模型包括强度。与现有的传统深度联合源通道编码方法不同，所提出的NTSCC基本上学习源潜像和熵模型，作为先前的潜在表示。因此，开发了新的自适应速率传输和高辅助辅助编解码器改进机制以升级深关节源通道编码。整个系统设计被制定为优化问题，其目标是最小化建立感知质量指标下的端到端传输率失真性能。在简单的示例源和测试图像源上，我们发现所提出的NTSCC传输方法通常优于使用标准的深关节源通道编码和基于经典分离的数字传输的模拟传输。值得注意的是，由于其剧烈的内容感知能力，所提出的NTSCC方法可能会支持未来的语义通信。

translated by 谷歌翻译

Adaptive Semantic Communications: Overfitting the Source and Channel for Profit

Jincheng Dai , Sixian Wang , Ke Yang , Kailin Tan , Xiaoqi Qin , Zhongwei Si , Kai Niu , Ping Zhang

分类：机器学习

2022-11-08

Most semantic communication systems leverage deep learning models to provide end-to-end transmission performance surpassing the established source and channel coding approaches. While, so far, research has mainly focused on architecture and model improvements, but such a model trained over a full dataset and ergodic channel responses is unlikely to be optimal for every test instance. Due to limitations on the model capacity and imperfect optimization and generalization, such learned models will be suboptimal especially when the testing data distribution or channel response is different from that in the training phase, as is likely to be the case in practice. To tackle this, in this paper, we propose a novel semantic communication paradigm by leveraging the deep learning model's overfitting property. Our model can for instance be updated after deployment, which can further lead to substantial gains in terms of the transmission rate-distortion (RD) performance. This new system is named adaptive semantic communication (ASC). In our ASC system, the ingredients of wireless transmitted stream include both the semantic representations of source data and the adapted decoder model parameters. Specifically, we take the overfitting concept to the extreme, proposing a series of ingenious methods to adapt the semantic codec or representations to an individual data or channel state instance. The whole ASC system design is formulated as an optimization problem whose goal is to minimize the loss function that is a tripartite tradeoff among the data rate, model rate, and distortion terms. The experiments (including user study) verify the effectiveness and efficiency of our ASC system. Notably, the substantial gain of our overfitted coding paradigm can catalyze semantic communication upgrading to a new era.

translated by 谷歌翻译

AirNet: Neural Network Transmission over the Air

Mikolaj Jankowski , Deniz Gunduz , Krystian Mikolajczyk

分类：计算机视觉 | 机器学习

2021-05-24

State-of-the-art performance for many emerging edge applications is achieved by deep neural networks (DNNs). Often, these DNNs are location and time sensitive, and the parameters of a specific DNN must be delivered from an edge server to the edge device rapidly and efficiently to carry out time-sensitive inference tasks. In this paper, we introduce AirNet, a novel training and transmission method that allows efficient wireless delivery of DNNs under stringent transmit power and latency constraints. We first train the DNN with noise injection to counter the wireless channel noise. Then we employ pruning to reduce the network size to the available channel bandwidth, and perform knowledge distillation from a larger model to achieve satisfactory performance, despite pruning. We show that AirNet achieves significantly higher test accuracy compared to digital alternatives under the same bandwidth and power constraints. The accuracy of the network at the receiver also exhibits graceful degradation with channel quality, which reduces the requirement for accurate channel estimation. We further improve the performance of AirNet by pruning the network below the available bandwidth, and using channel expansion to provide better robustness against channel noise. We also benefit from unequal error protection (UEP) by selectively expanding more important layers of the network. Finally, we develop an ensemble training approach, which trains a whole spectrum of DNNs, each of which can be used at different channel condition, resolving the impractical memory requirements.

translated by 谷歌翻译

SVD-Embedded Deep Autoencoder for MIMO Communications

Xinliang Zhang , Mojtaba Vaezi , Timothy J. O'Shea

分类：机器学习

2021-11-03

在多输入多输出（MIMO）系统中使用深度自动码器（DAE）进行端到端通信，是一种具有重要潜力的新概念。在误码率（BER）方面，已示出DAE-ADED MIMO以占地识别的奇异值分解（SVD）为基础的预编码MIMO。本文提出将信道矩阵的左右奇异矢量嵌入到DAE编码器和解码器中，以进一步提高MIMO空间复用的性能。 SVD嵌入式DAE主要优于BER的理论线性预编码。这是显着的，因为它表明所提出的DAES通过将通信系统视为单个端到端优化块来超出当前系统设计的极限。基于仿真结果，在SNR = 10dB，所提出的SVD嵌入式设计可以实现近10美元，并将BER减少至少10次，而没有SVD，相比增长了18倍的增长率最高18倍具有理论线性预编码。我们将这一点归因于所提出的DAE可以将输入和输出与具有有限字母输入的自适应调制结构匹配。我们还观察到添加到DAE的剩余连接进一步提高了性能。

translated by 谷歌翻译

AttentionCode: Ultra-Reliable Feedback Codes for Short-Packet Communications

Yulin Shao , Emre Ozfatura , Alberto Perotti , Branislav Popovic , Deniz Gunduz

分类：机器学习

2022-05-30

Ultra-reliable short-packet communication is a major challenge in future wireless networks with critical applications. To achieve ultra-reliable communications beyond 99.999%, this paper envisions a new interaction-based communication paradigm that exploits feedback from the receiver. We present AttentionCode, a new class of feedback codes leveraging deep learning (DL) technologies. The underpinnings of AttentionCode are three architectural innovations: AttentionNet, input restructuring, and adaptation to fading channels, accompanied by several training methods, including large-batch training, distributed learning, look-ahead optimizer, training-test signal-to-noise ratio (SNR) mismatch, and curriculum learning. The training methods can potentially be generalized to other wireless communication applications with machine learning. Numerical experiments verify that AttentionCode establishes a new state of the art among all DL-based feedback codes in both additive white Gaussian noise (AWGN) channels and fading channels. In AWGN channels with noiseless feedback, for example, AttentionCode achieves a block error rate (BLER) of $10^{-7}$ when the forward channel SNR is 0 dB for a block size of 50 bits, demonstrating the potential of AttentionCode to provide ultra-reliable short-packet communications.

translated by 谷歌翻译

Semantic Communications with Discrete-time Analog Transmission: A PAPR Perspective

Yulin Shao , Deniz Gunduz

分类：机器学习

2022-08-17

基于深度学习（DL）的联合源通道编码（DEEPJSCC）的最新进展导致了语义通信的新范式。基于DEEPJSCC的语义通信的两个显着特征是直接从源信号中对语义感知功能的开发以及这些功能的离散时间模拟传输（DTAT）。与传统的数字通信相比，与DEEPJSCC的语义通信在接收器上提供了出色的重建性能，并具有较高的频道质量降解，但在传输信号中也表现出较大的峰值功率比（PAPR）。一个空旷的问题是，DeepJSCC的收益是否来自高PAPR连续振幅信号带来的额外自由。在本文中，我们通过在图像传输的应用中探索三种PAPR还原技术来解决这个问题。我们确认，基于DEEPJSCC的语义通信的出色图像重建性能可以保留，而传输的PAPR被抑制至可接受的水平。该观察是在实用语义通信系统中实施DEEPJSCC的重要一步。

translated by 谷歌翻译

All you need is feedback: Communication with block attention feedback codes

Emre Ozfatura , Yulin Shao , Alberto Perotti , Branislav Popovic , Deniz Gunduz

分类：人工智能 | 机器学习

2022-06-19

基于深度学习的渠道代码设计最近引起了人们的兴趣，可以替代传统的编码算法，尤其是对于现有代码不提供有效解决方案的渠道。通过反馈渠道进行的沟通就是一个这样的问题，最近通过采用各种深度学习体系结构来获得有希望的结果。在本文中，我们为反馈渠道介绍了一种新颖的学习辅助代码设计，称为广义块注意反馈（GBAF）代码，i）使用模块化体系结构，可以使用不同的神经网络体系结构实现；ii）与现有设计相比，错误的可能性提高了误顺序；iii）可以以所需的代码速率传输。

translated by 谷歌翻译

Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications

Deniz Gunduz , Zhijin Qin , Inaki Estella Aguerri , Harpreet S. Dhillon , Zhaohui Yang , Aylin Yener , Kai Kit Wong , Chan-Byoung Chae

分类：人工智能 | 机器学习

2022-07-19

迄今为止，通信系统主要旨在可靠地交流位序列。这种方法提供了有效的工程设计，这些设计对消息的含义或消息交换所旨在实现的目标不可知。但是，下一代系统可以通过将消息语义和沟通目标折叠到其设计中来丰富。此外，可以使这些系统了解进行交流交流的环境，从而为新颖的设计见解提供途径。本教程总结了迄今为止的努力，从早期改编，语义意识和以任务为导向的通信开始，涵盖了基础，算法和潜在的实现。重点是利用信息理论提供基础的方法，以及学习在语义和任务感知通信中的重要作用。

translated by 谷歌翻译

Robust Semantic Communications with Masked VQ-VAE Enabled Codebook

Qiyu Hu , Guangyi Zhang , Zhijin Qin , Yunlong Cai , Guanding Yu , Geoffrey Ye Li

分类：机器学习

2022-06-08

尽管语义通信对大量任务表现出令人满意的性能，但语义噪声和系统的鲁棒性的影响尚未得到很好的研究。语义噪声是指预期的语义符号和接收到的语义符号之间的误导性，从而导致任务失败。在本文中，我们首先提出了一个框架，用于稳健的端到端语义通信系统来对抗语义噪声。特别是，我们分析了样品依赖性和样本无关的语义噪声。为了打击语义噪声，开发了具有重量扰动的对抗训练，以在训练数据集中纳入带有语义噪声的样品。然后，我们建议掩盖一部分输入，在该输入中，语义噪声经常出现，并通过噪声相关的掩蔽策略设计蒙版vector量化量化的量化自动编码器（VQ-VAE）。我们使用发射器共享的离散代码簿和接收器用于编码功能表示。为了进一步提高系统鲁棒性，我们开发了一个功能重要性模块（FIM），以抑制与噪声相关和任务无关的功能。因此，发射器只需要在代码簿中传输这些重要的任务相关功能的索引即可。仿真结果表明，所提出的方法可以应用于许多下游任务，并显着提高针对语义噪声的鲁棒性，并显着减少了传输开销。

translated by 谷歌翻译

Semantic optical fiber communication system

Zhenming Yu , Hongyu Huang , Liming Cheng , Wei Zhang , Yueqiu Mu , Kun Xu

分类：人工智能

2022-12-27

The current optical communication systems minimize bit or symbol errors without considering the semantic meaning behind digital bits, thus transmitting a lot of unnecessary information. We propose and experimentally demonstrate a semantic optical fiber communication (SOFC) system. Instead of encoding information into bits for transmission, semantic information is extracted from the source using deep learning. The generated semantic symbols are then directly transmitted through an optical fiber. Compared with the bit-based structure, the SOFC system achieved higher information compression and a more stable performance, especially in the low received optical power regime, and enhanced the robustness against optical link impairments. This work introduces an intelligent optical communication system at the human analytical thinking level, which is a significant step toward a breakthrough in the current optical communication architecture.

translated by 谷歌翻译

Robust Information Bottleneck for Task-Oriented Communication with Digital Modulation

Songjie Xie , Youlong Wu , Shuai Ma , Ming Ding , Yuanming Shi , Mingjian Tang

分类：机器学习

2022-09-21

以任务为导向的通信，主要是使用基于学习的联合源通道编码（JSCC），旨在通过将与任务相关的信息传输到接收方来设计通信有效的边缘推理系统。但是，只有在不引入任何冗余的情况下传输与任务相关的信息可能会导致由于渠道变化引起的学习鲁棒性问题，而JSCC将源数据直接映射到连续的通道输入符号中会对现有数字通信系统提出兼容性问题。在本文中，我们通过首先调查编码表示形式的信息性与接收到的信息失真的鲁棒性之间的固有权衡解决这两个问题，然后提出一种具有任务调制的导向的通信方案，名为Inveete Task-定向的JSCC（DT-JSCC），其中发射器将功能编码为离散表示形式，并使用数字调制方案将其传输到接收器。在DT-JSCC方案中，我们开发了一个可靠的编码框架，称为强大的信息瓶颈（rib），以改善对信道变化的稳健性，并使用变量近似来得出肋骨目标的可拖动变异上限，以克服克服相互信息的计算棘手性。实验结果表明，所提出的DT-JSCC比具有低通信延迟的基线方法更好的推理性能更好，并且由于施加的肋骨框架而表现出对通道变化的鲁棒性。

translated by 谷歌翻译

Vector Quantized Semantic Communication System

Qifan Fu , Huiqiang Xie , Zhijin Qin , Gregory Slabaugh , Xiaoming Tao

分类：计算机视觉

2022-09-23

尽管模拟语义通信系统在文献中受到了很大的关注，但在数字语义通信系统上的工作较少。在本文中，我们开发了一个深度学习（DL）启用的矢量量化（VQ）语义通信系统，用于图像传输，名为VQ-Deepsc。具体而言，我们提出了一个基于卷积的神经网络（CNN）的收发器来提取图像的多尺度语义特征，并引入多尺度语义嵌入空间以执行语义特征量化，从而使数据与数字通信系统兼容。此外，我们通过引入Patchgan歧视者来采用对抗训练来提高接收图像的质量。实验结果表明，根据SSIM，所提出的VQ-Deepsc优于传统图像传输方法。

translated by 谷歌翻译

Learning Based Joint Coding-Modulation for Digital Semantic Communication Systems

Yufei Bo , Yiheng Duan , Shuo Shao , Meixia Tao

分类：机器学习

2022-08-11

在基于学习的语义沟通中，神经网络在传统通信系统中取代了不同的构件。但是，数字调制仍然是神经网络的挑战。基于神经网络的数字调制的固有机制是将神经网络编码器的连续输出映射到离散的星座符号中，这是一个不可差的函数，无法使用现有的梯度下降算法进行训练。为了克服这一挑战，在本文中，我们为使用BPSK调制的数字语义通信制定了联合编码调节方案。在我们的方法中，神经网络输出了每个星座点的可能性，而不是具有混凝土映射。因此，使用了随机代码而不是确定性代码，该代码在每个星座上都有可能的符号保留更多信息。联合编码调制设计可以与频道状态相匹配，从而提高数字语义通信的性能。实验结果表明，我们的方法在广泛的SNR上优于语义通信中现有的数字调制方法，并且低SNR制度中的基于神经网络的模拟调制方法优于基于神经网络的模拟调制方法。

translated by 谷歌翻译

Few-Shot Domain Adaptation For End-to-End Communication

Jayaram Raghuram , Yijing Zeng , Dolores García Martí , Rafael Ruiz Ortiz , Somesh Jha , Joerg Widmer , Suman Banerjee

分类：机器学习 | (统计)机器学习

2021-08-02

最近，使用自动编码器（由使用神经网络建模的编码器，渠道和解码器组成）的通信系统的端到端学习问题最近被证明是一种有希望的方法。实际采用这种学习方法面临的挑战是，在变化的渠道条件（例如无线链接）下，它需要经常对自动编码器进行重新训练，以保持低解码错误率。由于重新培训既耗时又需要大量样本，因此当通道分布迅速变化时，它变得不切实际。我们建议使用不更改编码器和解码器网络的快速和样本（几射击）域的适应方法来解决此问题。不同于常规的训练时间无监督或半监督域的适应性，在这里，我们有一个训练有素的自动编码器，来自源分布，我们希望（在测试时间）使用仅使用一个小标记的数据集和无标记的数据来适应（测试时间）到目标分布。我们的方法着重于基于高斯混合物网络的通道模型，并根据类和组件条件仿射变换制定其适应性。学习的仿射转换用于设计解码器的最佳输入转换以补偿分布变化，并有效地呈现在接近源分布的解码器输入中。在实际MMWAVE FPGA设置以及无线设置共有的许多模拟分布变化上，使用非常少量的目标域样本来证明我们方法在适应时的有效性。

translated by 谷歌翻译

Semantic Information Retrieval in Wireless Networks

Edgar Beck , Carsten Bockelmann , Armin Dekorsy

分类：人工智能 | 机器学习 | (统计)机器学习

2022-04-28

Motivated by recent success of Machine Learning (ML) tools in wireless communications, the idea of semantic communication by Weaver from 1949 has received considerable attention. It breaks with the classic design paradigm of Shannon by aiming to transmit the meaning of a message, i.e., semantics, rather than its exact copy and thus allows for savings in channel uses or information rate. In this work, we extend the fundamental approach from Basu et al. for modeling semantics from logical to probabilistic entailment relations between meaning and messages. Thus, we model semantics by means of a hidden random variable and define the task of semantic communication as transmission of messages over a communication channel such that semantics is best preserved. We formulate the semantic communication design either as an Information Maximization or as an Information Bottleneck optimization problem. Finally, we propose the ML-based semantic communication system SINFONI for a distributed multipoint scenario: SINFONI communicates the meaning behind multiple messages that are observed at different senders to a single receiver for semantic retrieval. We analyze SINFONI by processing images as an example of messages. Numerical results reveal a tremendous rate normalized SNR shift up to 20 dB compared to classically designed communication systems.

translated by 谷歌翻译

MIMO Channel Estimation using Score-Based Generative Models

Marius Arvinte , Jonathan I Tamir

分类：机器学习

2022-04-14

Channel estimation is a critical task in multiple-input multiple-output (MIMO) digital communications that substantially effects end-to-end system performance. In this work, we introduce a novel approach for channel estimation using deep score-based generative models. A model is trained to estimate the gradient of the logarithm of a distribution and is used to iteratively refine estimates given measurements of a signal. We introduce a framework for training score-based generative models for wireless MIMO channels and performing channel estimation based on posterior sampling at test time. We derive theoretical robustness guarantees for channel estimation with posterior sampling in single-input single-output scenarios, and experimentally verify performance in the MIMO setting. Our results in simulated channels show competitive in-distribution performance, and robust out-of-distribution performance, with gains of up to $5$ dB in end-to-end coded communication performance compared to supervised deep learning methods. Simulations on the number of pilots show that high fidelity channel estimation with $25$% pilot density is possible for MIMO channel sizes of up to $64 \times 256$. Complexity analysis reveals that model size can efficiently trade performance for estimation latency, and that the proposed approach is competitive with compressed sensing in terms of floating-point operation (FLOP) count.

translated by 谷歌翻译

Scalable and Efficient Neural Speech Coding: A Hybrid Design

Kai Zhen , Jongmo Sung , Mi Suk Lee , Seungkwon Beak , Minje Kim

分类：机器学习

2021-03-27

我们提出了一种可扩展高效的神经波形编码系统，用于语音压缩。我们将语音编码问题作为一种自动汇总任务，其中卷积神经网络（CNN）在其前馈例程期间执行编码和解码作为神经波形编解码器（NWC）。所提出的NWC还将量化和熵编码定义为可培训模块，因此在优化过程期间处理编码伪像和比特率控制。通过将紧凑的模型组件引入NWC，如Gated Reseal Networks和深度可分离卷积，我们实现了效率。此外，所提出的模型具有可扩展的架构，跨模块残差学习（CMRL），以覆盖各种比特率。为此，我们采用残余编码概念来连接多个NWC自动汇总模块，其中每个NWC模块执行残差编码以恢复其上一模块已创建的任何重建损失。 CMRL也可以缩小以覆盖下比特率，因为它采用线性预测编码（LPC）模块作为其第一自动化器。混合设计通过将LPC的量化作为可分散的过程重新定义LPC和NWC集成，使系统培训端到端的方式。所提出的系统的解码器在低至中等比特率范围（12至20kbps）或高比特率（32kbps）中的两个NWC中的一个NWC（0.12百万个参数）。尽管解码复杂性尚不低于传统语音编解码器的复杂性，但是从其他神经语音编码器（例如基于WVENET的声码器）显着降低。对于宽带语音编码质量，我们的系统对AMR-WB的性能相当或卓越的性能，并在低和中等比特率下的速度试验话题上的表现。所提出的系统可以扩展到更高的比特率以实现近透明性能。

translated by 谷歌翻译