智能论文笔记

Imaging with Equivariant Deep Learning

Dongdong Chen , Mike Davies , Matthias J. Ehrhardt , Carola-Bibiane Schönlieb , Ferdia Sherry , Julián Tachella

分类：计算机视觉

2022-09-05

从早期图像处理到现代计算成像，成功的模型和算法都依赖于自然信号的基本属性：对称性。在这里，对称是指信号集的不变性属性，例如翻译，旋转或缩放等转换。对称性也可以以模棱两可的形式纳入深度神经网络中，从而可以进行更多的数据效率学习。虽然近年来端到端的图像分类网络的设计方面取得了重要进展，但计算成像引入了对等效网络解决方案的独特挑战，因为我们通常只通过一些嘈杂的不良反向操作员观察图像，可能不是均等的。我们回顾了现象成像的新兴领域，并展示它如何提供改进的概括和新成像机会。在此过程中，我们展示了采集物理学与小组动作之间的相互作用，以及与迭代重建，盲目的压缩感应和自我监督学习之间的联系。

translated by 谷歌翻译

Regularising Inverse Problems with Generative Machine Learning Models

Margaret Duff , Neill D. F. Campbell , Matthias J. Ehrhardt

分类：计算机视觉 | 机器学习

2021-07-22

在过去的几年中，深层神经网络方法的反向成像问题产生了令人印象深刻的结果。在本文中，我们考虑在跨问题方法中使用生成模型。所考虑的正规派对图像进行了惩罚，这些图像远非生成模型的范围，该模型学会了产生类似于训练数据集的图像。我们命名这个家庭\ textit {生成正规派}。生成常规人的成功取决于生成模型的质量，因此我们提出了一组所需的标准来评估生成模型并指导未来的研究。在我们的数值实验中，我们根据我们所需的标准评估了三种常见的生成模型，自动编码器，变异自动编码器和生成对抗网络。我们还测试了三个不同的生成正规疗法仪，关于脱毛，反卷积和断层扫描的逆问题。我们表明，逆问题的限制解决方案完全位于生成模型的范围内可以给出良好的结果，但是允许与发电机范围的小偏差产生更一致的结果。

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

A novel GAN-based paradigm for weakly supervised brain tumor segmentation of MR images

Jay J. Yoo , Khashayar Namdar , Matthias W. Wagner , Liana Nobre , Uri Tabori , Cynthia Hawkins , Birgit B. Ertl-Wagner , Farzad Khalvati

分类：计算机视觉

2022-11-10

Segmentation of regions of interest (ROIs) for identifying abnormalities is a leading problem in medical imaging. Using Machine Learning (ML) for this problem generally requires manually annotated ground-truth segmentations, demanding extensive time and resources from radiologists. This work presents a novel weakly supervised approach that utilizes binary image-level labels, which are much simpler to acquire, to effectively segment anomalies in medical Magnetic Resonance (MR) images without ground truth annotations. We train a binary classifier using these labels and use it to derive seeds indicating regions likely and unlikely to contain tumors. These seeds are used to train a generative adversarial network (GAN) that converts cancerous images to healthy variants, which are then used in conjunction with the seeds to train a ML model that generates effective segmentations. This method produces segmentations that achieve Dice coefficients of 0.7903, 0.7868, and 0.7712 on the MICCAI Brain Tumor Segmentation (BraTS) 2020 dataset for the training, validation, and test cohorts respectively. We also propose a weakly supervised means of filtering the segmentations, removing a small subset of poorer segmentations to acquire a large subset of high quality segmentations. The proposed filtering further improves the Dice coefficients to up to 0.8374, 0.8232, and 0.8136 for training, validation, and test, respectively.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Graph Neural Networks for Low-Energy Event Classification & Reconstruction in IceCube

R. Abbasi , M. Ackermann , J. Adams , N. Aggarwal , J. A. Aguilar , M. Ahlers , M. Ahrens , J. M. Alameddine , A. A. Alves Jr. , N. M. Amin

分类：机器学习

2022-09-07

ICECUBE是一种用于检测1 GEV和1 PEV之间大气和天体中微子的光学传感器的立方公斤阵列，该阵列已部署1.45 km至2.45 km的南极的冰盖表面以下1.45 km至2.45 km。来自ICE探测器的事件的分类和重建在ICeCube数据分析中起着核心作用。重建和分类事件是一个挑战，这是由于探测器的几何形状，不均匀的散射和冰中光的吸收，并且低于100 GEV的光，每个事件产生的信号光子数量相对较少。为了应对这一挑战，可以将ICECUBE事件表示为点云图形，并将图形神经网络（GNN）作为分类和重建方法。 GNN能够将中微子事件与宇宙射线背景区分开，对不同的中微子事件类型进行分类，并重建沉积的能量，方向和相互作用顶点。基于仿真，我们提供了1-100 GEV能量范围的比较与当前ICECUBE分析中使用的当前最新最大似然技术，包括已知系统不确定性的影响。对于中微子事件分类，与当前的IceCube方法相比，GNN以固定的假阳性速率（FPR）提高了信号效率的18％。另外，GNN在固定信号效率下将FPR的降低超过8（低于半百分比）。对于能源，方向和相互作用顶点的重建，与当前最大似然技术相比，分辨率平均提高了13％-20％。当在GPU上运行时，GNN能够以几乎是2.7 kHz的中位数ICECUBE触发速率的速率处理ICECUBE事件，这打开了在在线搜索瞬态事件中使用低能量中微子的可能性。

translated by 谷歌翻译

Stochastic Primal-Dual Three Operator Splitting with Arbitrary Sampling and Preconditioning

Junqi Tang , Matthias Ehrhardt , Carola-Bibiane Schönlieb

分类：机器学习

2022-08-02

在这项工作中，我们提出了一种随机原始偶对预处理的三操作算法，用于解决一类凸的三复合优化问题。我们提出的方案是SPDHG算法的直接三操作员分裂扩展[Chambolle等。2018]。我们提供了理论收敛分析，显示了厄贡O（1/K）收敛率，并证明了我们方法在成像反问题中的有效性。

translated by 谷歌翻译

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

分类：自然语言处理 | 人工智能 | 机器学习 | (统计)机器学习

2022-06-09

语言模型既展示了定量的改进，又展示了新的定性功能，随着规模的增加。尽管它们具有潜在的变革性影响，但这些新能力的特征却很差。为了为未来的研究提供信息，为破坏性的新模型能力做准备，并改善社会有害的效果，至关重要的是，我们必须了解目前和近乎未来的能力和语言模型的局限性。为了应对这一挑战，我们介绍了超越模仿游戏基准（Big Bench）。 Big Bench目前由204个任务组成，由132家机构的442位作者贡献。任务主题是多样的，从语言学，儿童发展，数学，常识性推理，生物学，物理学，社会偏见，软件开发等等。 Big-Bench专注于被认为超出当前语言模型的功能的任务。我们评估了OpenAI的GPT型号，Google内部密集变压器体系结构和大型基础上的开关稀疏变压器的行为，跨越了数百万到数十亿个参数。此外，一个人类专家评估者团队执行了所有任务，以提供强大的基准。研究结果包括：模型性能和校准都随规模改善，但绝对的术语（以及与评估者的性能相比）；在模型类中的性能非常相似，尽管带有稀疏性。逐渐和预测的任务通常涉及大量知识或记忆成分，而在临界规模上表现出“突破性”行为的任务通常涉及多个步骤或组成部分或脆性指标；社交偏见通常会随着含糊不清的环境而随着规模而增加，但这可以通过提示来改善。

translated by 谷歌翻译

Dynamical simulation via quantum machine learning with provable generalization

Joe Gibbs , Zoë Holmes , Matthias C. Caro , Nicholas Ezzell , Hsin-Yuan Huang , Lukasz Cincio , Andrew T. Sornborger , Patrick J. Coles

分类：机器学习

2022-04-21

作为量子优势的应用，对动态模拟和量子机学习（QML）的关注很大，而使用QML来增强动态模拟的可能性尚未得到彻底研究。在这里，我们开发了一个框架，用于使用QML方法模拟近期量子硬件上的量子动力学。我们使用概括范围，即机器学习模型在看不见的数据上遇到的错误，以严格分析此框架内算法的训练数据要求。这提供了一种保证，就量子和数据要求而言，我们的算法是资源有效的。我们的数字具有问题大小的有效缩放，我们模拟了IBMQ-Bogota上的Trotterization的20倍。

translated by 谷歌翻译

Capturing Shape Information with Multi-Scale Topological Loss Terms for 3D Reconstruction

Dominik J. E. Waibel , Scott Atwell , Matthias Meier , Carsten Marr , Bastian Rieck

分类：计算机视觉 | 机器学习 | (统计)机器学习

2022-03-03

从2D图像重建3D对象对于我们的大脑和机器学习算法都有挑战。为了支持此空间推理任务，有关对象整体形状的上下文信息至关重要。但是，此类信息不会通过既定的损失条款（例如骰子损失）捕获。我们建议通过在重建损失中包括多尺度拓扑特征，例如连接的组件，周期和空隙来补充几何形状信息。我们的方法使用立方复合物来计算3D体积数据的拓扑特征，并采用最佳传输距离来指导重建过程。这种拓扑感知的损失是完全可区分的，在计算上有效，并且可以添加到任何神经网络中。我们通过将损失纳入SHAPR来证明我们的损失的实用性，该模型用于根据2D显微镜图像预测单个细胞的3D细胞形状。使用利用单个对象的几何信息和拓扑信息来评估其形状的混合损失，我们发现拓扑信息大大提高了重建质量，从而突出了其从图像数据集中提取更多相关特征的能力。

translated by 谷歌翻译