Smart Paper Notes

PicArrange -- Visually Sort, Search, and Explore Private Images on a Mac Computer

Klaus Jung , Kai Uwe Barthel , Nico Hezel , Konstantin Schall

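Categories: Computer Vision | Machine Learning

2021-11-26

The native macOS application PicArrange integrates state-of-the-art image sorting and similarity search to give users a better overview of their images. Many file and image management features have been added to make it a tool that covers a complete image management workflow. A modification of the Self-Sorting Map algorithm enables a list-like image arrangement without losing the visual sorting. Efficient computation and storage of visual features, together with the use of many macOS APIs, result in an application that is fluid to use.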

translated by Google Translate

Deep Lake: a Lakehouse for Deep Learning

Sasun Hambardzumyan , Abhinav Tuli , Levon Ghukasyan , Fariz Rahman , Hrant Topchyan , David Isayan , Mikayel Harutyunyan , Tatevik Hakobyan , Ivo Stranic , Davit Buniatyan

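Categories: Artificial Intelligence | Computer Vision

2022-09-22

Traditional data lakes provide critical data infrastructure for analytical workloads by enabling time travel, running SQL queries, ingesting data with ACID transactions, and visualizing petabyte-scale datasets on cloud storage. They allow organizations to break down data silos, unlock data-driven decision-making, improve operational efficiency, and reduce costs. However, as deep learning takes over common analytical workflows, traditional data lakes become less useful for applications such as natural language processing (NLP), audio processing, computer vision, and applications involving non-tabular datasets. This paper presents Deep Lake, an open-source lakehouse for deep learning applications developed at Activeloop. Deep Lake maintains the benefits of a vanilla data lake with one key difference: it stores complex data, such as images, videos, and annotations, as well as tabular data, in the form of tensors and rapidly streams the data over the network to (a) a Tensor Query Language, (b) an in-browser visualization engine, or (c) deep learning frameworks without sacrificing GPU utilization. Datasets stored in Deep Lake can be accessed from PyTorch, TensorFlow, and JAX, and integrate with numerous MLOps tools.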

GPU backed Data Mining on Android Devices

Robert Fritze , Claudia Plant

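Categories: Machine Learning

2021-12-09

Choosing an appropriate programming paradigm for high-performance computing on low-power devices can help speed up calculations. Many Android devices have an integrated GPU, and although it is not officially supported, the OpenCL framework can be used on Android devices to address these GPUs. OpenCL supports thread and data parallelism. Applications that use the GPU must account for the fact that they can be suspended by the user or the Android operating system at any moment. We have created a wrapper library that allows OpenCL to be used on Android devices. Existing OpenCL programs can be executed with almost no modification. We used this library to compare the performance of the DBSCAN and k-means algorithms on the integrated GPU of an ARM-v7 tablet with other single-threaded and multithreaded implementations on the same device. We investigated which programming paradigm and language allow the best tradeoff between execution speed and energy consumption. Using the GPU for HPC on Android devices can help carry out computationally intensive machine learning or data mining tasks in remote areas, under harsh environmental conditions, and in areas where energy supply is an issue.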
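The abstract names k-means as one of the benchmarked algorithms and points to data parallelism as the property OpenCL exploits. The following is a minimal plain-Python sketch of Lloyd's k-means, not the authors' OpenCL implementation; the function names and sample data are hypothetical. It is included only to show why the assignment step suits a GPU: each point is processed independently of all others.

```python
import math


def assign(points, centroids):
    """Assignment step: each point independently picks its nearest centroid.

    No point depends on any other point here, so this is the part a
    data-parallel kernel could run once per point.
    """
    return [
        min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
        for p in points
    ]


def update(points, labels, centroids):
    """Update step: move each centroid to the mean of its assigned points."""
    new_centroids = []
    for c, old in enumerate(centroids):
        members = [p for p, label in zip(points, labels) if label == c]
        if not members:
            new_centroids.append(old)  # keep the centroid of an empty cluster
            continue
        dims = len(old)
        new_centroids.append(
            tuple(sum(m[d] for m in members) / len(members) for d in range(dims))
        )
    return new_centroids


def kmeans(points, centroids, max_iter=100):
    """Plain Lloyd iteration starting from caller-supplied centroids."""
    labels = assign(points, centroids)
    for _ in range(max_iter):
        centroids = update(points, labels, centroids)
        new_labels = assign(points, centroids)
        if new_labels == labels:  # converged: no point changed cluster
            break
        labels = new_labels
    return labels, centroids


# Hypothetical sample data: two well-separated groups of three points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (9.0, 9.0), (9.0, 10.0), (10.0, 9.0)]
labels, centroids = kmeans(points, centroids=[(0.0, 0.0), (10.0, 10.0)])
```

In this sketch the update step is a reduction over each cluster, which is typically the harder part to parallelize than the per-point assignment.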

Proceedings of the 2nd International Workshop on Reading Music Systems

Jorge Calvo-Zaragoza , Alexander Pacha

Categories: Computer Vision | Machine Learning

2022-12-01

The International Workshop on Reading Music Systems (WoRMS) is a workshop that tries to connect researchers who develop systems for reading music, such as in the field of Optical Music Recognition, with other researchers and practitioners that could benefit from such systems, like librarians or musicologists. The relevant topics of interest for the workshop include, but are not limited to: Music reading systems; Optical music recognition; Datasets and performance evaluation; Image processing on music scores; Writer identification; Authoring, editing, storing and presentation systems for music scores; Multi-modal systems; Novel input-methods for music to produce written music; Web-based Music Information Retrieval services; Applications and projects; Use-cases related to written music. These are the proceedings of the 2nd International Workshop on Reading Music Systems, held in Delft on the 2nd of November 2019.

Deployment of ML Models using Kubeflow on Different Cloud Providers

Aditya Pandey , Maitreya Sonawane , Sumit Mamtani

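Categories: Machine Learning

2022-06-27

This project explores the process of deploying machine learning models on Kubernetes using Kubeflow [1], an open-source, end-to-end ML stack orchestration toolkit. We create end-to-end machine learning models in the form of pipelines and analyze several aspects, including setup, model deployment, performance, limitations, and features. We hope the project serves almost as a seminar or introductory report that helps vanilla cloud/Kubernetes users with zero knowledge of Kubeflow use it to deploy ML models. From setup on different clouds to serving a trained model over the internet, we give details and metrics describing Kubeflow's performance.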

A Survey of Plagiarism Detection Systems: Case of Use with English, French and Arabic Languages

Mehdi Abdelhamid , Faical Azouaou , Sofiane Batata

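Categories: Natural Language Processing

2022-01-10

In academia, plagiarism is certainly not an emerging concern, but it has grown in magnitude with the popularization of the Internet and the ease of access to a worldwide source of content, rendering human-only intervention insufficient. Even so, plagiarism is far from being an unaddressed problem: computer-assisted plagiarism detection is currently an active area of research that falls within the fields of Information Retrieval (IR) and Natural Language Processing (NLP). Many software solutions have emerged to help with this task, and this paper presents an overview of plagiarism detection systems for use in Arabic, French, and English academic and educational settings. Eight systems are compared with respect to their features, usability, technical aspects, and their performance in detecting three levels of obfuscation from different sources: verbatim, paraphrase, and cross-language plagiarism. An in-depth examination of technical forms of plagiarism was also carried out in the context of this study. In addition, a survey of plagiarism typologies and classifications proposed by different authors is provided.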

Guided interactive image segmentation using machine learning and color based data set clustering

Adrian Friebel , Tim Johann , Dirk Drasdo , Stefan Hoehme

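Categories: Computer Vision

2020-05-15

We present a novel approach that combines machine-learning-based interactive image segmentation using supervoxels with a clustering method that automatically identifies similarly colored images in large data sets, enabling a guided reuse of classifiers. Our approach addresses the color variability that is prevalent and often unavoidable in biological and medical images and that typically degrades segmentation and quantification accuracy; in doing so it greatly reduces the necessary training effort. This gain in efficiency facilitates the quantification of much larger numbers of images and thereby enables interactive image analysis for recent technological advances in high-throughput imaging. The presented methods are applicable to almost any image type and are a useful tool for image analysis tasks in general.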

Democratizing Machine Translation with OPUS-MT

Jörg Tiedemann , Mikko Aulamo , Daria Bakshandaeva , Michele Boggia , Stig-Arne Grönroos , Tommi Nieminen , Alessandro Raganato , Yves Scherrer , Raul Vazquez , Sami Virpioja

Categories: Natural Language Processing

2022-12-04

This paper presents the OPUS ecosystem with a focus on the development of open machine translation models and tools, and their integration into end-user applications, development platforms and professional workflows. We discuss our on-going mission of increasing language coverage and translation quality, and also describe on-going work on the development of modular translation models and speed-optimized compact solutions for real-time translation on regular desktops and small devices.

PIMIP: An Open Source Platform for Pathology Information Management and Integration

Jialun Wu , Anyu Mao , Xinrui Bao , Haichuan Zhang , Zeyu Gao , Chunbao Wang , Tieliang Gong , Chen Li

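Categories: Artificial Intelligence | Computer Vision

2021-11-09

Digital pathology plays a crucial role in the development of artificial intelligence in the medical field. A digital pathology platform can make pathological resources digital and networked, and provides permanent storage and synchronous browsing of visual data without limitations of time and space. It has been widely used in various fields of pathology. However, there is still a lack of an open and universal digital pathology platform to assist doctors in the management and analysis of digital pathological sections, as well as in the management and structured description of relevant patient information. Most platforms cannot integrate image viewing, annotation, and analysis with text information management. To solve these problems, we propose PIMIP, a comprehensive and extensible platform. PIMIP provides image annotation functions based on the visualization of digital pathological sections. The annotation functions support multi-user collaborative annotation and multi-device annotation, and automate some annotation tasks. For the annotation tasks, we invited a professional pathologist for guidance. We introduce a machine learning module for image analysis. The data we collected include public data as well as clinical examples from local hospitals, which makes the platform more clinically oriented and suitable for clinical use. In addition to image data, we also structured the management and display of text information, so the platform is comprehensive. The platform framework is built in a modular way so that users can add machine learning modules independently, which makes the platform extensible.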

Sharing Linkable Learning Objects with the use of Metadata and a Taxonomy Assistant for Categorization

Valentina Franzoni , Sergio Tasso , Simonetta Pallottelli , Damiano Perri

Categories: Artificial Intelligence

2022-12-09

In this work, a re-design of the Moodledata module functionalities is presented to share learning objects between e-learning content platforms, e.g., Moodle and G-Lorep, in a linkable object format. The e-learning course content of the Drupal-based Content Management System G-Lorep for academic learning is exchanged by designing an object that incorporates metadata to support reuse and classification in its context. In such an Artificial Intelligence environment, the exchange of Linkable Learning Objects can be used for dialogue between Learning Systems to obtain information, especially with the use of semantic or structural similarity measures to enhance the existing Taxonomy Assistant for advanced automated classification.

3D Labeling Tool

John Rachwan , Charbel Zalaket

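Categories: Computer Vision | Artificial Intelligence

2022-07-23

Training and testing supervised object detection models requires a large collection of images with ground-truth labels. Labels define the object classes in an image, as well as their locations, shapes, and possibly other information such as pose. The labeling process is extremely time-consuming, even when manpower is available. We introduce a new labeling tool for 2D images as well as 3D triangular meshes: the 3D Labeling Tool (3DLT). It is a standalone, feature-rich, cross-platform application that requires no installation and runs on Windows, macOS, and Linux-based distributions. Instead of labeling the same object on every image separately, as current tools do, we use depth information to reconstruct a triangular mesh from the images and label the object only once on that mesh. We use registration to simplify 3D labeling, outlier detection to improve the computation of 2D bounding boxes, and surface reconstruction to extend labeling to large point clouds. Our tool is tested against state-of-the-art methods and greatly surpasses them while preserving accuracy and ease of use.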
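The idea of labeling an object once on the mesh and deriving a 2D bounding box for each image can be illustrated with a pinhole projection. This is a minimal sketch under stated assumptions, not the 3DLT implementation: the labeled vertices are assumed to be already expressed in the camera's coordinate frame, the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical values, and the outlier detection mentioned in the abstract is omitted.

```python
def project_point(point, fx, fy, cx, cy):
    """Project a 3D point given in camera coordinates with a pinhole model."""
    x, y, z = point
    if z <= 0:
        raise ValueError("point is behind the camera")
    return (fx * x / z + cx, fy * y / z + cy)


def bounding_box_2d(labeled_vertices, fx, fy, cx, cy):
    """Return (u_min, v_min, u_max, v_max) enclosing the projected vertices."""
    pixels = [project_point(p, fx, fy, cx, cy) for p in labeled_vertices]
    us = [u for u, _ in pixels]
    vs = [v for _, v in pixels]
    return (min(us), min(vs), max(us), max(vs))


# Hypothetical labeled object: four mesh vertices, 2 m in front of the camera.
vertices = [
    (-0.5, -0.25, 2.0),
    (0.5, -0.25, 2.0),
    (0.5, 0.25, 2.0),
    (-0.5, 0.25, 2.0),
]
box = bounding_box_2d(vertices, fx=800.0, fy=800.0, cx=320.0, cy=240.0)
```

Repeating the projection with each image's camera pose would yield one box per image from a single 3D label, which is the saving the abstract describes.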

The Platform for non-metallic pipes defects recognition. Design and Implementation

Fabio Cacciatori , Sergei Nikolaev , Dmitrii Grigorev

Categories: Machine Learning

2022-12-09

This paper describes a prototype software and hardware platform to provide support to field operators during the inspection of surface defects of non-metallic pipes. Inspection is carried out by video filming defects created on the same surface in real-time using a "smart" helmet device and other mobile devices. The work focuses on the detection and recognition of the defects, which appear as colored iridescence of reflected light caused by the diffraction effect arising from the presence of internal stresses in the inspected material. The platform allows preliminary analysis to be carried out directly on the device in offline mode, and, if a connection to the network is established, the received data is transmitted to the server for post-processing to extract information about possible defects that were not detected at the previous stage. The paper presents a description of the stages of design, formal description, and implementation details of the platform. It also provides descriptions of the models used to recognize defects and examples of the result of the work.

Computer Vision Based Parking Optimization System

Siddharth Chandrasekaran , Jeffrey Matthew Reginald , Wei Wang , Ting Zhu

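Categories: Computer Vision | Artificial Intelligence

2022-01-01

The improvement of technology is linearly related to time and to time-relevant problems. As time progresses, the number of problems humans face increases, but the technology to resolve these problems tends to improve as well. One of the earliest problems, dating back to the invention of vehicles, is parking. The ease of resolving this problem with technology has evolved over the years, but the parking problem remains unsolved. The main reason is that parking is not a single problem but a set of problems. One of them is occupancy detection of parking slots in a distributed parking ecosystem. In a distributed system, users would find preferable parking spaces rather than random ones. In this paper, we propose a web-based application as a solution for parking space detection across different parking lots. The solution is based on Computer Vision (CV) and is built with the Django framework, written in Python 3.0. It addresses the occupancy detection problem and also gives users the option to choose a block based on availability and their preference. The evaluation results for our proposed system are promising and efficient. The proposed system can also be integrated with other systems and used to solve other related parking problems.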

DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps

Donald Bertucci , Md Montaser Hamid , Yashwanthi Anand , Anita Ruangrotsakun , Delyar Tabatabai , Melissa Perez , Minsuk Kahng

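Categories: Artificial Intelligence | Machine Learning

2022-05-14

In this paper, we present DendroMap, a novel approach to interactively exploring large-scale image datasets for machine learning (ML). ML practitioners often explore image datasets by generating a grid of images or by projecting high-dimensional representations of images into 2-D using dimensionality reduction techniques (e.g., t-SNE). However, neither approach scales effectively to large datasets, because images are ineffectively organized and interactions are insufficiently supported. To address these challenges, we develop DendroMap by adapting Treemaps, a well-known visualization technique. DendroMap effectively organizes images by extracting hierarchical cluster structures from high-dimensional representations of images. It enables users to make sense of the overall distribution of a dataset and to interactively zoom into specific areas of interest at multiple levels of abstraction. Our case studies with widely used image datasets for deep learning demonstrate that users can discover insights about datasets and trained models by examining the diversity of images, identifying underperforming subgroups, and analyzing classification errors. We conducted a user study that evaluates the effectiveness of DendroMap in grouping and searching tasks by comparing it with a gridified version of t-SNE, and found that participants preferred DendroMap. DendroMap is available at https://div-lab.github.io/dendromap/.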
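Extracting a hierarchical cluster structure from image representations can be sketched with naive agglomerative clustering. The abstract does not say which clustering algorithm or linkage DendroMap uses, so the single-linkage rule, the function names, and the toy 2-D "embeddings" below are all assumptions made for illustration. The result is a nested tuple, a form that a treemap layout could consume level by level.

```python
import math


def leaves(tree):
    """Collect the item indices stored under a (possibly nested) cluster."""
    if isinstance(tree, int):
        return [tree]
    left, right = tree
    return leaves(left) + leaves(right)


def single_linkage(a, b, vectors):
    """Cluster distance = distance between the closest pair of members."""
    return min(
        math.dist(vectors[i], vectors[j]) for i in leaves(a) for j in leaves(b)
    )


def build_hierarchy(vectors):
    """Repeatedly merge the two closest clusters until one tree remains."""
    clusters = list(range(len(vectors)))  # start with one cluster per item
    while len(clusters) > 1:
        pairs = [
            (i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        ]
        i, j = min(
            pairs,
            key=lambda ij: single_linkage(clusters[ij[0]], clusters[ij[1]], vectors),
        )
        merged = (clusters[i], clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0]


# Hypothetical embeddings: two tight groups that are far from each other.
embeddings = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 12.0)]
tree = build_hierarchy(embeddings)
```

This naive version recomputes distances on every merge and is only practical for small inputs; a real system would use an optimized library routine.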

RoboStack: Using the Robot Operating System alongside the Conda and Jupyter Data Science Ecosystems

Tobias Fischer , Wolf Vollprecht , Silvio Traversaro , Sean Yen , Carlos Herrero , Michael Milford

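Categories: Robotics

2021-04-26

We argue that it is beneficial to tightly couple the widely used Robot Operating System (ROS) with Conda, a cross-platform, language-agnostic package manager, and Jupyter, a web-based interactive computational environment for scientific computing. We provide new ROS packages for Conda, making it easy to install ROS alongside data science and machine learning packages. Multiple ROS versions (currently ROS1 Melodic and Noetic, and ROS2 Foxy and Galactic) can run simultaneously on one machine, with pre-compiled binaries available for Linux, Windows, and OSX, and for the ARM architecture (e.g., the Raspberry Pi and the new Apple Silicon). To deal with the large size of the ROS ecosystem, we significantly improved the speed of the Conda solver and build system by rewriting crucial parts in C++. We further contribute a collection of JupyterLab extensions for ROS, including plugins for live plotting, debugging, and robot control, as well as tight integration with Zethus, an RViz-like visualization tool. Taken together, RoboStack combines the best of the data science and robotics worlds to help researchers and developers build custom solutions for their academic and industrial projects.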

Analyzing social media with crowdsourcing in Crowd4SDG

Carlo Bono , Mehmet Oğuz Mülâyim , Cinzia Cappiello , Mark Carman , Jesus Cerquides , Jose Luis Fernandez-Marquez , Rosy Mondardini , Edoardo Ramalli , Barbara Pernici

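Categories: Artificial Intelligence

2022-08-04

Social media have the potential to provide timely information about emergency situations and sudden events. However, finding relevant information among the millions of posts published every day can be difficult, and developing a data analysis project usually requires time and technical skills. This study presents an approach that provides flexible support for analyzing social media, particularly during emergencies. Different use cases in which social media analysis can be adopted are introduced, and the challenges of retrieving information from large sets of posts are discussed. The focus is on analyzing the images and text contained in social media posts, and on a set of automatic data processing tools for filtering and classifying content, with a human-in-the-loop approach to support the data analyst. This support includes feedback and suggestions for configuring the automated tools, as well as crowdsourcing to gather input from citizens. The results are validated by discussing three case studies developed within the Crowd4SDG H2020 European project.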

Analyzing the State of Computer Science Research with the DBLP Discovery Dataset

Lennart Küll

Categories: Natural Language Processing

2022-12-01

The number of scientific publications continues to rise exponentially, especially in Computer Science (CS). However, current solutions to analyze those publications restrict access behind a paywall, offer no features for visual analysis, limit access to their data, only focus on niches or sub-fields, and/or are not flexible and modular enough to be transferred to other datasets. In this thesis, we conduct a scientometric analysis to uncover the implicit patterns hidden in CS metadata and to determine the state of CS research. Specifically, we investigate trends of the quantity, impact, and topics for authors, venues, document types (conferences vs. journals), and fields of study (compared to, e.g., medicine). To achieve this we introduce the CS-Insights system, an interactive web application to analyze CS publications with various dashboards, filters, and visualizations. The data underlying this system is the DBLP Discovery Dataset (D3), which contains metadata from 5 million CS publications. Both D3 and CS-Insights are open-access, and CS-Insights can be easily adapted to other datasets in the future. The most interesting findings of our scientometric analysis include that i) there has been a stark increase in publications, authors, and venues in the last two decades, ii) many authors only recently joined the field, iii) the most cited authors and venues focus on computer vision and pattern recognition, while the most productive prefer engineering-related topics, iv) the preference of researchers to publish in conferences over journals dwindles, v) on average, journal articles receive twice as many citations compared to conference papers, but the contrast is much smaller for the most cited conferences and journals, and vi) journals also get more citations in all other investigated fields of study, while only CS and engineering publish more in conferences than journals.

Visualization Of Class Activation Maps To Explain AI Classification Of Network Packet Captures

Igor Cherepanov , Alex Ulmer , Jonathan Geraldi Joewono , Jörn Kohlhammer

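Categories: Machine Learning

2022-09-05

The classification of internet traffic has become increasingly important due to the rapid growth of today's networks and applications. The number of connections and the addition of new applications in our networks produce vast amounts of log data and complicate the search for common patterns by experts. Finding such patterns within specific classes of applications is necessary to fulfill various requirements in network analytics. Deep learning methods provide both feature extraction and classification from data in a single system. However, these networks are very complex and are used as black-box models, which weakens the experts' trust in the classifications. Moreover, when they are used as black boxes, no new knowledge can be obtained from the model predictions despite their excellent performance. The explainability of the classifications is therefore crucial. Besides increasing trust, the explanation can be used for model evaluation, for gaining new insights from the data, and for improving the model. In this paper, we present a visual interactive tool that combines the classification of network data with an explanation technique to form an interface between experts, algorithms, and data.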

PhishMatch: A Layered Approach for Effective Detection of Phishing URLs

Harshal Tupsamudre , Sparsh Jain , Sachin Lodha

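Categories: Machine Learning

2021-12-04

Phishing attacks continue to be a significant threat on the Internet. Prior studies show that it is possible to determine whether a website is phishing just by analyzing its URL more carefully. A major advantage of the URL-based approach is that it can identify a phishing website even before the web page is rendered in the browser, thus avoiding other potential problems such as cryptojacking and drive-by downloads. However, traditional URL-based approaches have their limitations. Blacklist-based approaches are prone to zero-hour phishing attacks, advanced machine-learning-based approaches consume high resources, and other approaches send the URL to a remote server, which compromises the user's privacy. In this paper, we present PhishMatch, a layered anti-phishing defense that is robust, accurate, inexpensive, and client-side. We design a space-time efficient Aho-Corasick algorithm for exact string matching and an n-gram-based indexing technique for approximate string matching to detect various cybersquatting techniques in phishing URLs. To reduce false positives, we use a global whitelist and personalized user whitelists. We also determine the context in which the URL is visited and use that information to classify the input URL more accurately. The last component of PhishMatch involves a machine learning model and controlled search engine queries to classify the URL. A prototype plugin of PhishMatch, developed for the Chrome browser, was found to be fast and lightweight. Our evaluation shows that PhishMatch is both efficient and effective.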
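The n-gram-based indexing for approximate string matching can be illustrated with a small inverted index over character trigrams. This is a hedged sketch, not the PhishMatch implementation: the padding characters, the Jaccard similarity measure, the whitelist contents, and the 0.5 threshold are all assumptions chosen for the example.

```python
from collections import defaultdict


def ngrams(text, n=3):
    """Return the set of character n-grams of text, padded at both ends."""
    padded = "^" + text + "$"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def build_index(domains, n=3):
    """Inverted index: n-gram -> set of whitelisted domains containing it."""
    index = defaultdict(set)
    for domain in domains:
        for gram in ngrams(domain, n):
            index[gram].add(domain)
    return index


def closest_match(query, index, n=3):
    """Return (domain, Jaccard similarity) of the best candidate, or None."""
    query_grams = ngrams(query, n)
    candidates = set()
    for gram in query_grams:
        # Only domains sharing at least one n-gram are ever compared.
        candidates |= index.get(gram, set())
    best = None
    for domain in sorted(candidates):  # sorted for deterministic tie-breaking
        domain_grams = ngrams(domain, n)
        score = len(query_grams & domain_grams) / len(query_grams | domain_grams)
        if best is None or score > best[1]:
            best = (domain, score)
    return best


# Hypothetical whitelist and a look-alike domain with one swapped character.
index = build_index(["paypal.com", "example.org"])
match = closest_match("paypa1.com", index)
# Hypothetical rule: similar to a whitelisted domain but not identical to it.
is_suspicious = match is not None and 0.5 <= match[1] < 1.0
```

The index keeps lookups cheap because a query is scored only against domains that share at least one trigram with it, rather than against the whole whitelist.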

Music Recommendation System based on Emotion, Age and Ethnicity

Ramiz Mammadli , Huma Bilgin , Ali Can Karaca

Categories: Computer Vision | Artificial Intelligence

2022-12-09

A Music Recommendation System based on Emotion, Age, and Ethnicity is developed in this study, using the FER-2013 and "Age, Gender, and Ethnicity (Face Data) CSV" datasets. The CNN architecture, which is extensively used for this kind of purpose, has been applied to the training of the models. After adding several appropriate layers to the training end of the project, in total, 3 separate models are trained in the Deep Learning side of the project: Emotion, Ethnicity, and Age. After the training step of these models, they are used as classifiers on the web application side. The snapshot of the user taken through the interface is sent to the models to predict their mood, age, and ethnic origin. According to these classifiers, various kinds of playlists pulled from Spotify API are proposed to the user in order to establish a functional and user-friendly atmosphere for the music selection. Afterward, the user can choose the playlist they want and listen to it by following the given link.
