智能论文笔记

DIY Graphics Tab: A Cost-Effective Alternative to Graphics Tablet for Educators

Mohammad Imrul Jubair , Arafat Ibne Yousuf , Tashfiq Ahmed , Hasanath Jamy , Foisal Reza , Mohsena Ashraf

分类：计算机视觉

2021-12-05

每天，越来越多的人正在转向在线学习，这改变了我们的传统课堂方法。录音讲座一直是在线教育者的正常任务，并且在疫情中最近变得更加重要，因为实际的课程仍在推迟在几个国家。录制讲座时，由于其与计算机接口的便携性和能力，图形平板电脑是一个很大的白板替代白板。然而，这种图形平板电脑对于大多数教师来说太昂贵了。在本文中，我们向教师和教育工作者提出了一种基于计算机视觉的图形平板电脑，这主要以与图形平板电脑相同的方式，而只是需要笔，纸张和笔记本电脑的网络摄像头。我们称之为“自己为自己的图形标签”或“DIY图形选项卡”。我们的系统在由摄像机获取的纸上收到一系列人员写作作为输入的纸张，并输出包含纸张写入内容的屏幕。由于人的手，由于人的手，随机运动，纸张，照明条件不佳，由于视角，透视失真等诸如遮挡等许多障碍物而言。一种管道通过我们的系统，在生成适当的输出之前，进行实例分段和预处理。我们还从教师和学生进行了用户体验评估，并在本文中审查了他们的回复。

translated by 谷歌翻译

Transferable Energy Storage Bidder

Yousuf Baker , Ningkun Zheng , Bolun Xu

分类：机器学习

2023-01-02

Energy storage resources must consider both price uncertainties and their physical operating characteristics when participating in wholesale electricity markets. This is a challenging problem as electricity prices are highly volatile, and energy storage has efficiency losses, power, and energy constraints. This paper presents a novel, versatile, and transferable approach combining model-based optimization with a convolutional long short-term memory network for energy storage to respond to or bid into wholesale electricity markets. We apply transfer learning to the ConvLSTM network to quickly adapt the trained bidding model to new market environments. We test our proposed approach using historical prices from New York State, showing it achieves state-of-the-art results, achieving between 70% to near 90% profit ratio compared to perfect foresight cases, in both price response and wholesale market bidding setting with various energy storage durations. We also test a transfer learning approach by pre-training the bidding model using New York data and applying it to arbitrage in Queensland, Australia. The result shows transfer learning achieves exceptional arbitrage profitability with as little as three days of local training data, demonstrating its significant advantage over training from scratch in scenarios with very limited data availability.

translated by 谷歌翻译

A Dependable Hybrid Machine Learning Model for Network Intrusion Detection

Md. Alamin Talukder , Khondokar Fida Hasan , Md. Manowarul Islam , Md Ashraf Uddin , Arnisha Akhter , Mohammand Abu Yousuf , Fares Alharbi , Mohammad Ali Moni

分类：机器学习

2022-12-08

Network intrusion detection systems (NIDSs) play an important role in computer network security. There are several detection mechanisms where anomaly-based automated detection outperforms others significantly. Amid the sophistication and growing number of attacks, dealing with large amounts of data is a recognized issue in the development of anomaly-based NIDS. However, do current models meet the needs of today's networks in terms of required accuracy and dependability? In this research, we propose a new hybrid model that combines machine learning and deep learning to increase detection rates while securing dependability. Our proposed method ensures efficient pre-processing by combining SMOTE for data balancing and XGBoost for feature selection. We compared our developed method to various machine learning and deep learning algorithms to find a more efficient algorithm to implement in the pipeline. Furthermore, we chose the most effective model for network intrusion based on a set of benchmarked performance analysis criteria. Our method produces excellent results when tested on two datasets, KDDCUP'99 and CIC-MalMem-2022, with an accuracy of 99.99% and 100% for KDDCUP'99 and CIC-MalMem-2022, respectively, and no overfitting or Type-1 and Type-2 issues.

translated by 谷歌翻译

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

Keshav Santhanam , Jon Saad-Falcon , Martin Franz , Omar Khattab , Avirup Sil , Radu Florian , Md Arafat Sultan , Salim Roukos , Matei Zaharia , Christopher Potts

分类：自然语言处理

2022-12-02

Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks. Unfortunately, some dimensions of this progress are illusory: the majority of the popular IR benchmarks today focus exclusively on downstream task accuracy and thus conceal the costs incurred by systems that trade away efficiency for quality. Latency, hardware cost, and other efficiency considerations are paramount to the deployment of IR systems in user-facing settings. We propose that IR benchmarks structure their evaluation methodology to include not only metrics of accuracy, but also efficiency considerations such as a query latency and the corresponding cost budget for a reproducible hardware setting. For the popular IR benchmarks MS MARCO and XOR-TyDi, we show how the best choice of IR system varies according to how these efficiency considerations are chosen and weighed. We hope that future benchmarks will adopt these guidelines toward more holistic IR evaluation.

translated by 谷歌翻译

SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

Ameet Deshpande , Md Arafat Sultan , Anthony Ferritto , Ashwin Kalyan , Karthik Narasimhan , Avirup Sil

分类：自然语言处理 | 人工智能 | 机器学习

2022-11-29

Fine-tuning pre-trained language models (PLMs) achieves impressive performance on a range of downstream tasks, and their sizes have consequently been getting bigger. Since a different copy of the model is required for each task, this paradigm is infeasible for storage-constrained edge devices like mobile phones. In this paper, we propose SPARTAN, a parameter efficient (PE) and computationally fast architecture for edge devices that adds hierarchically organized sparse memory after each Transformer layer. SPARTAN freezes the PLM parameters and fine-tunes only its memory, thus significantly reducing storage costs by re-using the PLM backbone for different tasks. SPARTAN contains two levels of memory, with only a sparse subset of parents being chosen in the first level for each input, and children cells corresponding to those parents being used to compute an output representation. This sparsity combined with other architecture optimizations improves SPARTAN's throughput by over 90% during inference on a Raspberry Pi 4 when compared to PE baselines (adapters) while also outperforming the latter by 0.1 points on the GLUE benchmark. Further, it can be trained 34% faster in a few-shot setting, while performing within 0.9 points of adapters. Qualitative analysis shows that different parent cells in SPARTAN specialize in different topics, thus dividing responsibility efficiently.

translated by 谷歌翻译

Device Modeling Bias in ReRAM-based Neural Network Simulations

Osama Yousuf , Imtiaz Hossen , Matthew W. Daniels , Martin Lueker-Boden , Andrew Dienstfrey , Gina C. Adam

分类：机器学习

2022-11-29

Data-driven modeling approaches such as jump tables are promising techniques to model populations of resistive random-access memory (ReRAM) or other emerging memory devices for hardware neural network simulations. As these tables rely on data interpolation, this work explores the open questions about their fidelity in relation to the stochastic device behavior they model. We study how various jump table device models impact the attained network performance estimates, a concept we define as modeling bias. Two methods of jump table device modeling, binning and Optuna-optimized binning, are explored using synthetic data with known distributions for benchmarking purposes, as well as experimental data obtained from TiOx ReRAM devices. Results on a multi-layer perceptron trained on MNIST show that device models based on binning can behave unpredictably particularly at low number of points in the device dataset, sometimes over-promising, sometimes under-promising target network accuracy. This paper also proposes device level metrics that indicate similar trends with the modeling bias metric at the network level. The proposed approach opens the possibility for future investigations into statistical device models with better performance, as well as experimentally verified modeling bias in different in-memory computing and neural network architectures.

translated by 谷歌翻译

AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages

Bonaventure F. P. Dossou , Atnafu Lambebo Tonja , Oreen Yousuf , Salomey Osei , Abigail Oppong , Iyanuoluwa Shode , Oluwabusayo Olufunke Awoyomi , Chris Chinenye Emezue

分类：自然语言处理 | 人工智能 | 机器学习

2022-11-07

In recent years, multilingual pre-trained language models have gained prominence due to their remarkable performance on numerous downstream Natural Language Processing tasks (NLP). However, pre-training these large multilingual language models requires a lot of training data, which is not available for African Languages. Active learning is a semi-supervised learning algorithm, in which a model consistently and dynamically learns to identify the most beneficial samples to train itself on, in order to achieve better optimization and performance on downstream tasks. Furthermore, active learning effectively and practically addresses real-world data scarcity. Despite all its benefits, active learning, in the context of NLP and especially multilingual language models pretraining, has received little consideration. In this paper, we present AfroLM, a multilingual language model pretrained from scratch on 23 African languages (the largest effort to date) using our novel self-active learning framework. Pretrained on a dataset significantly (14x) smaller than existing baselines, AfroLM outperforms many multilingual pretrained language models (AfriBERTa, XLMR-base, mBERT) on various NLP downstream tasks (NER, text classification, and sentiment analysis). Additional out-of-domain sentiment analysis experiments show that \textbf{AfroLM} is able to generalize well across various domains. We release the code source, and our datasets used in our framework at https://github.com/bonaventuredossou/MLM_AL.

translated by 谷歌翻译

Single MR Image Super-Resolution using Generative Adversarial Network

Shawkh Ibne Rashid , Elham Shakibapour , Mehran Ebrahimi

分类：计算机视觉 | 机器学习

2022-07-16

可以使用超分辨率方法改善医学图像的空间分辨率。实际增强的超级分辨率生成对抗网络（Real-Esrgan）是最近用于产生较高分辨率图像的最新有效方法之一，给定较低分辨率的输入图像。在本文中，我们应用这种方法来增强2D MR图像的空间分辨率。在我们提出的方法中，我们稍微修改了从脑肿瘤分割挑战（BRATS）2018数据集中训练2D磁共振图像（MRI）的结构。通过计算SSIM（结构相似性指数量度），NRMSE（归一化根平方误），MAE（平均绝对误差）和VIF（视觉信息保真度）值，通过计算SSIM（结构相似性指数量度）进行定性和定量验证。

translated by 谷歌翻译

Lessons from Deep Learning applied to Scholarly Information Extraction: What Works, What Doesn't, and Future Directions

Raquib Bin Yousuf , Subhodip Biswas , Kulendra Kumar Kaushal , James Dunham , Rebecca Gelles , Sathappan Muthiah , Nathan Self , Patrick Butler , Naren Ramakrishnan

分类：人工智能

2022-07-08

了解全文学术文章的关键见解至关重要，因为它使我们能够确定有趣的趋势，洞悉研究和发展，并构建知识图。但是，只有在考虑全文时才可用一些有趣的关键见解。尽管研究人员在简短文档中的信息提取方面取得了重大进展，但从全文学术文献中提取科学实体仍然是一个具有挑战性的问题。这项工作提出了一种称为ENEREX的自动端对端研究实体提取器，用于提取技术集，客观任务，全文学术学术研究文章等技术方面。此外，我们提取了三个新颖的方面，例如源代码，计算资源，编程语言/库中的链接。我们演示了Enerex如何从计算机科学领域的大规模数据集中提取关键见解和趋势。我们进一步测试了多个数据集上的管道，发现ENEREX在最新模型的状态下进行了改进。我们强调了现有数据集的能力如何受到限制，以及enerex如何适应现有知识图。我们还向未来研究的指针进行了详细的讨论。我们的代码和数据可在https://github.com/discoveryanalyticscenter/enerex上公开获取。

translated by 谷歌翻译

A novel approach to increase scalability while training machine learning algorithms using Bfloat 16 in credit card fraud detection

Bushra Yousuf , Rejwan Bin Sulaiman , Musarrat Saberin Nipun

分类：机器学习 | 人工智能

2022-06-24

如今，随着数字银行业务已成为常态，信用卡的使用已变得很普遍。随着这一增加，信用卡中的欺诈也对银行和客户都有一个巨大的问题和损失。正常的欺诈检测系统无法检测欺诈，因为欺诈者使用新技术出现欺诈。这创造了使用基于机器学习的软件来检测欺诈的需求。当前，可用的机器学习软件仅着眼于检测欺诈的准确性，但不关注检测的成本或时间因素。这项研究重点是银行信用卡欺诈检测系统的机器学习可伸缩性。我们已经比较了新提出的技术可用的现有机器学习算法和方法。目的是证明，使用较少的位训练机器学习算法将导致更可扩展的系统，这将减少时间，并且实施成本也较低。

translated by 谷歌翻译