智能论文笔记

Sentiment analysis and opinion mining on E-commerce site

Fatema Tuz Zohra Anny , Oahidul Islam

分类：自然语言处理 | 机器学习

2022-11-28

Sentiment analysis or opinion mining help to illustrate the phrase NLP (Natural Language Processing). Sentiment analysis has been the most significant topic in recent years. The goal of this study is to solve the sentiment polarity classification challenges in sentiment analysis. A broad technique for categorizing sentiment opposition is presented, along with comprehensive process explanations. With the results of the analysis, both sentence-level classification and review-level categorization are conducted. Finally, we discuss our plans for future sentiment analysis research.

translated by 谷歌翻译

Impact of Sentiment Analysis in Fake Review Detection

Amira Yousif , James Buckley

分类：自然语言处理 | 机器学习

2022-12-18

Fake review identification is an important topic and has gained the interest of experts all around the world. Identifying fake reviews is challenging for researchers, and there are several primary challenges to fake review detection. We propose developing an initial research paper for investigating fake reviews by using sentiment analysis. Ten research papers are identified that show fake reviews, and they discuss currently available solutions for predicting or detecting fake reviews. They also show the distribution of fake and truthful reviews through the analysis of sentiment. We summarize and compare previous studies related to fake reviews. We highlight the most significant challenges in the sentiment evaluation process and demonstrate that there is a significant impact on sentiment scores used to identify fake feedback.

translated by 谷歌翻译

Urdu Speech and Text Based Sentiment Analyzer

Waqar Ahmad , Maryam Edalati

分类：自然语言处理

2022-07-19

发现别人认为是我们信息收集策略的关键方面。现在，人们可以积极利用信息技术来寻找和理解他人的想法，这要归功于越来越多的意见资源（例如在线评论网站和个人博客）的越来越多。由于其在理解人们的意见方面的关键功能，因此情感分析（SA）是一项至关重要的任务。另一方面，现有的研究主要集中在英语上，只有少量研究专门研究低资源语言。对于情感分析，这项工作根据用户评估提供了一个新的多级乌尔都语数据集。高音扬声器网站用于获取乌尔都语数据集。我们提出的数据集包括10,000项评论，这些评论已被人类专家精心归类为两类：正面，负面。这项研究的主要目的是构建一个手动注释的数据集进行乌尔都语情绪分析，并确定基线结果。采用了五种不同的词典和规则的算法，包括NaiveBayes，Stanza，TextBlob，Vader和Flair，实验结果表明，其精度为70％的天赋优于其他经过测试的算法。

translated by 谷歌翻译

Cross-Domain Consumer Review Analysis

Aditya Pandey , Kunal Joshi

分类：机器学习

2022-12-23

The paper presents a cross-domain review analysis on four popular review datasets: Amazon, Yelp, Steam, IMDb. The analysis is performed using Hadoop and Spark, which allows for efficient and scalable processing of large datasets. By examining close to 12 million reviews from these four online forums, we hope to uncover interesting trends in sales and customer sentiment over the years. Our analysis will include a study of the number of reviews and their distribution over time, as well as an examination of the relationship between various review attributes such as upvotes, creation time, rating, and sentiment. By comparing the reviews across different domains, we hope to gain insight into the factors that drive customer satisfaction and engagement in different product categories.

translated by 谷歌翻译

Sentiment analysis on electricity twitter posts

Pardeep Kaur , Maryam Edalati

分类：自然语言处理

2022-06-10

在当今的世界中，每个人都以某种方式表现出来，而该项目的重点是人们使用Twitter的数据（一个微博平台）的数据，人们对英国和印度的电价上涨的看法，人们在该平台上发布了消息，人们发布了消息，称为Tweets。因为许多人的收入不好，他们必须缴纳如此多的税款和账单，因此如今，维持房屋已成为有争议的问题。尽管政府提供了补贴计划来补偿人们的电费，但不受人们的欢迎。在这个项目中，目的是对Twitter上表达的人们的表达和观点进行情感分析。为了掌握电价的意见，有必要对能源市场的政府和消费者进行情感分析。此外，这些媒体上存在的文本本质上是非结构化的，因此要处理它们，我们首先需要预处理数据。有很多功能提取技术，例如单词袋，tf-idf（术语频率为单位的文档频率），单词嵌入，基于NLP的功能，例如Word Count。在该项目中，我们分析了特征TF-IDF单词级别对情感分析数据集的影响。我们发现，通过使用TF-IDF单词级别的性能分析的表现比使用N-Gram功能高3-4。使用四种分类算法进行分析，包括幼稚的贝叶斯，决策树，随机森林和逻辑回归，并考虑F评分，准确性，精度和召回性能参数。

translated by 谷歌翻译

Mining and summarizing customer reviews

分类：

Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in hundreds or even thousands. This makes it difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. For the manufacturer, there are additional difficulties because many merchant sites may sell the same product and the manufacturer normally produces many kinds of products. In this research, we aim to mine and to summarize all the customer reviews of a product. This summarization task is different from traditional text summarization because we only mine the features of the product on which the customers have expressed their opinions and whether the opinions are positive or negative. We do not summarize the reviews by selecting a subset or rewrite some of the original sentences from the reviews to capture the main points as in the classic text summarization. Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using reviews of a number of products sold online demonstrate the effectiveness of the techniques.

translated by 谷歌翻译

A Comprehensive Review of Visual-Textual Sentiment Analysis from Social Media Networks

Israa Khalaf Salman Al-Tameemi , Mohammad-Reza Feizi-Derakhshi , Saeed Pashazadeh , Mohammad Asadpour

分类：自然语言处理 | 人工智能

2022-07-05

社交媒体网络已成为人们生活的重要方面，它是其思想，观点和情感的平台。因此，自动化情绪分析（SA）对于以其他信息来源无法识别人们的感受至关重要。对这些感觉的分析揭示了各种应用，包括品牌评估，YouTube电影评论和医疗保健应用。随着社交媒体的不断发展，人们以不同形式发布大量信息，包括文本，照片，音频和视频。因此，传统的SA算法已变得有限，因为它们不考虑其他方式的表现力。通过包括来自各种物质来源的此类特征，这些多模式数据流提供了新的机会，以优化基于文本的SA之外的预期结果。我们的研究重点是多模式SA的最前沿领域，该领域研究了社交媒体网络上发布的视觉和文本数据。许多人更有可能利用这些信息在这些平台上表达自己。为了作为这个快速增长的领域的学者资源，我们介绍了文本和视觉SA的全面概述，包括数据预处理，功能提取技术，情感基准数据集以及适合每个字段的多重分类方法的疗效。我们还简要介绍了最常用的数据融合策略，并提供了有关Visual Textual SA的现有研究的摘要。最后，我们重点介绍了最重大的挑战，并调查了一些重要的情感应用程序。

translated by 谷歌翻译

BanglaSarc: A Dataset for Sarcasm Detection

Tasnim Sakib Apon , Ramisa Anan , Elizabeth Antora Modhu , Arjun Suter , Ifrit Jamal Sneha , MD. Golam Rabiul Alam

分类：自然语言处理 | 人工智能

2022-09-27

作为世界上口语最广泛的语言之一，孟加拉国的使用在社交媒体世界中也在增加。讽刺是一种积极的陈述或言论，其基本的负面动机在当今的社交媒体平台中广泛使用。在过去的许多年中，英语的讽刺检测有了显着改善，但是有关孟加拉讽刺检测的情况仍然没有改变。结果，仍然很难识别孟加拉国中的讽刺，缺乏高质量的数据是主要因素。本文提出了Banglasarc，该数据集是专门为孟加拉文本数据讽刺检测的数据集。该数据集包含5112条评论/状态和从各种在线社交平台（例如Facebook，YouTube）以及一些在线博客中收集的内容。由于孟加拉语中分类评论的数据收集数量有限，因此该数据集将有助于确定讽刺的研究，认识到人们的情绪，检测到各种类型的孟加拉语表达式和其他领域。该数据集可在https://www.kaggle.com/datasets/sakibapon/banglasarc上公开获得。

translated by 谷歌翻译

Comparative Study of Sentiment Analysis for Multi-Sourced Social Media Platforms

Keshav Kapur , Rajitha Harikrishnan

分类：自然语言处理 | 人工智能

2022-12-09

There is a vast amount of data generated every second due to the rapidly growing technology in the current world. This area of research attempts to determine the feelings or opinions of people on social media posts. The dataset we used was a multi-source dataset from the comment section of various social networking sites like Twitter, Reddit, etc. Natural Language Processing Techniques were employed to perform sentiment analysis on the obtained dataset. In this paper, we provide a comparative analysis using techniques of lexicon-based, machine learning and deep learning approaches. The Machine Learning algorithm used in this work is Naive Bayes, the Lexicon-based approach used in this work is TextBlob, and the deep-learning algorithm used in this work is LSTM.

translated by 谷歌翻译

5-Star Hotel Customer Satisfaction Analysis Using Hybrid Methodology

Yongmin Yoo , Yeongjoon Park , Dongjin Lim , Deaho Seo

分类：人工智能 | 自然语言处理

2022-09-26

由于由于电晕病毒而迅速开发了非面对面服务，因此通过互联网（例如销售和保留）的商业正在迅速增长。消费者还会在网站上发布有关商品或服务的评论，建议或判断。消费者直接使用的审查数据为消费者提供了积极的反馈和良好的影响，例如创造业务价值。因此，从营销的角度来看，分析审核数据非常重要。我们的研究提出了一种通过审核数据来找到客户满意度因素的新方法。我们采用了一种方法来通过混合和使用数据挖掘技术来找到客户满意度的因素，这是一种大数据分析方法，而自然语言处理技术（我们的研究中）是一种语言处理方法。与过去对客户满意度进行的许多研究不同，我们的研究通过使用各种技术来对论文的新颖性。由于分析，我们的实验结果非常准确。

translated by 谷歌翻译

Can Machine Learning Tools Support the Identification of Sustainable Design Leads From Product Reviews? Opportunities and Challenges

Michael Saidani , Harrison Kim , Bernard Yannou

分类：机器学习 | 人工智能

2021-12-17

在线发布的产品评论数量越来越多的是设计师的金矿，通过捕捉客户的声音，并相应地改善这些产品，了解他们开发的产品。与此同时，产品设计和开发在创造更可持续的未来方面具有重要作用。随着自然语言处理领域的人工智能技术最近，该研究旨在开发一体化机器学习解决方案，以便自动从线产品评论获得可持续设计的洞察。在本文中，讨论了，说明了现有框架 - 包括Python库，软件包以及伯爵等最先进的算法的机会和挑战。这一贡献讨论了达成的机会和建立机器学习管道的挑战，以便从产品审查中获取有限性，以设计更可持续的产品，包括五个阶段，包括与解释的可持续性相关的审查可持续设计引导：数据收集，数据格式，模型培训，模型评估和模型部署。给出了可持续设计见解的例子，可提供退出产品审查采矿和加工。最后，提供了用于该领域未来研究的有希望的线，包括与可持续替代品的平行标准产品的案例研究，以比较客户价值的特征，并在优质的可持续设计引线中产生。

translated by 谷歌翻译

Attention-based Bidirectional LSTM for Deceptive Opinion Spam Classification

Ashish Salunkhe

分类：自然语言处理

2021-12-29

在线评论在电子商务中发挥重要作用进行决策。大部分人口做出了哪些地方，餐厅访问，以根据各自的平台发布的评论来购买的地方，从哪里购买。欺诈性审查或意见垃圾邮件被分类为一个不诚实或欺骗性的审查。产品或餐厅的肯定审查有助于吸引客户，从而导致销售额增加，而负面评论可能会妨碍餐厅或产品销售的进展，从而导致令人害羞的声誉和损失。欺诈性评论是故意发布的各种在线审查平台，以欺骗客户购买，访问或分散产品或餐厅的注意力。它们也被编写或诋毁产品的辩护。该工作旨在检测和分类审查作为欺骗性或真实性。它涉及使用各种深入学习技术来分类审查和概述涉及基于人的双向LSTM的提出的方法，以解决与基线机器学习技术的评论和比较研究中的语义信息有关的问题，以进行审查分类。

translated by 谷歌翻译

Natural Language Processing in Customer Service: A Systematic Review

Malak Mashaabi , Areej Alotaibi , Hala Qudaih , Raghad Alnashwan , Hend Al-Khalifa

分类：自然语言处理 | 人工智能

2022-12-16

Artificial intelligence and natural language processing (NLP) are increasingly being used in customer service to interact with users and answer their questions. The goal of this systematic review is to examine existing research on the use of NLP technology in customer service, including the research domain, applications, datasets used, and evaluation methods. The review also looks at the future direction of the field and any significant limitations. The review covers the time period from 2015 to 2022 and includes papers from five major scientific databases. Chatbots and question-answering systems were found to be used in 10 main fields, with the most common use in general, social networking, and e-commerce areas. Twitter was the second most commonly used dataset, with most research also using their own original datasets. Accuracy, precision, recall, and F1 were the most common evaluation methods. Future work aims to improve the performance and understanding of user behavior and emotions, and address limitations such as the volume, diversity, and quality of datasets. This review includes research on different spoken languages and models and techniques.

translated by 谷歌翻译

Happy or grumpy? A Machine Learning Approach to Analyze the Sentiment of Airline Passengers' Tweets

Shengyang Wu , Yi Gao

分类：自然语言处理

2022-09-28

作为最广泛的社交网络服务之一，Twitter截至2022年，Twitter拥有超过3亿个活跃用户。在其许多功能中，Twitter现在是消费者分享他们对产品或体验的看法的首选平台之一商业航空公司提供的服务。这项研究旨在通过分析使用机器学习方法提及航空公司的推文的观点来衡量客户满意度。从Twitter的API检索相关推文，并通过令牌化和矢量化处理。之后，这些处理后的向量被传递到预训练的机器学习分类器中以预测情感。除了情感分析外，我们还对收集的推文执行词汇分析，以模拟关键字的频率，这些频率提供了有意义的上下文以促进情感的解释。然后，我们应用时间序列方法，例如鲍林（Bollinger）频段来检测情绪数据中的异常。使用从2022年1月到7月的历史记录，我们的方法被证明能够捕捉乘客情绪的突然变化。这项研究有可能发展为可以帮助航空公司以及其他几家面向客户的企业的应用程序，有效地检测到客户情绪的突然变化，并采取足够的措施来抵消他们。

translated by 谷歌翻译

Classifying YouTube Comments Based on Sentiment and Type of Sentence

Rhitabrat Pokharel , Dixit Bhatta

分类：机器学习

2021-10-31

随着YouTube频道的增长，每个视频都可以收集从观众提供直接反馈的巨大评论。这些评论是了解观众期望和改善渠道参与的主要手段。但是，评论仅代表了关于频道和内容的用户意见的一般集合。许多评论构造不当，微不足道，拼写不当和语法错误。因此，确定最有利息的评论是一个繁琐的工作。在本文中，我们根据情绪和句子类型提取并将原始评论分类为不同类别，这将帮助您能够帮助您查找相关评论以越来越多的信息。现有的研究在文本语料库上的相同句子类型（例如，问题类型）中的情感分析（正面和负数）或分类的分类集中在一起。这些对非传统文本语料库有限，如Youtube评论。我们使用众所周知的统计测量和机器学习模型来解决YouTube评论的文本提取和分类的这一挑战。我们使用交叉验证和$ F_1 $分数评估统计测量和机器学习模型的每个组合。结果表明，我们在分类任务中融入了传统方法的方法，验证其在辅助内容创作者中的潜力会增加他们的渠道的观看者参与。

translated by 谷歌翻译

Turkish Sentiment Analysis Using Machine Learning Methods: Application on Online Food Order Site Reviews

Özlem Aktaş , Berkay Coşkuner , İlker Soner

分类：自然语言处理 | 机器学习

2022-01-11

满意度测量，在今天的每个部门都出现，是许多公司的一个非常重要的因素。在本研究中，旨在通过使用yemek Sepeti的数据和该数据的变化来达到各种机器学习算法的最高精度率。每种算法的精度值都与所使用的各种自然语言处理方法一起计算。在计算这些精度值时，尝试优化使用的算法的参数。在本研究中培训的模型可以在未标记的数据上使用，并且可以在衡量客户满意度时给公司一个想法。观察到施加的3种不同的自然语言处理方法导致大部分开发模型中的大约5％的精度增加。

translated by 谷歌翻译

The impact of Twitter on political influence on the choice of a running mate: Social Network Analysis and Semantic Analysis -- A Review

Immaculate Wanza , Irad Kamuti , David Gichohi , Kinyua Gikunda

分类：自然语言处理

2022-07-31

在这个新的社交媒体时代，社交网络正在成为互联网上用户生成内容的越来越重要的来源。这些信息资源包括许多人的感受，意见，反馈和评论，对大型企业，市场，政治，新闻和许多其他领域都非常有用。政治是目前社交媒体网络上最受关注和流行的主题之一。许多政客使用Twitter等微博客服务，因为他们在这些网络上有大量的追随者和支持者。政客，政党，政治组织和基金会使用社交媒体网络提前与公民进行交流。如今，社交媒体已被成千上万的政治团体和政客使用。在这些社交媒体网络上，每个政治家和政党都有数百万追随者，而政客们发现了新的创新方式来敦促个人参与政治。此外，社交媒体通过提供建议，例如根据以前的经验制定政策和策略，推荐并选择适合特定选区的候选人，建议在党中的特定职位，并选择合适的候选人，并选择合适的候选人，并选择合适的候选人，并选择合适的候选人，以及基于对各种问题和争议的公民情绪发起政治运动。这项研究是关于在Twitter平台上使用社交网络分析（SNA）和语义分析（SA）的综述，以研究政治领导者的支持者网络，因为它可以在预测其政治未来时有助于决策。

translated by 谷歌翻译

A Comparative Study of Classical and Quantum Machine Learning Models for Sentimental Analysis

Diksha Sharma , Parvinder Singh , Atul Kumar

分类：机器学习

2022-09-12

我们分析和分类从电影评论构建的文本数据的观点。为此，我们使用量子机学习算法的基于内核的方法。为了组合量子内核，我们使用使用不同Pauli旋转门组合构造的电路，其中旋转参数是从文本数据获得的数据点的经典非线性函数。为了分析提出的模型的性能，我们使用决策树，增强分类器以及经典和量子支持向量机分析量子模型。我们的结果表明，就所有评估指标而言，量子内核模型或量子支持向量机优于用于分析的所有其他算法。与经典的支持向量机相比，量子支持向量机也会带来明显更好的结果，即使功能数量增加或尺寸增加。结果清楚地表明，如果功能的数量为$ 15 $，则使用量子支持向量机使用量子支持向量机的精度分数提高了$ 9.4 \％$，而经典支持向量机则将其提高。

translated by 谷歌翻译

A CNN-LSTM-based hybrid deep learning approach to detect sentiment polarities on Monkeypox tweets

Krishna Kumar Mohbey , Gaurav Meena , Sunil Kumar , K Lokesh

分类：计算机视觉 | 人工智能 | 机器学习

2022-08-25

人们最近开始通过社交网站上用户生成的多媒体材料来传达自己的思想和观点。此信息可以是图像，文本，视频或音频。近年来，这种模式的发生频率有所增加。 Twitter是最广泛使用的社交媒体网站之一，它也是最好的地点之一，可以使人们对与蒙基波疾病有关的事件有一种了解。这是因为Twitter上的推文被缩短并经常更新，这两者都促成了平台的角色。这项研究的基本目标是对人们对这种情况的存在的各种反应进行更深入的理解。这项研究重点是找出个人对猴蛋白酶疾病的看法，该疾病介绍了基于CNN和LSTM的混合技术。我们已经考虑了用户推文的所有三个可能的极性：正，负和中立。使用CNN和LSTM构建的架构来确定预测模型的准确性。推荐模型的准确性在Monkeypox Tweet数据集上为94％。其他性能指标（例如准确性，召回和F1得分）也用于测试我们的模型和最大程度和资源有效的方式。然后将发现与更传统的机器学习方法进行比较。这项研究的发现有助于提高对普通人群中蒙基托感染的认识。

translated by 谷歌翻译

Using Online Customer Reviews to Classify, Predict, and Learn about Domestic Robot Failures

Shanee Honig , Alon Bartal , Yisrael Parmet , Tal Oron-Gilad

分类：机器人

2022-01-10

关于哪些类型的故障机器人在家庭环境中以及这些失败如何影响客户体验时，存在知识差距。我们在亚马逊上分类了10,072个客户评论，通过它们中描述的机器人失败，将故障分组为十二种类型和三类（技术，互动和服务）。我们确定了先前忽略了文献中忽视的失败的来源和类型，将它们结合到更新的失败分类。我们分析了他们的频率和关系与客户明星评级。结果表明，对于功利主义国内机器人来说，技术故障比互动或服务失败更有害。常常报告任务完成和鲁棒性和恢复力的问题，并具有最大的负面影响。未来的预防和反应战略应解决机器人的技术能力，以满足功能目标，运作和保持结构完整性随着时间的推移。可用性和互动设计对客户体验不利，表明客户可能更宽容影响机器人和实际用途的影响的失败。此外，我们开发了一种能够预测客户审查是否包含描述故障的内容以及它描述的故障类型的自然语言处理模型。借鉴了这些知识，机器人系统的设计者和研究人员可以优先考虑设计和开发努力实现基本问题。

translated by 谷歌翻译