Trustpilot Customer Review Intelligent Analysis Workflow
This workflow aims to automate the scraping of customer reviews for specified companies on Trustpilot, utilizing a vector database for efficient management and analysis. It employs the K-means clustering algorithm to identify review themes and applies a large language model for in-depth summarization. The final analysis results are exported to Google Sheets for easy sharing and decision-making within the team. This process significantly enhances the efficiency of customer review data processing, helping businesses quickly identify key themes and sentiment trends that matter to customers, thereby optimizing customer experience and product strategies.
Tags
Workflow Name
Trustpilot Customer Review Intelligent Analysis Workflow
Key Features and Highlights
This workflow automatically scrapes customer reviews of specified companies from Trustpilot, utilizes the Qdrant vector database to store and manage review data, applies the K-means clustering algorithm to automatically identify thematic clusters of reviews, and then leverages OpenAI’s large language model (LLM) to perform in-depth analysis and summarization of each cluster. Finally, the insights are exported to Google Sheets for easy team sharing and decision support.
Highlights include:
- Automated multi-page review data scraping and structured extraction
- Efficient similarity search and clustering analysis using a vector database
- Intelligent insight generation combining machine learning clustering with large language models
- Flexible time-range filtering supporting periodic or on-demand analysis
- Direct output to Google Sheets for convenient subsequent data processing and presentation
Core Problems Addressed
- Manual collection and analysis of large volumes of customer reviews is time-consuming and inefficient
- Review data is unstructured, making it difficult to summarize genuine customer feedback and pain points
- Inability to quickly identify trending topics and sentiment trends among customers
- Lack of automated tools to transform massive review data into actionable insights
Application Scenarios
- Brand and product management teams regularly monitoring customer satisfaction and feedback trends
- Market research departments analyzing competitors or industry reputation
- Customer service teams quickly identifying key complaints and improvement opportunities
- Data analysts building business reports driven by customer review data
- Any enterprise or individual seeking to extract structured insights from Trustpilot reviews
Main Process Steps
- Initialization: Configure target company ID and clear historical review data in Qdrant to ensure data freshness.
- Review Data Scraping: Automatically scrape the latest three pages of review HTML pages from Trustpilot via HTTP requests.
- Review Content Extraction: Parse HTML nodes to extract fields such as reviewer, rating, title, body, date, and country.
- Data Structuring: Organize extracted review information into array format for subsequent processing.
- Text Vectorization: Use OpenAI Embeddings API to convert review texts into vector representations.
- Vector Data Storage: Insert vectors and their metadata into the Qdrant vector database to support efficient similarity search.
- Trigger Analysis Sub-Workflow: Filter review vectors by time range and execute clustering and analysis processes.
- Clustering Analysis: Run K-means algorithm via Python code node to automatically group review vectors into multiple thematic clusters.
- Retrieve Cluster Review Details: Filter valid clusters (containing 3 or more reviews) and fetch corresponding review content from Qdrant.
- Intelligent Insight Generation: Use LLM to summarize each cluster’s reviews, generating insights, sentiment orientation, and improvement suggestions.
- Export Results: Write insights and original review counts into a Google Sheets document for easy viewing and sharing.
Involved Systems or Services
- Trustpilot: Source of customer review data.
- Qdrant: High-performance vector database for storing and querying review vectors.
- OpenAI API: Generates text embeddings and intelligent text summaries (GPT-4 model).
- Google Sheets: Stores final analysis results, supporting team collaboration and data sharing.
- n8n: Low-code automation platform used to build the entire data scraping, processing, and analysis workflow.
Target Users and Value
- Brand managers and marketers, aiding in understanding customer reputation and product feedback.
- Data analysts and business intelligence teams, enhancing the efficiency and depth of customer review analysis.
- Customer service leaders, quickly pinpointing major complaint areas and improvement directions.
- Any enterprise or individual user seeking to automate extraction of valuable insights from Trustpilot reviews.
This workflow significantly lowers the barrier to processing customer review data by transforming massive unstructured reviews into structured, thematic, and insight-rich information, empowering businesses to improve customer experience and product competitiveness.
Automated Workflow for Sentiment Analysis and Storage of Twitter and Form Content
This workflow automates the scraping and sentiment analysis of Twitter and external form content. It regularly monitors the latest tweets related to "strapi" or "n8n.io" and filters out unnecessary information. Using natural language processing technology, it intelligently assesses the sentiment of the text and automatically stores positively rated content in the Strapi content management system, enhancing data integration efficiency. It is suitable for brand reputation monitoring, market research, and customer relationship management, providing data support and high-quality content for decision-making.
Intelligent E-commerce Product Information Collection and Structured Processing Workflow
This workflow automates the collection and structured processing of e-commerce product information. By scraping the HTML content of specified web pages, it intelligently extracts key information such as product names, descriptions, ratings, number of reviews, and prices using an AI model. The data is then cleaned and structured, with the final results stored in Google Sheets. This process significantly enhances the efficiency and accuracy of data collection, making it suitable for market research, e-commerce operations, and data analysis scenarios.
My workflow 2
This workflow automatically fetches popular keywords and related information from Google Trends in the Italian region, filters out new trending keywords, and uses the jina.ai API to obtain relevant webpage content to generate summaries. Finally, the data is stored in Google Sheets as an editorial planning database. Through this process, users can efficiently monitor market dynamics, avoid missing important information, and enhance the accuracy and efficiency of keyword monitoring, making it suitable for content marketing, SEO optimization, and market analysis scenarios.
GitHub Stars Pagination Retrieval and Web Data Extraction Example Workflow
This workflow demonstrates how to automate the retrieval and processing of API data, specifically by making paginated requests to fetch the favorite projects of GitHub users. It supports automatic incrementing of page numbers, determining the end condition for data, and achieving complete data retrieval. Additionally, this process illustrates how to extract article titles from random Wikipedia pages, combining HTTP requests with HTML content extraction. It is suitable for scenarios that require batch scraping and processing of data from multiple sources, helping users efficiently build automated workflows.
Dashboard
The Dashboard workflow automatically fetches and integrates key metrics from multiple platforms such as Docker Hub, npm, GitHub, and Product Hunt, updating and displaying them in a customized dashboard in real-time. It addresses the issues of data fragmentation and delayed updates that developers face when managing open-source projects, enhancing the efficiency and accuracy of data retrieval. This workflow is suitable for open-source project maintainers, product managers, and others, helping them to comprehensively monitor project health, optimize decision-making, and manage community operations.
HubSpot Contact Data Pagination Retrieval and Integration
This workflow automates the pagination retrieval and integration of contact data through the HubSpot CRM API, simplifying the complexity of manually managing pagination logic. Users only need to manually trigger the process, and the system will loop through requests for all paginated data and consolidate it into a complete list. This process prevents data omissions and enhances the efficiency and accuracy of data retrieval, making it suitable for various scenarios such as marketing, customer management, and data analysis, helping businesses manage customer resources more effectively.
Bulk Upload Contacts Through CSV | Airtable Integration with Grid View Synchronization
This workflow automates the process of batch uploading contact data from a CSV file to Airtable. It supports real-time monitoring of newly uploaded files, automatically downloading and parsing the content. It can intelligently determine marketing campaign fields, batch create or update contact records, and update the upload status in real-time, ensuring efficient and accurate data management. This solution addresses the cumbersome and error-prone issues of manual imports, making it particularly suitable for marketing and sales teams.
Mock Data Transformation Workflow
This workflow focuses on generating and transforming simulated data, providing efficient data preprocessing capabilities. It splits the initial array-form simulated data into independent data items, facilitating subsequent processing and operations. It is suitable for testing and debugging during the process development phase, as well as for scenarios that require batch data processing. It can quickly address issues related to mismatched simulated data formats and item-by-item processing, enhancing the efficiency and flexibility of workflow design.