Scrape Trustpilot Reviews with DeepSeek, Analyze Sentiment with OpenAI
This workflow can automatically crawl user reviews of specified companies from the Trustpilot website, extract key information from the reviews, and perform sentiment analysis. Using the DeepSeek model, it accurately retrieves multidimensional information such as the reviewer's name, rating, date, and more. It then utilizes OpenAI to classify the sentiment of the reviews, achieving automatic collection and intelligent analysis of review data. Finally, the data is synchronized and updated to Google Sheets, providing strong support for brand management, market research, and customer service.
Tags
Workflow Name
Scrape Trustpilot Reviews with DeepSeek, Analyze Sentiment with OpenAI
Key Features and Highlights
This workflow automatically scrapes user reviews of specified companies from the Trustpilot website. It leverages the DeepSeek model to accurately extract key information from each review—such as author, rating, date, title, content, user country, and the total number of reviews posted by the user. Subsequently, it employs OpenAI to perform sentiment analysis, categorizing the review sentiment as positive, neutral, or negative. Finally, the structured review data and sentiment classification results are synchronized and updated in Google Sheets, enabling automated collection and intelligent analysis of review data.
Core Problems Addressed
- Automates the collection of large volumes of Trustpilot user reviews, eliminating the tedious manual process of searching and copy-pasting reviews one by one.
- Precisely extracts multidimensional information from reviews to ensure data completeness and accuracy.
- Utilizes AI-driven sentiment analysis to quickly gain insights into customer feedback sentiment trends, helping businesses understand customer satisfaction and identify potential issues.
- Enables real-time data updates and storage for convenient subsequent statistical analysis and business decision-making.
Application Scenarios
- Brand Reputation Management: Automatically monitor and analyze customer reviews to promptly detect negative feedback.
- Market Research: Gather user review data for competitors or own products to support product optimization.
- Customer Service: Prioritize handling of negative reviews based on sentiment classification to improve customer satisfaction.
- Data Reporting: Provide management with structured, dynamically updated customer feedback reports.
Main Workflow Steps
- Manually trigger the workflow start.
- Set the target company name and the maximum number of review pages to scrape.
- Use HTTP requests to paginate and fetch review pages for the specified company on Trustpilot.
- Extract the list of review links using HTML nodes.
- Split and limit the number of review links as needed.
- Fetch detailed HTML content for each individual review.
- Invoke the DeepSeek model to extract information such as review author, rating, date, title, body, user country, and total review count.
- Use the OpenAI model to perform sentiment analysis on the review body, classifying it as positive, neutral, or negative.
- Query Google Sheets to check if the review already exists to avoid duplicate storage.
- Append or update new review data and sentiment results into Google Sheets.
Involved Systems and Services
- Trustpilot (review data source)
- DeepSeek (information extraction AI model)
- OpenAI (sentiment analysis AI model)
- Google Sheets (data storage and management)
- n8n Automation Platform (workflow orchestration)
Target Users and Value
- Marketing professionals and brand managers: Easily grasp authentic customer feedback to optimize brand image.
- Data analysts and product managers: Obtain structured user review data to support product decision-making.
- Customer service teams: Quickly identify negative reviews to enhance customer experience.
- Automation enthusiasts and developers: Implement complex scraping plus AI analysis automated workflows to save labor costs.
This workflow combines web scraping with advanced AI natural language processing technologies to efficiently and intelligently manage and analyze Trustpilot user reviews, making it a powerful tool for data-driven business operations.
Extract & Summarize Bing Copilot Search Results with Gemini AI and Bright Data
This workflow automatically scrapes Bing Copilot's search results through the Bright Data API and utilizes the Google Gemini AI model for structured data extraction and content summarization. It addresses the issue of disorganized traditional search result data, enhancing information utilization efficiency. Users can quickly obtain search information related to keywords, aiding in market research, competitive intelligence analysis, and content creation. Ultimately, the processed results are pushed via Webhook, facilitating subsequent integration and automation.
Brand Content Extract, Summarize & Sentiment Analysis with Bright Data
This workflow utilizes advanced web scraping and artificial intelligence technologies to automatically capture, extract text, generate summaries, and perform sentiment analysis on the content of specified brand webpages. By overcoming web scraping restrictions, it enables real-time access to high-quality content, systematically analyzes consumer attitudes towards the brand, and provides clear text summaries and sentiment classifications. It is suitable for brand monitoring, market research, and user feedback processing, helping relevant personnel quickly gain deep insights and optimize decisions and strategies.
Remove PII from CSV Files (Automated Personal Information Masking for CSV Files)
This workflow automatically monitors a Google Drive folder for new CSV files, and once a new file is detected, it initiates the process. It utilizes OpenAI to intelligently identify personally identifiable information (PII) columns and automatically removes this sensitive data, generating a de-identified file and re-uploading it to the designated folder. The entire process is efficient, intelligent, and requires no manual intervention, effectively reducing the risk of data breaches, making it suitable for businesses and teams that need to process privacy data in bulk.
Google Page Entity Extraction Template
This workflow utilizes the Google Natural Language API to automatically extract named entities such as people, organizations, and locations from any webpage, enabling structured analysis of information. Users submit the webpage URL via a webhook, and the system automatically fetches the content and performs entity recognition, returning detailed entity information along with its importance score. This tool is particularly suitable for scenarios such as media monitoring, market research, and data integration, significantly enhancing the efficiency and accuracy of information processing and helping users quickly obtain key data.
Extract Text from PDF and Images Using Vertex AI (Gemini) into CSV
This workflow can automatically extract text from newly uploaded PDF files and images in a specified Google Drive folder, using Google Vertex AI and Openrouter AI for intelligent recognition and analysis. The extracted transaction data will be converted into a CSV file with classification information and automatically uploaded back to Google Drive, thereby streamlining the manual data entry and classification process, improving the efficiency and accuracy of data processing, and making it suitable for various scenarios such as financial management and data analysis.
Calculate the Centroid of a Set of Vectors
This workflow can automatically receive and process multiple vectors, ensuring the consistency of input data dimensions. It calculates the centroid of these vectors, which is the average value across all dimensions, and returns the results in a user-friendly format. It effectively addresses common issues in multidimensional data processing and is applicable in fields such as data analysis, machine learning, and geographic information systems, enhancing the automation and accuracy of data processing.
AI Agent Conversational Assistant for Supabase/PostgreSQL Database
This workflow builds an intelligent dialogue assistant that combines natural language processing with database management, allowing users to query and analyze data using natural language without needing to master SQL skills. It can dynamically generate SQL queries, retrieve database table structures, process JSON data, and provide clear and understandable feedback on query results. This tool significantly lowers the barrier to database operations and is suitable for scenarios such as internal data analysis, customer service, product support, and education and training, enhancing the convenience and efficiency of data querying.
Spot Workplace Discrimination Patterns with AI
This workflow automates the scraping and analysis of employee review data from Glassdoor, utilizing AI technology to deeply analyze company ratings and the differences in workplace experiences among various demographic groups. It calculates statistical indicators and generates visual charts. It helps HR and management quantify workplace discrimination, supports fair improvement measures, promotes organizational culture enhancement and inclusivity assessments, and enables the effective implementation of data-driven diversity, equity, and inclusion initiatives.