From Reddit to Insights: Building an AI-Powered Data Pipeline with Gemini (Cloud)
Introduction
Purpose
In this blog post, I document how I built an AI-driven cloud data pipeline that automates tracking discussions in AI-related subreddits. Using Google’s Gemini AI, the pipeline collects, processes, and synthesizes those discussions into structured daily reports. The system is designed to filter out irrelevant or harmful content so that the extracted insights are both meaningful and actionable.
Check out the project GitHub repository for the full code and detailed documentation, as well as the web application.
From Reddit to Insights: Building an AI-Powered Data Pipeline with Gemini (On-Prem)
Introduction
Purpose
In this blog post, I document how I built an AI-driven on-premises data pipeline that automates tracking discussions in AI-related subreddits. Using Google’s Gemini AI, the pipeline collects, processes, and synthesizes those discussions into structured daily reports. The system is designed to filter out irrelevant or harmful content so that the extracted insights are both meaningful and actionable.
Check out the project GitHub repository for the full code and detailed documentation, as well as the web application.
RecSys Challenge 2024: Model Selection
Introduction
Purpose
This article will cover model selection in the RecSys Challenge 2024. The content will be structured into the following sections:
- News RecSys
- Model Selection
For more in-depth analysis, please check out the notebook!
Recommendation Systems: Overview
Introduction
Purpose
This article will cover an overview of recommendation systems. The content will be structured to answer the following questions:
- What is RecSys?
- What is the importance of Recommender Systems?
- What are the main challenges faced in Recommender Systems?
- What are the common evaluation metrics utilized in Recommender Systems?
- What are the various methods employed in Recommendation Systems?
- How has neural network architecture been utilized in Recommender Systems?
RecSys Challenge 2024: Exploratory Data Analysis
Introduction
Purpose
This article will cover the exploratory data analysis of the RecSys 2024 Challenge dataset. The content will be structured into the following sections:
- Data Preprocessing
- Functions
  - Plot Functions
  - Feature Functions
- Feature Analysis
  - Overall Feature Analysis
  - Article
  - User
  - Session
  - Topic
  - Devices
  - Age
For more in-depth analysis, please check out the notebook!