HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Clustering Urdu News Using Headlines

{Kamran Malik Faisal Bukhari Waheed Iqbal Samia Khaliq}

Abstract

This paper that proposes and evaluates a new algorithm to automatically cluster Urdu news from different news agencies. The task is challenging because there are no language processing libraries for the Urdu language. The authors' experimental dataset consists of news from famous Pakistani media houses, including Jang, BBC Urdu, Express, UrduPoint, and Voice of America Urdu (VOA). The proposed algorithm only uses headlines to cluster the news. The authors argue that news headlines provide a concise summary of the news, which motivates them to use it instead of using the entire news story. Their experimental evaluation shows micro and macro averages for precision of 0.45 and 0.48 respectively for identifying similar news using headlines.

Benchmarks

BenchmarkMethodologyMetrics
text-clustering-on-urdu-news-headlinesVector Space Model
Related Headlines: 85

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Clustering Urdu News Using Headlines | Papers | HyperAI