๐ŸŽต DAA Experimental Study

DiverseTune
Recommendation Algorithm Diversity Analysis

Comparing Greedy, Content Filtering-Based, and Graph DPP Rerank recommendation algorithms to evaluate which promotes the highest diversity and fairness in music recommendations.

0 Tracks Analyzed
0 Algorithms
0 Experiment Runs
0 Music Genres
Explore Results

Three Algorithms, One Question

Which algorithm best balances relevance, diversity, and fairness?

Greedy

Popularity-Based

Ranks all songs by popularity score and selects the top K=10. Simple but heavily biased toward mainstream hits.

Low Diversity High Bias Fast
Time: O(n log n) Space: O(n)

Content Filtering

Content Filtering

Computes cosine content_filtering between liked songs and candidates. Returns top K=10 most similar tracks. Better relevance, but echo chamber risk.

Moderate Diversity Relevant O(nยฒ)
Time: O(nยทmยทd) Space: O(n)

Key Metrics at a Glance

Averaged results over 8 independent experiment runs with random seed songs

Greedy

Popularity-Based
ILD (Diversity) 0.0347
Popularity Concentration Index 0.0070
Avg Popularity 95.90
Niche Songs 0.00%

Content Filtering

Content Filtering
ILD (Diversity) 0.0064
Popularity Concentration Index 0.1632
Avg Popularity 48.69
Niche Songs 26.25%
Experiment Output
Algorithm: Greedy (Popularity)
  ILD (Diversity):     0.0347
  Popularity Concentration Index:          0.0070
  Avg Popularity:      95.90
  Niche Songs (%):     0.00%

Algorithm: Content Filtering (CF)
  ILD (Diversity):     0.0064
  Popularity Concentration Index:          0.1632
  Avg Popularity:      48.69
  Niche Songs (%):     26.25%

Algorithm: Graph DPP Rerank (DPP)
  ILD (Diversity):     0.0187
  Popularity Concentration Index:          0.2156
  Avg Popularity:      46.64
  Niche Songs (%):     33.75%

โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€
๐Ÿ† Highest Diversity (ILD):  Greedy (Popularity)  (0.0347)
๐Ÿ† Best Niche Coverage:      Graph DPP Rerank (DPP)         (33.75%)
๐Ÿ† Lowest Popularity Bias:   Graph DPP Rerank (DPP)         (46.64)
โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€

Experiment Visualizations

Interactive charts generated from 8 experiment runs across all algorithms

๐Ÿ“Š Diversity Comparison (ILD)

Higher ILD = more diverse recommendations

โš–๏ธ Fairness โ€” Popularity Concentration Index

Lower Popularity Concentration = more equitable popularity distribution

๐Ÿ”ฅ Average Popularity Score

Measures mainstream bias in recommendations

๐Ÿ“ˆ Diversity vs Popularity Trade-off

Dual-axis view showing how diversity relates to popularity bias

๐ŸŒฑ Niche Song Representation

Percentage of recommended songs with popularity < 40

Algorithm Comparison Table

Complete metric breakdown across all evaluation dimensions

Metric Greedy
(Popularity)
Content Filtering
(Content Filtering)
Graph DPP Rerank
(Content Filtering + DPP)
Best
ILD (Diversity) โ†‘ 0.0347 0.0064 0.0187 Greedy
Popularity Concentration Index โ†“ 0.0070 0.1632 0.2156 Greedy
Avg Popularity 95.90 48.69 46.64 Graph DPP Rerank
Niche Songs % โ†‘ 0.00% 26.25% 33.75% Graph DPP Rerank
Popularity Bias Very High Medium Low Graph DPP Rerank
Genre Coverage 2 genres 3-4 genres 5+ genres Graph DPP Rerank
Fairness Constraint โŒ None โŒ None โœ… โ‰ฅ20% niche Graph DPP Rerank

Conclusions & Key Takeaways

๐Ÿ”ฌ

Key Finding

The Greedy algorithm achieves the highest raw ILD diversity (0.0347) because it selects from a global pool of popular songs spanning different genres. However, it recommends zero niche songs and has extreme popularity bias (avg 95.9).

๐Ÿ’ก

Critical Insight

Content Filtering-based filtering creates an echo chamber effect with the lowest ILD (0.0064), confirming that pure content filtering traps users in narrow taste profiles. However, it naturally discovers some niche content (26.25%).

๐Ÿ†

Best Overall: Graph DPP Rerank

The Graph DPP Rerank algorithm achieves the best balance: competitive diversity (ILD 0.0187), lowest popularity bias (46.64), and highest niche representation (33.75%). The DPP re-ranking combined with the fairness constraint creates the most equitable recommendations.

๐Ÿ“Œ

Practical Implication

Traditional recommendation systems (Greedy, CF) inherently reduce diversity. Graph DPP Rerank-based systems significantly improve diversity and fairness in music recommendations, promoting independent artists and reducing filter bubbles.

๐Ÿ“ Metrics Formulas

Intra-List Diversity (ILD)

ILD = 1|L|(|L|โˆ’1) โˆ‘iโ‰ j (1 โˆ’ sim(i,j))

Where sim(i,j) = cosine content_filtering between feature vectors

Popularity Concentration Index

G = โˆ‘i (2i โˆ’ n โˆ’ 1) ยท xin ยท โˆ‘ xi

Applied to popularity distribution of recommended songs

DPP Kernel

Lij = qi ยท qj ยท Sij

Quality-weighted content_filtering kernel for determinantal point process