Julio Sotelo

PCA, Singular Value Decomposition & Association Rules implementation in Python

I experiment with Principal Component Analysis (PCA) as a dimensionality reduction approach to assist in clustering high-dimensional data. For this I use an image segmentation data set. After clustering the image data, I measure the completeness and homogeneity of the generated clusters. The data set used in this problem is based on the Image Segmentation data set at the UCI Machine Learning Repository.
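The PCA, clustering and scoring steps described above can be sketched roughly as follows. This is a minimal illustration, not the code from the linked implementation: it assumes NumPy only, uses synthetic blob data in place of the Image Segmentation data set, and the helpers `pca_reduce`, `kmeans` and `homogeneity_completeness` are hypothetical names written for this example. Homogeneity and completeness follow the usual entropy-based definitions, 1 - H(C|K)/H(C) and 1 - H(K|C)/H(K).

```python
import numpy as np

def pca_reduce(X, n_components):
    """Project X onto its top principal components using SVD."""
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by singular value.
    _, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
    explained = (S ** 2) / np.sum(S ** 2)
    return X_centered @ Vt[:n_components].T, explained[:n_components]

def kmeans(X, k, n_iter=50, seed=0):
    """Very small k-means (Lloyd's algorithm), for illustration only."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels

def entropy(a):
    """H(a) in nats for an integer label array."""
    _, counts = np.unique(a, return_counts=True)
    p = counts / len(a)
    return -np.sum(p * np.log(p))

def conditional_entropy(a, b):
    """H(a | b) in nats for two integer label arrays."""
    h = 0.0
    for value in np.unique(b):
        mask = b == value
        _, counts = np.unique(a[mask], return_counts=True)
        p = counts / mask.sum()
        h -= (mask.sum() / len(a)) * np.sum(p * np.log(p))
    return h

def homogeneity_completeness(classes, clusters):
    """Return (homogeneity, completeness) of a clustering."""
    h_c, h_k = entropy(classes), entropy(clusters)
    hom = 1.0 if h_c == 0 else 1.0 - conditional_entropy(classes, clusters) / h_c
    com = 1.0 if h_k == 0 else 1.0 - conditional_entropy(clusters, classes) / h_k
    return hom, com

# Synthetic stand-in data (assumption): 3 classes, 10 features each.
rng = np.random.default_rng(42)
true_labels = np.repeat(np.arange(3), 50)
blob_centers = rng.normal(scale=10.0, size=(3, 10))
X = blob_centers[true_labels] + rng.normal(size=(150, 10))

X_reduced, explained = pca_reduce(X, n_components=2)
cluster_labels = kmeans(X_reduced, k=3)
homogeneity, completeness = homogeneity_completeness(true_labels, cluster_labels)
print("variance captured:", explained.sum())
print("homogeneity:", homogeneity, "completeness:", completeness)
```

In practice a library such as scikit-learn offers `PCA`, `KMeans`, `homogeneity_score` and `completeness_score`, which would replace the hand-written helpers here.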

In addition, I perform association rule mining using a modified version of the Apriori implementation provided in Chapter 11 of the textbook Machine Learning in Action. It has been modified to compute lift values for rules in addition to confidence.
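To make the difference between confidence and lift concrete, here is a small sketch. It is not the modified Apriori code from the book or from the linked implementation; the functions `support` and `rule_metrics` and the toy transactions are assumptions made for this example. For a rule A -> C, confidence is support(A and C) / support(A), and lift is that confidence divided by support(C).

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def rule_metrics(transactions, antecedent, consequent):
    """Return (confidence, lift) for the rule antecedent -> consequent."""
    sup_a = support(transactions, antecedent)
    sup_c = support(transactions, consequent)
    sup_both = support(transactions, set(antecedent) | set(consequent))
    confidence = sup_both / sup_a
    # Lift > 1 means the consequent is more likely when the antecedent is present.
    lift = confidence / sup_c
    return confidence, lift

# Toy transactions (assumption), not data from the post.
transactions = [frozenset(t) for t in [
    {"milk", "bread"},
    {"milk", "diapers", "beer"},
    {"bread", "diapers", "beer"},
    {"milk", "bread", "diapers"},
    {"milk", "bread", "beer"},
]]

confidence, lift = rule_metrics(transactions, {"diapers"}, {"beer"})
print("confidence:", confidence, "lift:", lift)
```

For this toy data the rule diapers -> beer has confidence 2/3 and lift 10/9, so beer is only slightly more likely in baskets that contain diapers than in baskets overall.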

The final experiment is item-based recommendation for a joke recommender system, using a modified version of the item-based recommender algorithm from Chapter 14 of the textbook mentioned above. I apply it to joke ratings data based on the Jester Online Joke Recommender System.
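The idea behind item-based recommendation can be sketched as follows. This is a simplified illustration in the spirit of the approach, not the modified algorithm from the linked implementation. It assumes NumPy, a tiny made-up ratings matrix rather than the Jester data, a value of 0 meaning "not rated", and cosine similarity rescaled to the range 0 to 1. The names `cosine_sim`, `estimate_rating` and `recommend` are hypothetical.

```python
import numpy as np

def cosine_sim(a, b):
    """Cosine similarity rescaled from [-1, 1] to [0, 1]."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return 0.0 if denom == 0 else 0.5 + 0.5 * float(a @ b) / denom

def estimate_rating(ratings, user, item):
    """Similarity-weighted average of the user's ratings on other items."""
    sim_total, weighted_total = 0.0, 0.0
    for j in range(ratings.shape[1]):
        user_rating = ratings[user, j]
        if user_rating == 0 or j == item:
            continue
        # Compare the two items only over users who rated both of them.
        overlap = np.nonzero((ratings[:, item] != 0) & (ratings[:, j] != 0))[0]
        if len(overlap) == 0:
            continue
        sim = cosine_sim(ratings[overlap, item], ratings[overlap, j])
        sim_total += sim
        weighted_total += sim * user_rating
    return 0.0 if sim_total == 0 else weighted_total / sim_total

def recommend(ratings, user, n=3):
    """Return up to n (item, estimated rating) pairs for unrated items."""
    unrated = np.nonzero(ratings[user] == 0)[0]
    scores = [(int(j), estimate_rating(ratings, user, j)) for j in unrated]
    return sorted(scores, key=lambda s: s[1], reverse=True)[:n]

# Rows are users, columns are jokes; 0 means "not rated" (assumption).
ratings = np.array([
    [5, 4, 0, 1],
    [4, 5, 1, 0],
    [1, 0, 5, 4],
    [0, 1, 4, 5],
    [5, 0, 0, 1],
], dtype=float)

print(recommend(ratings, user=4, n=2))
```

Here the last user liked joke 0 and disliked joke 3, so joke 1, which other users rate similarly to joke 0, ranks ahead of joke 2. Jester ratings are on a continuous scale that includes negative values, so a real run would need a different marker for missing ratings than 0.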

See implementation

By Julio
Association Rule, Classification, Machine Learning, PCA, Python, SVD
