Clustering and classification

Hi,
I am new to using PYMC and am seeking help with a problem. I have a dataset that includes environmental variables(Temperature, Relative humidity, clothing level, air velocity) and thermal comfort votes (response variable) from 100 different occupants, each of whom voted 100 times. The thermal comfort votes have three classes: warm, cold, and neutral. I am trying to find clusters of occupants with similar thermal preferences (based on environmental variable and their vote), and then use these clusters to develop classification models for predicting their comfort status. I have used multinomial classification, but I am unsure of how to proceed with finding the clusters and determining the number of clusters. Any assistance would be greatly appreciated.

1 Like

Hi,

from my experience here it is easier for people to help you out if you share what you have tried so far including the code/results.

1 Like