Cluster Analysis of Heterogeneous Rank Data:: File Format: PDF/Adobe Acrobat - View as HTMLor concepts according to their personal order of prefer- Common practice in the analysis of hetero- geneous rank data is to delete partial rankings, and http://www.machinelearning.org/proceedings/icml2007/papers/341.pdfHOME | I have some data (hypothetically let's say that it is consumer data)
where consumer preferences for 20 items are ordered 1-20. That is
that a group of consumers likes product 1 best and product 20 the
least (out of 20 products).
I also have data from questionnnaires where the consumers rate
specific attributes of the products. For example, they may rate a
product 1 for color (like very much), 5 for functionality (like OK)
and 10 for reliability (poor). I have data on about 40 attributes for
each item.
What I would like to determine is the weighting (importance) that each
factor had on the consumers' overall evaluation of the products. I am
not interested in individual consumers, but in the group overall. For
instance, I would like to be able to say that color accounts for
approximately 30% of their choice, functionality 10%, reliability 0%,
etc.
I would like to know how to analyze this data and need references to
the technique used.
Thank you.
The difference between my problem and a typical regression model is
that my dependent variable is a rank position rather than a value.
The problem looks something like this:
Y = a * X1 + b * X2 + c * X3 ...
where
Y is the rank position(1-20)
X1, X2 .. are independent variables, such as consumer perceptions of
reliability, quality, etc.
a, b, c ... are the weights that need to be determined.
Sample data looks something like this:
Rank(Y) Quality Reliability Performance
1 1 7 2
2 3 3 1
3 4 4 5
4 4 2 3
.
.
20 10 9 10
For this data it is possible to see that quality is weighted most
heavily, followed by performance and reliabilty doesn't get much
weight.
Can this data be analyzed to determine the weights of the independent
variable? If so, please provide the method and a reference. Covariation assessment in rank order data:: with rank order data. In both studies, subjects were given sets of rank Level, Order), one between-subject (Data Type) mixed analysis of variance. http://doi.wiley.com/10.1002/bdm.3960010404HOME | [R] OT : sensible analysis of censored rank data:: [R] OT : sensible analysis of censored rank data I have censored rank order data .. electors have been asked to rank the 4 most important issues out of http://stat.ethz.ch/pipermail/r-help/2002-November/026687.htmlHOME |
Hi chrissandvig!
The only way I can think of to address this question would involve
using a regression-like technique (not a "typical" regression model),
but also would involve the following:
- You have 20 products, and several people that tried each of them. I
think then that you should run 20 different regressions, one for each
product. Then you would see how the attributes affect each product
(allowing each product to be affected differently by their attributes)
- You will get a coefficient (beta) for each attribute. The
interpretation of this coefficient will be "for every extra 'point'
assigned to this attribute, the probability that this product will be
ranked 1st, 2nd, ..., 20th will vary in such and such way (function of
beta)". This is different from the interpretation you want, in which
you would like to know that some attribute accounts for some
"percentage of the choice".
If this is acceptable to you, I'll go on with the answer. If it isn't,
I'll leave it to some other Researcher who might come up with a better
method to answer your question. Please tell me what you think about
this.
Best wishes!
elmarto
Dear elmarto,
Thank you for your thoughts on this problem. The approach that you
have suggested may work in some situations. Unfortunately I don't
think that it will work on my data for two reasons:
1. I have only one ranking for the items rather than one for each
consumer. The data may be aggregated from multiple consumers or it may
be ranked via an algorithm that produces the same results every time.
2. I need to use a recognized method of analysis.
cs
p.s. I apologize that it took me a few days to get back to you - I was
away on vacation. Does Logarithm Transformation of Microarray Data Affect Ranking :: Does Logarithm Transformation of Microarray Data Affect Ranking Order of A common practice in microarray analysis is to transform the microarray raw http://adsabs.harvard.edu/abs/2006q.bio..6018LHOME |
Given the large number of categories in your ordinal outcome scale,
OLS (typical) regression is likely to give very similar to appropriate
answers. Some software will not treat variables with that many
categories as ordinal, even if you tell it to.
Alternatively, you could transform your outcome so as to normalise it.
You could avoid the problem of predicting a rank of < 1 or > 20, but
you would still have predicted ranks which were not integers.
If you wanted to use a rank based technique, this would be possible,
but the interpretation of this kind of model is much more difficult.
What software (if any) are you planning to use for this analysis.
jeremymiles-ga
I would prefer to use SPSS but also have access to SAS.
You could use the SPSS ordinal logistic regression function (Analyze,
Regression, Ordinal).
jeremymiles-ga
Thanks Jeremy - I'll look into the ordinal logistic regression function.
Chris Emerald FullText Article : Ranking of handicaps in international :: There are different handicaps in the international financial analysis and . ( rank order data, paired comparative data or successive categories data). http://www.emeraldinsight.com/Insight/viewContentItem.do?contentType=Article&hdAction=lnkhtml&contentId=868854HOME |
Regression analysis is not suitable for your data
since it assumes a linear relationship between the
variables. As you dependent variable is rank, there
is no basis for such an expectation. For the same reason,
that variable does not has normal distribution.
This is just rephrasing what you have already said.
Technique which determines which independent variables
are important is called Factor Analysis
http://www.statsoftinc.com/textbook/stfacan.html
You may also consider transforming the rank data
into 'preference rating' by assuming some (standard)
distribution of preferences. You then can use conventional
techniques, such as ANOVA or step-wise regression.
http://www.wikipedia.org/wiki/Analysis_of_variance
Here is a paper which describes the conversion of rank
into 'preferences' in some detail:
http://www.ats.ucla.edu/stat/stata/faq/prank.htm
Please, do ask for clarification if needed and,
please, do in such a case indicate level of mathematics
you are comfortable with
so I can tailor the explanation to your needs.
SEARCH TERMS
z-score,
rank data
Factor analysis
hedgie
Get Smart About Monitoring Virtual Machines
Microsoft Gets Ex-Streamly Cozy with U.K.'s MediaWave
|