INCONSISTENCY: Rating The Rating Systems

By Lewis Perdue on October 29, 2014 in Current System Paradigms, Featured

How you ask a question can determine the answer you get.

The concept lies at the heart of journalists, scientists, those who interview job candidates, police detectives, opinion pollsters … and wine ratings.

In many settings — especially those where the vino-cognoscenti gather — the question, “What did you think of that wine?,” will frequently elicit a number between 70 and 100.

THE IRRELEVANCY OF THE 100-POINT SCALE

Numerous scholarly articles as well as those in the general media have pointed out that, even if the 100-point scale were a good numerical system, the actual ratings are so inconsistent among the best experts that the numbers expounded are mostly worthless. (Wine-quality scores mostly random, fail to be repeatable.)

Scores on the same wine by the same taster frequently differ from time to time: How close are repeated wine-quality scores?

One of the most thorough round-ups of rating flaws was recently published in The Guardian: “Wine-tasting: it’s junk science” which notes that, “Every year Robert Hodgson selects the finest wines from his small California winery and puts them into competitions around the state.

“And in most years, the results are surprisingly inconsistent: some whites rated as gold medalists in one contest do badly in another. Reds adored by some panels are dismissed by others. Over the decades Hodgson, a softly spoken retired oceanographer, became curious. Judging wines is by its nature subjective, but the awards appeared to be handed out at random.”

Hodgson’s most recent study (as are his oprevious ones) was published in the Cambridge University Press’s scholarly publication, Journal of Wine Economics An Examination of Judge Reliability at a major U.S. Wine Competition

Also see:

New York Times: Wine Ratings Might Not Pass the Sobriety Test
Wall Street Journal: A hint of hype, a taste of illusion
What is the purpose of wine criticism? (A thorough recitation and example of why reviews can be interesting reading, but fail dramatically as a recommendation source.)

100 POINTS FOR PSYCHOLOGICAL BAGGAGE.

The 100-point wine-rating scale also carries psychological baggage from school where below 70 is an F and 100 is an A+.

Those rating/grade associations solidify the false perception that a rating is an objective, unassailable judgement on quality.

Beyond that, what is the “meaning” of an 83 versus 85 or 87? Is the “meaning” biased by personal expectations? Grades in school or performance at work?? What a wine “deserves?”

ALTERNATIVES TO 100 POINTS HAVE THEIR OWN PROBLEMS

Alternative scales — the substitution of 10 or 5 point scales as well as stars and other icons — have found widespread acceptance in wine as well as in other consumer products. But they also have psychological biases, possibilities for misinterpretation (“what does it mean?”) .

Some eesearch shows that when confronted with an odd-numbered scale, respondents tend to cluster around the neutral point with a bias to the positive. This reflects potential anxiety over extreme positions and a tendency to avoid those.

This means that a middle point offers no guidance in in a “buy versus not buy” situation, the

On the other hand, bias may occur at the top or bottom of the scale depending upon context and whether or not the rater psychologically wants to reward or punish the product or company. See: The Problems with 5-Star Rating Systems, and How to Fix Them

This link explores the pros and cons of even- versus odd-numbered scales.The text, below, is excerpted from that page.

Disadvantages of odd-numbered (Thumbs up/down) scales

People may be less discriminating in response (respondents don’t take time to carefully consider all of the various response categories)

May not be collecting accurate responses (the mid-point can mean different things to different people)

Advantages of even-numbered scales

People may be more discriminating, be more thoughtful

Eliminates possible misinterpretation of mid-point

The biggest problem with rating systems…

…is that they have too many biases for too many unknown reasons to be accurately used for recommendations especially for products “of taste” such as wine, books, and movies. That is why Netflix and other savvy companies have moved to “big data” which tries to make recommendations on the basis of some measurable action such as watching a movie, buying a book or bottle of wine.

This moves the process into the realm of the current reigning paradigm: collaborative filtering.

The problem with big data collaborative filtering

PSYCHOLOGY: Anxiety, Stress and Social Pressure Sabotage Choice

MISINTERPRETATION: Words = Big Trouble

About Recommendation Insights

Amazon’s recommendations are amazingly lame! And Here’s why.

Nothing I like better than getting a recommendation for something […]

This is why people HATE recommendations and most always DON’T click on them (and yet …)

I bought this on June 21… Then they emailed me […]

Why ratings and reviews fail (A beginning of understanding)

The promise (and pitfalls) of current recommendation engines (collaborative filtering) […]

Retronasal perception of odors is often overlooked as a key metric

. 2012 Nov 5;107(4):484-7. doi: 10.1016/j.physbeh.2012.03.001. Epub 2012 Mar 8. […]

News

Amazon’s recommendations are amazingly lame! And Here’s why.

August 4, 2021

Nothing I like better than getting a recommendation for something I just bought. Happens all the time with Amazon Simple logic: IF bought recently, THEN omit from recommendations If the AMZN recommender had a double-digit IQ, it would recognize that I buy this hand lotion periodically and send me a […]

This is why people HATE recommendations and most always DON’T click on them (and yet …)

June 29, 2021

I bought this on June 21… Then they emailed me this recommendation on June 29! WTF? No wonder … And yet…

Why ratings and reviews fail (A beginning of understanding)

February 18, 2021

The promise (and pitfalls) of current recommendation engines (collaborative filtering) THE PROBLEM: Welcome to the Vino Casino WINE IS HARD TO LIKE Wine is hard to like Can I learn to like wine? Wine is too hard to like How Science Saved Me from Pretending to Love Wine PROFILING INCOMPATIBILITY: […]

Retronasal perception of odors is often overlooked as a key metric

July 9, 2020

. 2012 Nov 5;107(4):484-7. doi: 10.1016/j.physbeh.2012.03.001. Epub 2012 Mar 8. Retronasal Perception of Odors Viola Bojanowski 1 , Thomas Hummel Affiliations PMID: 22425641 DOI: 10.1016/j.physbeh.2012.03.001 Abstract We perceive odors orthonasally during sniffing; in contrast, we perceive odors retronasally during eating when they enter the nose through the pharynx. There are clear […]

Genetic science shows why wine reviews, expert suggestions, and taste profiles miss the target for recommendations

June 23, 2020

Genetic science shows that wine descriptions and flavor profiling are way off base when it comes to helping people find wines that they will like. This is because the odds are very, very small for two people to experience flavors in exactly the same way. This is because every individual […]

View all

Fatal Flaws In Current Recommendation Systems

Amazon’s recommendations are amazingly lame! And Here’s why.

Nothing I like better than getting a recommendation for something I just bought. Happens all the time with Amazon Simple logic: IF bought recently, THEN […]

This is why people HATE recommendations and most always DON’T click on them (and yet …)

I bought this on June 21… Then they emailed me this recommendation on June 29! WTF? No wonder … And yet…

Why ratings and reviews fail (A beginning of understanding)

The promise (and pitfalls) of current recommendation engines (collaborative filtering) THE PROBLEM: Welcome to the Vino Casino WINE IS HARD TO LIKE Wine is hard […]

Retronasal perception of odors is often overlooked as a key metric

Genetic science shows why wine reviews, expert suggestions, and taste profiles miss the target for recommendations

Genetic science shows that wine descriptions and flavor profiling are way off base when it comes to helping people find wines that they will like. […]

Scientific research reveals why wine ratings and recommendations fail so often

Everyone has a different sense of smell. Taste is mostly determined by smell. “Each of the individuals examined had a unique genotypic [odor receptor] pattern.” […]

66% of wine drinkers find wine descriptions unhelpful in choosing wine. Only 9% rely on critics

Data from the 2013 Laithwaites Wines Survey, of 1,000 wine drinkers by polling firm, One Poll. (Laithwaites customers were excluded from the survey.) Those taking […]

The promise (and pitfalls) of current recommendation engines

NOTE: You can right-click all images to view a larger version. Left-click to go to data source. Value As arbitrary as they often seem, “35 […]

Mouth bacteria: one more reason that individuals’ taste perception differs (especially from sip & spit experts)

Study shows some wine aromatics aren’t released until they meet the bacteria in our mouths “At least 700 bacterial species live in our saliva, on […]

How sensory taste profiling stops short of individual recommendation accuracy

I continually shop for new wines like an average consumer. I do this to get put of the “wine bubble” that keeps reality at an […]

Beyond Genetics: How Saliva Affects Your Unique Way Of Experiencing Wine

Recommendation Insights has previously described the many ways that genetic variations determine any single individual’s wine taste experience: Inherited Taste Chaos Sabotages Recommendations. Source: Food […]

New Research Shows Why Wine Descriptions Don’t Help Consumers Select Wine

New research recently published in the journal Neurocomputing shows one more reason why consumers get little help in choosing wines from the descriptions in wine […]

Turning White Wine Into Red: Recommendation Failures & The Alchemy Of Context

“[N]o event or object is ever experienced in perfect, objective isolation. It is instead subject to our past experiences, our current mood, our expectations, and […]

Wine And Music Are A Lot Alike & So Are The Ways Their Recommendation Systems Fail

Wine and music: please our senses, touch our emotions, beg to be shared, are deeply engrained in how we define ourselves and, can determine how […]

How Predictive Big Data Fails

Recommendations from big data predictive methods are a lot like the use of epidemiology to determine the cause of a disease: Both rely on the […]

Recommendation News – Nov. 4, 2014

Wine Shopping in Boston Just Got a Lot Easier

Recommendation News – Nov. 3, 2014

Corkz Update Extends Global Reach of the Wine Information App Pandora for beer? Vivino, Club W Use Tech To Help Wine Lovers Find Favorites Damned […]

Recommendation News – Oct. 30, 2014

Wine Lovers Should Open Their Palates To Technology To Find Favorites Corkz Update Extends Global Reach of the Wine Information App

INCOMPATIBILITY: Profile Matching

A PERSONAL NOTE: My apologies in advance to a number of good friends and colleagues who are still trying to get wine profile systems to […]

Welcome to the Vino Casino

The Casino. (The House Always Wins) Buying wine is a lot like walking into a casino. You know the odds favor the house, but the […]

GENETICS: How Inherited Taste Sabotages Recommendations

See also this update: Genetic science shows why wine reviews and taste profiles miss the target for recommendations Some people hate the taste of raw […]

SCALING: Most Wines Have NEVER Been Rated By Critics

UPDATE, August 18, 2020: New data from a highy respected source: 116,000+ new wine products approved by TTB in past 12 months: bw166. Consumers […]

MISINTERPRETATION: Words = Big Trouble

Words can mean different thing to different people. What does “complex” mean? Or “balanced?” “Big?” “Olive-tinged black currant?” “I don’t know what you mean by […]