Thursday, November 17, 2011

Big summary about matching algorithms

Big summary about matching algorithms for the Online Dating Industry.

The Online Dating Industry needs innovations but they will come from only one source: the latest discoveries in theories of romantic relationships development with commitment.
I) Several studies showing contraceptive pills users make different mate choices, on average, compared to non-users.
II) People often report partner preferences that are not compatible with their choices in real life.(FORGET Behavioural recommender systems or other system that learns your preferences)
III) Compatibility is all about a high level on personality* similarity* between prospective mates for long term mating with commitment.
*personality measured with a normative test.
*similarity: there are different ways to calculate similarity, it depends on how mathematically is defined.

Matching algorithms do not need to be complex or sophisticated, they need to be effective.

When you evaluate a matching algorithm for the Online Dating Industry, you FIRST need to estimate its resources consumption, the power calculation it requires.

In compatibility matching methods there are 2 steps:
1) to objectively measure personality traits or other human variables (with the 16PF5 test) without distortion.
2) to calculate compatibility between prospective mates

IPSATIVE personality tests are OBSOLETE (self-descriptive questionnaires) and should be discarded: MBTI (PerfectMatch), DISC (ThomasKnowsPeople), Enneagram (Dopasowani) or other proprietary ipsative tests like the ones used at Chemistry, ButterfliesAgain, Doskonalapara, PembePanjur, LittleHint, oneGoodLove and others.

NORMATIVE personality tests like Big5 versions used at PlentyOfFish, eHarmony, Meetic Affinity, Parship, Be2, True are good for orientative purposes but not good enough for predictive purposes.

eHarmony, True, PerfectMatch, MeeticAffinity, Parship, Be2, PlentyOfFish, Chemistry and others are in the range of 3 to 4 persons compatible per 1,000 persons screened. Any member of those sites receives on average 3 to 4 prospective mates as compatible for dating purposes per 1,000 (one thousand) members screened in the database. So in a 10,000,000 persons database, any member will see 30,000 to 40,000 members as highly compatible; 30,000 persons is the population of an average small city. Any person can achieve 3 to 4 persons as highly compatible per 1,000 persons screened, searching by his/her own or by mutual filtering methods!

eHarmony, True, PerfectMatch, MeeticAffinity, Parship, Be2, PlentyOfFish, Chemistry and others have a low effectiveness/efficiency level of their matching algorithms.
The Online Dating Industry does not need a 10% improvement, a 50% improvement or a 100% improvement. It does need "a 100 times better improvement"

LIFEPROJECT METHOD is in the range of 3 persons compatible per 100,000 persons screened, 100 times better than any other.

"Because the Big Five groups the more specific primary-level factors, feedback organized around the five Global Factor scales is more easily understood. For detailed feedback or predictive purposes, one should assess the more specific primary factors. Research has shown that more specific factors like the primary scales of the 16PF Questionnaire predict actual behavior better than the Big 5 Global Factors. For example, one extravert (a bold, fearless, high-energy type) may differ considerably from another (a sweet, warm, sensitive type), depending on the extraversion-related primary scale score patterns, so deeper analysis is typically warranted." Extracted from the 16PF5 Manual

Normative tests can not simply be translated, because you need the norm for that test, and that norm is actualized each and every time Census Figures are released

The 16PF5 test is available in
- English for the United States and the Norm for the United States (sample of individuals with the same demographic characteristics of the United States).
- English for Canada and the Norm for Canada (sample of individuals with the same demographic characteristics of Canada)
- English for the United Kingdom and the Norm for the United Kingdom (sample of individuals with the same demographic characteristics of the United Kingdom).
- English for Australia and the Norm for Australia (sample of individuals with the same demographic characteristics of Australia).
- French for France and the Norm for France.
- German for Germany and the Norm for Germany.
- Spanish for Spain and the Norm for Spain.
- Italian for Italy and the Norm for Italy.
and many more "With over 60 years of research and application behind it, the 16PF5 has become internationally well known and respected, with over 20 different translated versions."

The output of the 16PF5 test are 16 variables STens (Standard Tens) taking integer values from 1 to 10. STens divide the score scale into ten units.
STens have the advantage that they enable results to be thought of in terms of bands of scores, rather than absolute raw scores. These bands are narrow enough to distinguish statistically significant differences between candidates, but wide enough not to over emphasize minor differences between candidates.

Similarity is a word that has different meanings for different persons or companies, it exactly depends on how mathematically is defined. I calculate similarity in personality patterns with (a proprietary) pattern recognition by correlation method. It takes into account the score and the trend to score of any pattern.

LIFEPROJECT METHOD is like the "Teller Ulam design" for the Online Dating Industry.
In this case 100 times more powerful than actual matching algorithms.

Breaking "the online dating sound barrier" is to achieve at least:
3 most compatible persons in a 100,000 persons database.
12 most compatible persons in a 1,000,000 persons database.
48 most compatible persons in a 10,000,000 persons database.
100 times better than Compatibility Matching Algorithms used by actual online dating sites!

The only way to achieve that is:
- using the 16PF5 normative personality test, available in different languages to assess personality of members, or a proprietary test with exactly the same traits of the 16PF5. The ensemble of the 16PF5 is: 10E16, big number as All World Population is nearly 7.0 * 10E9 (estimated OCT 2011)
- expressing compatibility with eight decimals, like The pattern 6.7.6.8.9.6.7.7.8.7.2.5.8.7.3.4 is 92.55033557%  +/- 0.00000001% similar to the pattern 7.7.6.8.8.7.6.5.8.7.4.5.7.7.3.4
Using a quantized pattern comparison method (part of pattern recognition by cross-correlation) to calculate similarity between prospective mates.

That is the only way to revolutionize the Online Dating Industry.

All other proposals are .............. NOISE

See How LIFEPROJECT METHOD calculates similarity between quantized patterns using an adapted quantum mechanics math equation. All other methods are RUBBISH, because the ensemble of the 16PF5 is: 10E16, big number as All World Population is nearly 7.0 * 10E9 (estimated OCT 2011)

(7.0 * 10E9) / 10E16 == 7.0 * 10E(-7) or 0.7 * 10E(-6) or 0.7 micro part!

All World Population is less than 0.7 micro part of the 16PF5's ensemble.

e.g.the 16PF5 Brazilian version, in Portuguese for Brazil and the Norm for Brazil (sample of individuals with the same demographic characteristics of Brazil).

Brazil population is in the range of 190 million persons, 190 * 10E6 == 1.9 * 10E8.
demographic characteristics of Brazil 47.73% White, 43.13% Brown (Multiracial), 7.61% Black, 1.09% Asian, 0.43% Amerindian
(1.9 * 10E8)/ 10E16 == 1.9 * 10E(-8) or 0.019 * 10E(-6) or 0.019 micro part!

Brazil population is 0.019 micro part of the 16PF5's ensemble. You can not use simple regression equations to calculate similarity between quantized patterns because:
* women will "see" men as all the same.
* men will "see" women as all the same.

Do you want to know how to solve than problem?

No comments:

Post a Comment