- Advertising (1)
- Artificial Intelligence (AI) (10)
- Classification (2)
- Clustering (1)
- Coding / Programming (7)
- Cryptography (1)
- Data Mining (15)
- ewrt linux (2)
- Fixing Stuff (5)
- Machine Learning (25)
- Math (1)
- Politics (3)
- Psychology (3)
- Ramblings (21)
- Random (7)
- Security (14)
- Society (10)
- Sociology (4)
- spam (2)
- Statistics (12)
- December 22, 2008 10:27 am: Credit Card companies adjusting Credit Scores
- December 2, 2008 9:09 pm: Kids, Games and Sociopaths
- November 21, 2008 12:21 am: Back from Conference of the American Society of Criminology (ASC 2008)
- November 1, 2008 9:48 pm: Deploying SAS code in production
- October 15, 2008 12:18 am: Photo-based CAPTCHAs
- September 28, 2008 10:27 pm: Computer Models and the Mortgage Crisis
- September 1, 2008 8:19 pm: Can statistical models be intellectual property?
- August 21, 2008 9:17 pm: Taxons, Taxometrics and the Number of Clusters
- August 14, 2008 11:13 pm: CAPTCHAs - Not dead
- August 1, 2008 10:25 pm: ISC on the Future of Anti-Virus Protection
Blogroll
Useful Links
- December 2008
- November 2008
- October 2008
- September 2008
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
- November 2006
- October 2006
- September 2006
- August 2006
Can statistical models be intellectual property?
Recently I had a fun discussion with Bill over lunch about intellectual property and how that might apply to statistical modeling work. Given that there are more and more companies making a living from forming predictions with a model they have built (churn-prediction, credit-scores and other risk-models) we were wondering if there were any means of protecting them as intellectual property. For example, the ZETA-model for predicting corporate bankruptcies is a closely guarded secret with having published only the variables being used (Altman E. I. (2000); Predicting financial distress for companies: revisiting the Z-Score and ZETA models). Obviously this model is useful for lending and can make serious money for the user. Making decisions guided by a formula is becoming more popular. This might be something over which legal battles will be fought in the future.
Copyrighted works and patents often count towards what a company would be worth should somebody acquire it. This means there would be motivation for start-up companies to protect their models. A mathematical formula (e.g. a regression equation) cannot be patented, and copyright probably won’t apply either; even if copyright would apply, it’s trivial to build a formula that does essentially the same thing (e.g. multiply all the weights in the formula by 10). This leaves only trade secret protection and means there is no recourse once the cat is out of the bag. Often it’s also the data-collection method that is kept secret - a company called Epagogix developed a method to judge the success of movies from a script by scoring it against some scales that they keep secret.
Currently, I don’t see any legal protections with the exception of trade-secrets for this. And given that there is infinitely many ways to express the same scoring rules in a different way, this would be a fairly hard problem for lawyers and politicians to formulate sensible rules for establishing protection for this kind of intellectual property.