You are currently browsing the archives for the Society category.
| M | T | W | T | F | S | S |
|---|---|---|---|---|---|---|
| « Apr | ||||||
| 1 | 2 | 3 | 4 | |||
| 5 | 6 | 7 | 8 | 9 | 10 | 11 |
| 12 | 13 | 14 | 15 | 16 | 17 | 18 |
| 19 | 20 | 21 | 22 | 23 | 24 | 25 |
| 26 | 27 | 28 | 29 | 30 | 31 | |
- Advertising (1)
- Artificial Intelligence (AI) (8)
- Coding / Programming (6)
- Cryptography (1)
- Data Mining (10)
- ewrt linux (2)
- Fixing Stuff (5)
- Machine Learning (18)
- Math (1)
- Politics (2)
- Psychology (3)
- Ramblings (18)
- Random (6)
- Security (11)
- Society (9)
- Sociology (3)
- spam (2)
- Statistics (9)
- April 21, 2008 1:38 am: ART OF SEDUCTION: Not Pretty, Really
- March 25, 2008 2:25 am: "Internal Server Error" when converting phpBB v2 to phpBB v3
- March 6, 2008 1:29 am: Firewire and DRM
- February 28, 2008 10:46 pm: Using Psychological Domain Knowledge for the Netflix Challenge
- February 12, 2008 1:24 am: VPN Tunels from within VMWare (Windows XP and GRE weirdness)
- February 2, 2008 5:59 pm: License Key Copy Protection
- January 8, 2008 8:34 pm: Registering Domains with Network Solutions
- January 7, 2008 10:22 pm: Joe-job ...
- December 11, 2007 1:37 am: Back from NIPS 2007
- November 24, 2007 1:03 am: GMail Logout Strangeness
Blogroll
Useful Links
Archive for the Society Category
ART OF SEDUCTION: Not Pretty, Really
April 21, 2008 1:38 am by Markus.
Pretty interesting short-film: http://www.youtube.com/watch?v=bd4Gpi9ksXw
Posted in Society | Print | No Comments »
The GPL and Machine Learning Software - Should the GPL cover training data?
October 1, 2007 10:19 pm by Markus.
I’ve followed the discussion and introduction of the GPL v3 for a bit. One major change in the license is supposed to close the loophole commonly referred to as the “tivoization” of GPL software, i.e. mechanisms that prevent people from tinkering with the product they bought which includes GPL software. Tivo, in particular, accomplishes this by requiring a valid cryptographic signature for the code to run - the user has access to the code, but it’s of no use. One of the main ideas of the GPL was to allow people the freedom to tinker, improve and understand how something works. This got me thinking a bit about software that uses machine learning techniques.
For the sake of the argument, let’s assume that somebody releases a GPL version of a speech recognition system, or say an improved version of a GPL speech recognition system. While the algorithms would be in the open for everyone to see, two major components of speech recognition systems, the Acoustic Model and Language Model, do not have to be. The Acoustic Model is created by taking a very large number of audio recordings of speech and their transcriptions (Speech Corpus) and ‘compiling’ them into statistical representations of the sounds that make up each word. The Language Model is a very large file containing the probabilities of certain sequences of words in order to narrow down the search for what was said.
A big part of how well the speech recognition system works relies on the training. The author who improved upon the software should publish the training set as well. Otherwise people won’t be able to tinker with the system or understand why the software works well.
The same would hold for things like a handwriting recognition system. One could publish it along with a model (a bunch of numbers) that make the recognition work. It would be pretty hard for somebody to reverse engineer what the training examples were and how training was conducted to make the system work. Getting the training data is the expensive part in building speech-recognition and handwriting-recognition systems.
Think Spam-Assassin - what if the authors suddenly decide to not make their training corpus available anymore? How would users be able to determine the weights for the different tests?
I don’t think this case is covered by the GPL (v3 or older) - (However, I’m not a lawyer). Somebody could include the model in C code (i.e. define those weights for a Neural Net as a bunch of consts) and then argue that all is included to compile the program from scratch as per the requirements of the license. However, the source by itself wouldn’t allow anybody to understand or change (in a meaningful way) what the program is doing. With the growing importance of machine learning methods just being able to recompile something won’t be enough. I think this should be taken into consideration by the open source community for GPL v3.01.
Posted in Society, Machine Learning | Print | No Comments »
Text Mining for Tax Evaders on eBay
June 12, 2007 3:32 pm by Markus.
A (long) while ago there was a lot of talk about the IRS doing text-mining for people that did not report income they got from e.g. ebay internet auctions. The German version of this attempt is called XPIDER (see also here and there) was just shown to be totally ineffective. After years of trying and spending millions they did not identify conclusive evidence to go after a single tax-evader. The German GAO (Bundesrechnungshof) is trying to find out what went wrong, why and who misspend all that money. I wonder if the more sophisticated web-crawling program the Canadian IRS is using (it’s called XENON as reported by Wired) is similarly effective …
Posted in Society, Data Mining | Print | No Comments »
Copyright law
April 9, 2007 11:20 pm by Markus.
I found one most insightful piece on copyright law on youtube using the Amen break as an example.
Posted in Society, Ramblings | Print | No Comments »
Artificial Intelligence Cited for Unlicensed Practice of Law
March 8, 2007 5:52 pm by Markus.
I just read an article in the Wired blog titled “AI Cited for Unlicensed Practice of Law” citing a ruling from a court upholding its decision that the owner through the expert system he developed has given unlicensed legal advise. While an expert system is a clear cut case (as the system always does exactly what it was told [minus errors in the rules]; it just follows given rules and makes logical conclusions), this becomes more interesting in cases in which the machine learns or otherwise modifies its behavior over time. For example, lets say I put an AI software online that interacts with people and learns over time. Should I be held responsible if the program does something bad? What if I was not the person that taught it that particular behavior? This will probably be a topic that the courts will have to figure out in the future. For one, people should not be able to hide behind actions their computer has done. But what if it is reasonably beyond the capability of the individual to forsee what the AI has done?
This will probably end up being the next big challenge for courts just like the internet has been. It is interesting how the internet has created legal problems just with people being able to communicate more easily with each other: think trademark issues, advertising restrictions for tobacco or copyright violations (fair use differs from country to country; what is legal in one might be illegal in another) …
Update: And it just started. Check out this article: Colorado Woman Sues To Hold Web Crawlers To Contracts
Posted in Society, Artificial Intelligence (AI), Ramblings | Print | No Comments »
“I’m sorry, but I’m married …”
March 5, 2007 8:52 pm by Markus.
As I enjoy going out with friends and mingling a lot, I noticed a very interesting trend lately. Some of my single friends will go and chat up some woman they find attractive (and sometimes with success) and if the woman is not interested, she will tend to show them a ring on her finger and tell them that she unfortunately is taken. So far, so good. I was out with some friends of mine. We were just talking and observed some gentleman asking a woman out right of the bat. She showed her ring and politely turned the guy down. Chris ended up talking to this lovely lady later - we all met as part of some group going out, and Chris and she had a longer conversation. After a long conversation, lots of laughter she excused herself for a minute. Once she came back from the bathroom, her ring was gone. He didn’t notice it at first, but as she suddenly became a lot more flirty, he asked her about it straight up. Her answer: “It’s a fake ring. Just to keep guys in bars from hitting on me.” They talked for a bit more about this and that, and she started hinting more and more that she would be very interested in going on a date with Chris. He thought about it, and walked away. Chris told me later that he just does not like to date woman that lie; trust and honesty are important to him. His reasoning was that if she is willing to lie (or: use little “white lies”) right from the start, how can one expect that it does not get worse over time? What if she uses little white lies to get out of every situation she does not want to be in?
What do we learn from this? A fake wedding ring might keep all the drunk loosers away, but will possibly confuse or scare off Mr. Right when he comes along. I’ve heard similar stories from a couple of other guys; it just does not go over well.
Posted in Society, Ramblings | Print | No Comments »
Online Dating
January 3, 2007 8:20 am by Markus.
As I’m currently visiting Germany over the winter break, I couldn’t help but notice the advertising for an online dating website here. They spend a lot of money to get this stuff into peoples heads here. I’ve seen some of that stuff advertised in the US (such as match.com and TRUE) so for the hell of it I went and checked out the website. First thing I noticed is that they require you to create an account to see peoples pictures or browse more than a couple of pages in the search results. That, of course, leads to many many stale profiles from people that just want to window-shop and are not really interested in giving it a serious try. To interested parties (i.e. people that pay) this of course might look like there are so many members on the website that it might be worth paying for.
It just helps add to my impression after reading about Bad Experiences with canceling accounts, which gives a not-so-honorable mention to certain US based dating websites. Apparently you can’t just cancel your membership using the website, but have to take a phone-exit interview. Otherwise, your profile will be kept and your credit card will be continue to be charged. It seems that dating websites are forced to keep people active as long as possible (or at least keep up the illusion). The reason for this might be less mean-spirited than one would at first assume. For example, just to have a couple of thousand people in each major city of the US a dating website would have to have roughly 100.000 active members. That is tough to accomplish, esp. given that without the illusion of activity nobody else would join.
With all that said, a friend of mine found his girlfriend through the Denver Personals on Craigslist. It can work.
Posted in Society, Statistics, Ramblings, Random | Print | 4 Comments »
Ensemble Predictors and Democracy
November 15, 2006 9:15 pm by Markus.
I just read an interesting article about how society is usually described in science fiction. Turns out that in all circumstances it is about a very hierarchical, military like structure. There are no suggestions as to how a civilian society might work in the future. Consider things like Star Trek where a bunch of officers command a star ship around and the rest of the people just function. The captain is smart, benevolent and there is rarely an abuse of power. No democracy, no voting, little about how the civilian society of the future might function. There are things like Futarchy, but that’s pretty much all I could find in a quick search (and it wasn’t proposed in a SciFi-novel so it can’t be any good
). One of the problems with Democracy might be that people don’t always make the right decision as they don’t have access to all the information or are easily swayed by bad arguments (e.g. negative ads - some of them are just factually wrong). My point is that there haven’t been that many viable alternatives proposed, not even some crazy, outlandish suggestions (think teleportation for means of transport) to give people some new ways to think about this.
There is an interesting book out there called The Wisdom of Crowds. It proposes that large crowds of people can be capable of making decissions better than individuals. Long story short, according to the book four key qualities are necessary to make a crowd smart. The crowd needs to be diverse, so that people are bringing different pieces of information to the table. It must not have somebody at the top dictating the crowd’s answer, and summarize people’s opinions into one collective verdict. The people in the crowd need to be independent, so that they pay attention mostly to their own information, and not worrying about what everyone around them thinks (i.e. being immune to persuasion concepts like social proof).
Random Forests in machine learning are an ensemble method that has very good classification performance. The way it works is that hundreds of decision trees are build, but each on a different training set and with a different choice of features. If all the classifiers are strong (i.e. not able to make perfect predictions, but they tend to do the right thing - they take the information they have and make independent decisions) , then the overall vote of all the trees in the ensemble will tend to minimize the misclassification error. Breiman gave a mathematical proof of why this minimizes the classification error (i.e. bad decisions).
I wonder if something like this might work for political decision making. Leaving problems like corruption and other human fallacies (e.g. looking at what others are doing) aside for a moment and assuming that for all things there are good arguments to be made for and against a bill, a senators vote would depend on how he or she weights the particular arguments for and against the bill. If we assume that senators tend to vote for what they perceive to be the right thing, would giving each senator a random subset of information make the overall senate vote for the “right thing”? Another idea would be to make a political decision, similar to jury duty, by picking a large number of people from the general population at random and have them decide on a particular issue.
Edit:I found some criticism of the Wisdom-of-Crowds theory, such as Wikipedia not being accurate enough or a democracy electing people like Hitler. A good question in both cases would be if people made their decisions independently in these cases or not. I think that independent decisions are difficult to achieve in practice. Also one has to wonder how robust this system is due to the assumption that everybody makes the best decission they can.
Posted in Society, Politics, Machine Learning, Sociology | Print | No Comments »
Perception of Beauty / Advertising
November 9, 2006 5:18 pm by Markus.
I just found this video on YouTube. Check it out. All I can say is “wow”.
Edit: A parody of the video is very funny…
Posted in Society, Sociology, Ramblings | Print | No Comments »