WWW07
Not many people seem to know about the WWW07 conference, so I’m sending them some love. Defined by the organizers as “the global event that brings together the key innovators, decision-makers, technologists, businesses, and standards bodies shaping the Web,” the conference has some interesting refereed papers and posters.
The Web space as of now is pretty crowed and it’s difficult to differentiate the wheat from the chaff. The next social network, or the next youtube clone isn’t very interesting. Skimming through these papers brings to light some of the harder problems in the field:
- spammers posting portions of existing comments with a few links changed.
- a website that has the authority in one area might start writing about other things. How do you assign a pagerank to this site?
- RSS aggregators clustering. No fun reading about five different takes on the same breaking news. Much like Google News.
- Google news itself. “collaborative filtering using MinHash clustering, Probabilistic Latent Semantic Indexing (PLSI), and covisitation counts.”
- mining and clustering other kinds of information (like chemical formulae)
- advertisements and click fraud
- personalization
- and a whole bunch of stuff on scalability, privacy, security and the semantic web
Also speaking at the event is Prabhakar Raghavan. I have previously heard him talk about the convergence of the social sciences and the web. Awesome speaker.