Not many people seem to know about the WWW07 conference, so I’m
sending it some love. Defined by the organizers as “the global event
that brings together the key innovators, decision-makers,
technologists, businesses, and standards bodies shaping the Web,” the
conference has some interesting refereed papers and posters.
The Web space right now is pretty crowded, and it’s difficult to
separate the wheat from the chaff. The next social network or
the next YouTube clone isn’t very interesting. Skimming through these
papers brings to light some of the harder problems in the field:
- spammers posting portions of existing comments with a few links
changed.
- a website that has authority in one area
might start writing about other things. How do you assign a PageRank
to this site?
- clustering in RSS aggregators. It’s no fun reading five different
takes on the same breaking news. Google News already groups stories this way.
- Google News itself: “collaborative filtering using MinHash
clustering, Probabilistic Latent Semantic Indexing (PLSI), and
covisitation counts.”
- mining and clustering other kinds of information (like chemical formulae)
- advertisements and click fraud
- personalization
- and a whole bunch of stuff on scalability, privacy, security and the
semantic web
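
To give a flavor of what MinHash does, here is a minimal sketch of how it could flag the copy-and-tweak comment spam from the first bullet. This is my own toy illustration, not code from any of the papers and certainly not how Google News does it: the function names, the three-word shingles, and the 128 hash functions are all assumptions I picked for the example.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of k-word shingles in text (lowercased)."""
    words = text.lower().split()
    if len(words) < k:
        return {" ".join(words)}
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def _hash(seed, item):
    """Deterministic 64-bit hash of item, varied by seed (illustrative helper)."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=128):
    """For each seeded hash function, keep the smallest hash over all items."""
    return [min(_hash(seed, item) for item in items) for seed in range(num_hashes)]


def estimated_jaccard(sig_a, sig_b):
    """Fraction of signature positions that agree; approximates Jaccard similarity."""
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


original = ("great post I really enjoyed reading this and learned a lot "
            "visit http://example.com/a for more")
spam = ("great post I really enjoyed reading this and learned a lot "
        "visit http://example.org/b for more")
unrelated = "the quarterly budget meeting has been moved to next tuesday afternoon"

sig_original = minhash_signature(shingles(original))
sig_spam = minhash_signature(shingles(spam))
sig_unrelated = minhash_signature(shingles(unrelated))

print("original vs spam:     ", estimated_jaccard(sig_original, sig_spam))
print("original vs unrelated:", estimated_jaccard(sig_original, sig_unrelated))
```

The appeal is that two comments can be compared through their short fixed-size signatures instead of their full text, so the copied comment with one link swapped scores far closer to the original than the unrelated one does.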
Also speaking at the event is Prabhakar Raghavan. I have previously heard
him talk about the convergence of the social sciences and the web. Awesome speaker.