Category Archives: Google

Google, its products, and its role on the web.

Blogger Rebuilt

“Then, when someone wants to see any of the pages on your blog, those pages are created for them dynamically, on the fly.” Sounds familiar… The new Blogger doesn’t make users wait on rebuilding anymore, nice upgrade! (What happened to all those folks saying static was the only way to scale?) That and their other new features show a real respect and sensitivity to their users, the only thing missing is an exporter. Rebuilding is so 2004.

Banned from Google

Something really weird happened when I had the password problem last week — I completely disappeared from Google. It’s not just the search for Matt, but almost every page on my site has disappeared even for super-obvious searches. This happened within a day of the guy getting into my blog account. I have two theories, one is that when all my links started pointing to blogspot (he changed my siteurl) that triggered some sort of anti-spam flag, and my second theory is that [H]e turned on the new Blog Privacy feature in WP that adds noindex,nofollow to the header of your page, and Google was crawling me that very instant and removed my site. BTW, as an update to the previous entry, I have since found out that I did not have a super-obvious password, but rather he found it embedded in the source code in the SVN repository of a new project I’m working on that hasn’t been released yet. I’ve axed the repo, but at least now I don’t feel so bad about having an awful password on my blog. Regardless, the event was a good excuse to review my password strategy and make sure everything was fairly locked down. Update: I’m certain it was the noindex thing, which looks like it was on for about a week. Let’s see how long it takes to bring everything back and if I rank the same. Update 2: Everything is back to normal. 🙂

Podsession Responses

There have been two interesting responses to the podcast I did with Om and Niall the other week. The first was Scott Johnson who responded in a podcast. As I expected, most people are taken aback by my statement to “let the engineers pick” what language and enviroment you use for your product. I think there is one important assumption that wasn’t articulated in that statement: you have brilliant engineers and you trust them. As a psuedo-engineer, I find it insulting when people suggest engineers are unable to factor anything other than their selfish language preferences, things like loaded costs, hardware costs, platforms, long-term viability, hiring, etc are simple variables that can be considered by any intelligent person. If anybody in Automattic came to me that was writing a tool in Python, C, Perl (it’s happened) or whatever, I might ask a question or two but at the end of the day I know they’re able to weigh the costs and benefits just like I would. If you’ve hired an engineer that isn’t able to make these decisions as well or better than you, then you’ve already lost the battle and over time more and more of your time will be spent plugging holes in a descent to mediocrity.

The second response was on the Pronet blog which in an amazing feat of blogging acrobatics managed to mention and link every single person tangentially associated with the podcast except me, even though I’m quoted in every heading. The Google Pages example is brought up again to illustrate that all the hardware in the world sometimes solve a scalability problem, but I still think that’s faulty because none of us had any idea why Pages was slow when it launched, it could have been a faulty router for all we know. Pronet responds to “Go with what your happiest working with” with a set of points to consider for a language, but again with the right people none of that matters. Happiness, in all things not just the language, should be the number goal and metric for everything in an early-stage startup. Happy engineers work smarter, longer, more efficient, attract better candidates, and have a better quality of life. (A corollary is that if you’re already set on a language path, don’t hire anyone who isn’t thrilled with working in that language.) For an example of how this can work in a really extreme case, I suggest everyone read the story of Viaweb and Lisp. (Another talk.)In my mind Lisp is a ridiculous language to build a web application in, but to them and their engineers it was heaven and they had better products earlier than their competition as a result of their unusual choice.

(As an aside, I wonder how many people said the same thing about Ruby for web apps before David Heinemeier Hansson, Rails, and 37signals, or even about PHP before Yahoo and Wikipedia? An example (and a little bit of promotion) is better than a thousand whitepapers.)

Share Your OPML

Share Your OPML has relaunched and appears to be built on WordPress. This was one of my favorite services a few years back and was far ahead of its time. I still think moving OPML around is too hard, it would be nice to have some sort of OPML normalization service that could log into different accounts at places like WordPress, Bloglines, and Google reader, grab your file, auto-discover any feeds for entries that don’t have a xmlUrl, and merge everything together using the updated attribute. Hat tip: Niall and Johan.

Better Trackback?

There is talk of pushing for Trackback to become a standard. A few of the problems with Trackback are immediately apparent: horrible internationization support, bad auto-discovery, proclivity for spamming, no verification, historical baggae of category junk, bad spec. Fix all these and you get… pingback. Pingback is big enough now to make a blip in Google’s markup survey, and is supported by a wide range of platforms. The question is whether people are going to want to support an existing and robust standard or want to put their name on something new, the global “not invented here” syndrome where everyone wants their 15 standards of fame. (As someone who has been involved in several standards myself, I admit the draw is strong.) What Pingback does need is a better advocacy site, like atom has.

Markup Survey

Ian Hixie at Google just published a really awesome web authoring survey of a billion documents. What I found most interesting about reading it was places that things I’ve worked on, notably WordPress and GMPG, popped up.

HTTP Headers — “A pretty significant number of pages include an X-Pingback header (more than the number of pages with the Set-Cookie2 header). In fact, X-Pingback was the 30th most-seen header in our data sample.”

WordPress is one of the few platforms that supports pingback, an alternative to Trackback with a real spec. Apparently there are enough WP pages in the world for this to make a blip on the radar.

Page Headers — “It turns out that a tiny but measurable number of people do use the profileattribute, though. The three most-often used values are http://gmpg.org/xfn/1, http://dublincore.org/documents/dcq-html/, and http://gmpg.org/xfn/11. This makes XFN the most popular HTML metadata profile!”

Too cool for words. 🙂 Both of these profiles are included by default in some WordPress templates.

rel="pingback" and rel="bookmark" both skirt the charts in the link relationship page. No XFN values made the cut there.

The <a> element — “external seems to be mainly propagated by WordPress, but people have long been asking for a way to label their links as being external vs internal.”

Nice to get a direct mention there, and we’ve supported bookmark and tag from the beginning. All in all the report is a very interesting read, and kudos to Google for doing this type of research and sharing it with the web. I hope to see more of these in the future, it delights my inner markup geek.

GBase and Wikipedia

I’ve seen a lot of people suggest with Google Base they’re competing with Craig’s List and eBay, but I see it much more as a play against Wikipedia. When I chatted with Jimmy Wales he said one of their biggest problems was for things that are relatively structured like the wiki dictionary or quotes having hte data in a totally unstructured wiki format makes it hard to work with. He suggested they would go the direction of mediawiki templates and microformats, which is great to hear. Google has come at it from the other direction, giving you a free-form DB to pour your heart into. I guess the question is, will people become as passionate about distribution (what Google promises) rather than working for the greater good of humanity through the Wikipedia.