Copyright ©1997-2011 Glenn Fleishman except as noted otherwise. All rights reserved. For permission to reprint, contact Glenn Fleishman at glenn at glennf.com. Photo © 2008 Laurence Chen; used with permission.
Turning technology from mumbo-jumbo into rich tasty gumbo
Okay, so I’m not so bright. After posting the previous item on Amazon’s search feature, I went and used it more extensively.
I didn’t quite realize that it was presenting full book pages. The Authors Guild has sent out a note to its members, which includes me, warning that the system actually allows not just contextual results (my first thought at seeing the search results) but also entire pages. Many pages. In fact, with a little poking, you can retrieve basically entire books.
For reference books — cooking titles, computer books, travel books, etc. — this could devastate sales. I mean, if you can read the five pages you need, why buy the book?
Of course, this points out the flip side: many books, including most (but not all) of the ones I write, have marginal utility for the reader and maximum utility for the bookstore, but only a marginal return for the publisher and the author.
That is, the folks who make the most money with the least capital are the folks selling books. The other steps in the chain have more marginal returns, requiring higher volumes of sales to be viable. This isn’t saying that booksellers are ripping us off or have it easy; rather, that their part of the value chain has the highest return on capital where capital is being expended.
(Authors’ ROI is harder to measure: are we trying to make a living, buy a house, earn a specific dollar wage? My return on capital is pretty vast, but that doesn’t equate to making a great living from it.)
One way I’ve tried to get out of this loop has been through discussions over the last few years about launching a publishing company that would have its primary focus on short, niche titles, sold electronically in small volumes at a low price.
Adam and Tonya Engst, publishers of TidBITS, have launched such a venture: the Take Control series. I’ve known the Engsts for more than a decade, and have had many talks on this subject with Adam, with whom I’ve co-authored two editions of The Wireless Networking Starter Kit.
The Take Control series has a few unique aspects: First, the Engsts run a weekly newsletter which has tens of thousands of subscribers. Second, Adam is one of the best-known Mac people, just below a couple of Apple employees, like Steve Jobs. Third, the Engsts are trustworthy and have assembled a bunch of writers who sell lots of books and have a lot of activities already that give them a chance to promote what they’re doing.
The first Take Control book was on installing and upgrading to Panther (Mac OS X 10.3). It cost $5. Nearly 2,000 have been sold in under 72 hours — and that’s not the end of the sales of this book by any means. There’s no digital rights management on the PDF at all: we’re relying on the price and the general utility to make piracy a pointless or at least irrelevant activity.
I think we might have a model here.
Wired News hits a new phenomenon square in the eyes: comment spam. I’ve had to turn comments off on some blogs because of this.
Amazon launched its book-searching feature today; we were talking about this idea all the way back in late 1996 when I worked there as catalog manager. It’s so cool to see it come to fruition.
My friend, old boss, current officemate, and colleague Steve Roth had this idea way back in the mid-90s: why not have a site at which you could search fulltext, see a little context, and then buy the book?
It took a long time for rights, technology, and integration to make it happen. I’ve been using O’Reilly’s Safari Bookshelf for a few months, and it’s a similar idea taken a step further. For a fee per month, you’re licensing the rights to search and read any page in a book on their site up to a certain number of books at one time. You can search for free, actually, and the results are useful because they show context.
All of this is to the good for authors: it allows our work to be seen as useful in context, and to increase sales based on utility.
What’s the deal with the title of this post? When I worked at Amazon, we had gotten this email from Japan asking something that I can’t recall. But it opened in bad translation as something like “Today, I live in the book.” The rest escapes me but was equally beautiful and senseless.
Spam is an adaptive virus: we only see the successes, as more and more filtering wipes out the less adaptive versions. Lately, I’ve been seeing an increasing amount of spam that’s passed through three layers of filtering, two of them involving Bayesian notions of word frequency. This new spam has a bunch of randomly created word-length text strings. The subject lines have punctuation introduced in strange places so that the words are legible, but they don’t “read” as words. (Of course, an easy parsing solution is to normalize words and then run filters against them.)
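The normalize-then-filter idea in that parenthetical can be sketched roughly as follows. This is a minimal illustration in Python, not anyone's production filter: the `BLOCKED_TERMS` set and the function names are made up for the example, and a real Bayesian filter would score word frequencies rather than check a fixed list.

```python
import re

# Hypothetical terms a downstream filter might score on (assumption for this sketch).
BLOCKED_TERMS = {"viagra", "mortgage"}

def normalize(text: str) -> str:
    """Lowercase the text and strip punctuation injected inside words."""
    tokens = text.lower().split()
    # Drop every character that is not a letter or digit from each token,
    # so "v.i.a.g.r.a" collapses back into "viagra".
    cleaned = [re.sub(r"[^a-z0-9]", "", tok) for tok in tokens]
    return " ".join(tok for tok in cleaned if tok)

def looks_like_spam(subject: str) -> bool:
    """Run a (very crude) filter against the normalized words."""
    words = set(normalize(subject).split())
    return bool(words & BLOCKED_TERMS)
```

This only undoes punctuation placed inside a word. Letters separated by spaces, or the random word-length strings mentioned above, would need different handling.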
Obviously, this is the latest end-run around the latest spam innovation. It shows that Bayesian filtering, while a wonderful idea, has its limits because of spammers’ cleverness and adaptability.
Ultimately, these exercises show that no matter what algorithm we use, spam will still filter through. (I’m still seeing Nigerian variants, which amazes me.) The next approach is going to be digital certificate-based: you can’t forge those, and you prevent non-trusted sources from connecting. If you put certificates on the mail servers — and make sure that VeriSign isn’t the only company controlling the issuing of these certificates, but that non-profits and other organizations can be root certificate authorities — then only mail servers configured with them will be able to exchange email with other servers.
It’ll be tricky, but I believe the next change in the net will come that way. Technology and legislation aren’t stopping spam. Digital certificates could dramatically reduce it because of the ability to revoke certificates, eliminating an entire mail server from a system without requiring a blacklist. (Yeah, and then who decides to revoke certificates? And on and on.)
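To make the revocation argument concrete, here is a toy model in Python of the accept-or-reject decision a mail server might make. Everything in it is an assumption for illustration: the `ServerCert` and `TrustPolicy` names are invented, and it does no cryptography at all (a real system would verify signatures and certificate chains before trusting any of these fields).

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ServerCert:
    """Simplified stand-in for a mail server's certificate (hypothetical)."""
    hostname: str
    issuer: str   # name of the certificate authority that issued it
    serial: int   # serial number assigned by that authority

class TrustPolicy:
    """Decides which peer servers may exchange mail with us."""

    def __init__(self, trusted_issuers):
        self.trusted_issuers = set(trusted_issuers)
        self.revoked = set()  # (issuer, serial) pairs that have been revoked

    def revoke(self, cert: ServerCert) -> None:
        # Revoking one certificate cuts off one mail server without
        # maintaining a separate blacklist of addresses.
        self.revoked.add((cert.issuer, cert.serial))

    def accepts(self, cert: ServerCert) -> bool:
        if cert.issuer not in self.trusted_issuers:
            return False  # non-trusted sources cannot connect
        return (cert.issuer, cert.serial) not in self.revoked
```

Listing several issuers in `trusted_issuers` reflects the point about not leaving a single company in control of issuing. The question of who gets to call `revoke` is left exactly as open here as it is above.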
I’ve written some code that runs under a crontab that will take new posts, filter their content, and forward them to a mailing list. I’m working with a Lyris-based list, but the principle is the same if you have some perl expertise. Email me if you want the code.
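For anyone who wants the general shape of such a script, here is a rough sketch in Python rather than perl. It is not the code mentioned above. The post structure (dicts with `id`, `title`, and `body`), the `last_sent_id` bookkeeping, and the addresses are all assumptions made for the example.

```python
import re
from email.message import EmailMessage

def strip_html(text: str) -> str:
    """Crude content filter: remove anything that looks like an HTML tag."""
    return re.sub(r"<[^>]+>", "", text).strip()

def build_messages(posts, last_sent_id, list_address, sender):
    """Turn posts newer than last_sent_id into messages for the list."""
    messages = []
    for post in posts:
        if post["id"] <= last_sent_id:
            continue  # already forwarded on an earlier cron run
        msg = EmailMessage()
        msg["From"] = sender
        msg["To"] = list_address
        msg["Subject"] = post["title"]
        msg.set_content(strip_html(post["body"]))
        messages.append(msg)
    return messages
```

A cron job would call `build_messages`, hand each result to something like `smtplib.SMTP(...).send_message(msg)`, and then record the highest post id it sent so the next run skips it.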
Thanks, gents! As the subject of many MRIs, I’m extremely grateful to these two fellers.
Of course, Glenn Reynolds was linking to my Wi-Fi Networking News site, not my personal site, so this just increases my Wi-Fi Whuffie.
Hey, I wasn’t able to get to BloggerCon, and I’ll miss all the fun and information. Everyone, have a great time, and don’t sign my name to the bar bills.
I discovered this very reasonable sounding comment just now on my Wi-Fi news site:
“We live in strange times, but someday I think we will look back on all of this and marvel at how crazy it was. God, I hope so. I sure wouldn’t want this insanity to become the norm.”
Unfortunately, it was totally offtopic. The URL of the poster was a scum site, trying to get Google Whuffie.
Jay Allen gets my vote for supreme arbiter of goodness for this page which documents installing a variety of plug-ins and templates inside Movable Type to block the display of comments which contain URLs repugnant to you. It’s not a complete solution, but it does mean that the idiots and scum who have started to spam the comments section of Movable Type (and other) blogs can be suppressed.
It’s based on looking for domain names and URLs in the posts and author info in comments, which means that you get the spammers where they live. They can circumvent all kinds of content restrictions, but in comment spam, they have to link you somewhere.
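The principle is simple enough to sketch outside Movable Type. The Python below is not Jay Allen's code or any Movable Type API; it is a minimal illustration of matching domains in a comment body and author URL against a blocklist, with `BLOCKED_DOMAINS` and the function names invented for the example.

```python
import re
from urllib.parse import urlparse

# Hypothetical blocklist; in practice this would be maintained by the blog owner.
BLOCKED_DOMAINS = {"spam-example.test"}

URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+", re.IGNORECASE)

def extract_domains(text: str) -> set:
    """Collect the hostnames of every http(s) URL found in the text."""
    domains = set()
    for url in URL_PATTERN.findall(text):
        try:
            host = urlparse(url.rstrip(".,;")).hostname
        except ValueError:
            continue  # malformed URL; ignore it rather than crash
        if host:
            domains.add(host.lower().rstrip("."))
    return domains

def is_blocked(host: str) -> bool:
    """Match the blocked domain itself or any subdomain of it."""
    return any(host == d or host.endswith("." + d) for d in BLOCKED_DOMAINS)

def should_suppress(comment_body: str, author_url: str = "") -> bool:
    """True if the comment or its author link points at a blocked domain."""
    hosts = extract_domains(comment_body) | extract_domains(author_url)
    return any(is_blocked(h) for h in hosts)
```

Note that the comment text itself can be perfectly reasonable; the check ignores the words and looks only at where the links go.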
This shows off the power of Movable Type’s extensible architecture. Hooking in Jay’s mods took a few minutes. I just had to install a few simple plug-ins, copy his template, and add a bit of If..Then code into the comment templates, and voila!