Ebooks – Five Years Later


Just a little under five years ago, I started looking at the possibility of Ebooks again. Two months later I bought a Barnes and Noble Nook. For a long time I was very casual with my Ebook reading. I had the Cory Doctorow books and several months’ worth of free Nook books, which more than made up for the cost of the hardware. Really, the best part of fully digital distribution of books (and games) is the way it allows the purchase or giveaway of media that would be cost-prohibitive if the company had to pay for the physical object or shipping fees. I installed Calibre for the free EPUBs, but it didn’t touch my Barnes and Noble purchases.

In the past 1-2 years, things have really changed, so I wanted to write that down, both as a record of what’s changed and perhaps as a helpful signpost for others looking to take the plunge. If you look at my original post, my biggest fear was Digital Restrictions Management (DRM). It’s not an irrational fear, as I’ve lost money on books that I can no longer read because the DRM servers have been shut off. (THANKS, MICROSOFT!) Although none of the suppliers has made the MP3 move of declaring the entire store DRM-free (come on, Google or someone: I’ll promise to buy ALL my books from you if I know they’re DRM-free!), many publishers have started removing it. As Cory Doctorow wrote in the Guardian:

In a sane world, Hachette would have a whole range of tactics available to it. Amazon’s ebook major competitors – especially Apple and Google – have lots of market clout, and their customers are already carrying around ebook readers (tablets and phones). Hachette could easily play hardball with Amazon by taking out an ad campaign whose message was, “Amazon won’t sell you our books – so we’re holding a 50% sale for anyone who wants to switch to buying ebooks from Apple, Google, Kobo or Nook.”

….

But it is precisely because Hachette has been such a staunch advocate of DRM that it cannot avail itself of this tactic. Hachette, more than any other publisher in the industry, has had a single minded insistence on DRM since the earliest days. It’s likely that every Hachette ebook ever sold has been locked with some company’s proprietary DRM, and therein lies the rub.

So smart publishers like Tor and Baen have been publishing their books DRM-free. That means I don’t have to worry about losing my books when I hear that Barnes and Noble may be spinning off the Nook or killing it. Also, I can buy books wherever they’re cheapest. Kindle books without DRM can be read outside of the Kindle, either in other digital readers or by converting to EPUB. Also, for many (although not all) books sold through any of the retailers, it’s trivial to remove the DRM. I will say right now that if they remove that loophole, they’ll be losing my business for sure. I am NOT making the mistake I made before.

Second in importance, and related in every way to the above, is the rise of the Bundle. Both Humble Bundle and StoryBundle offer DRM-free EPUBs, MOBIs, and PDFs of tons of books with a pay-what-you-want pricing model. This guarantees that if the big boys take their ball and go home, they won’t be getting my money, loyalty, and word of mouth. That’ll go to the indies who are publishing on the bundles. Plus there are the classics on Project Gutenberg. There is more DRM-free material out there for me to read than I have hours left in my life.

This explosion of books available to me outside of Barnes and Noble’s ecosystem, plus a desire to have local copies of my B&N books to prevent them from pulling an Amazon and taking them away from me, meant that I had an organizational problem on my hands. So I turned to Calibre, which I hadn’t been using for anything more than converting formats to EPUB and putting them on the Nook. Just as going from a handful of MP3s in the late 90s to over a month’s worth of continuous music in a bunch of file formats necessitated going in and fixing the ID3 tags (a task I’ve been working on for >14 years and have not yet 100% finished), it was time to get serious about the organization of my Ebooks. Otherwise it was going to be hard to manage what was effectively becoming a library. Plus, I like to take advantage of what digital gives me over physical in every medium (dynamic playlists in music, for example, and stats in every medium). Additionally, while there are some who have complex digital “shelves” on their Nooks, the Nook is generally piss-poor when it comes to managing books and burgeoning libraries. I hear the Kindle is pretty bad at it too, which makes sense: the designers didn’t expect people to have digital libraries with thousands of books, something that’s harder to do with physical books since you need a place to put them.

So most of the rest of this blog post is going to discuss how I’ve set up Calibre and how I’m organizing things. First of all, as I mentioned back in 2010, it was the idea of Ebooks that got me to sign up for a Dropbox account. The best thing about it is having my Ebooks triple backed up to the cloud. They’re on Dropbox as well as being backed up to Backblaze on my Windows computer and Crashplan on my Linux computer. So if I lose my Ebooks it’s because God hates me or something. Second, there are at least half a dozen ways to organize the books on the hard drive, so Calibre’s is as good as any other and at least it makes sense. When books are imported into Calibre it puts them into folders like so: Author->Book->files of the book in all the formats you have. The power of Calibre comes from being able to search against all the other metadata you can think of, making the choice of folder structure more or less unimportant.
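To make the Author->Book->files layout concrete, here is a minimal Python sketch that walks a library folder and reports which formats exist for each book. It assumes the two-level layout described above and that Calibre's per-book sidecar files are named metadata.opf and cover.jpg; the function name `list_formats` is made up for this illustration and is not part of Calibre.

```python
from pathlib import Path

# Assumption: files Calibre keeps next to each book that are not book formats.
SIDECAR_NAMES = {"metadata.opf", "cover.jpg"}


def list_formats(library_root):
    """Map (author folder, book folder) to a sorted list of format extensions."""
    formats = {}
    root = Path(library_root)
    for author_dir in sorted(root.iterdir()):
        # Skip loose files (such as the library database) and hidden folders.
        if not author_dir.is_dir() or author_dir.name.startswith("."):
            continue
        for book_dir in sorted(author_dir.iterdir()):
            if not book_dir.is_dir():
                continue
            exts = sorted(
                f.suffix.lstrip(".").upper()
                for f in book_dir.iterdir()
                if f.is_file() and f.name not in SIDECAR_NAMES
            )
            formats[(author_dir.name, book_dir.name)] = exts
    return formats


if __name__ == "__main__":
    import sys

    # Usage: python list_formats.py /path/to/Calibre/Library
    for (author, book), exts in list_formats(sys.argv[1]).items():
        print(f"{author} / {book}: {', '.join(exts)}")
```

This only reads the folder tree, so it is safe to point at a live library, but it knows nothing about the metadata Calibre stores in its database.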

Calibre Setup

First, my big complaint: it doesn’t matter if the book comes from Barnes and Noble, one of the bundles, a Kickstarter, a free book, or an illegitimate source, most of the metadata sucks. What irks me most is when the author and summary aren’t filled out. The Ebook should contain at least as much data as the physical book, including the back cover. The tags are also almost always crappy, but those can be a little more personal, so it doesn’t bug me AS MUCH. Although sometimes it can be a bit ludicrous how they include everything under the sun rather than allowing the user to construct a search within their program. Calibre treats tags both as plain tags (i.e. a way of filtering data: give me all books tagged dragon) and as genres. For the most part that distinction doesn’t matter, although it’ll come into play when I mention the Catalog Ebook.
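For anyone who wants to check how bare a given file is before importing it, the following is a rough Python sketch that reads the Dublin Core fields out of an EPUB and reports which of title, author, and description are empty. It relies only on the standard EPUB container layout (META-INF/container.xml pointing at the OPF package file); the function names `read_epub_metadata` and `missing_fields` are invented for this example, and it will not work on DRM-encumbered or non-EPUB files.

```python
import zipfile
import xml.etree.ElementTree as ET

CONTAINER_NS = {"c": "urn:oasis:names:tc:opendocument:xmlns:container"}
DC = "{http://purl.org/dc/elements/1.1/}"  # Dublin Core namespace used in OPF


def read_epub_metadata(epub_path):
    """Return the title, authors, description, and tags stored in an EPUB."""
    with zipfile.ZipFile(epub_path) as zf:
        # container.xml names the OPF package file that holds the metadata.
        container = ET.fromstring(zf.read("META-INF/container.xml"))
        rootfile = container.find("c:rootfiles/c:rootfile", CONTAINER_NS)
        opf = ET.fromstring(zf.read(rootfile.attrib["full-path"]))

    def texts(tag):
        # Collect the non-empty text of every dc:<tag> element.
        return [el.text.strip() for el in opf.iter(DC + tag)
                if el.text and el.text.strip()]

    return {
        "title": texts("title"),
        "authors": texts("creator"),
        "description": texts("description"),
        "tags": texts("subject"),
    }


def missing_fields(meta):
    """List which of the fields a physical book always carries are empty."""
    return [name for name in ("title", "authors", "description")
            if not meta[name]]
```

Calling `missing_fields(read_epub_metadata("some-book.epub"))` on a file with no author or summary would return `["authors", "description"]`, which is exactly the case complained about above.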

Calibre also allows for the addition of columns. From what I understand (although I could be wrong or wrong in a subtle way), when it comes to the open EPUB format, all the metadata changes are written to the EPUB file. Columns are not; those are for Calibre. So it made sense for me to stop using a “read” tag and start using a “read” column to keep track of which books I’d read. That way if I share the book with my wife and she hasn’t read it, that doesn’t throw her system off. Also, with the ballooning of tags that came along with my ballooning library (from 20ish to 300ish in about a year or so), it was more cumbersome to search for unread books using tags than having a column. I also added a “date read” column, which might be useful in the future, particularly if Goodreads goes away or becomes onerous in a way that makes me abandon it. Finally, I added a “metadata fixed” column to keep track of which books have been fixed up.

So here’s how I’ve started to manage my metadata. As I finish up books, I fix up the metadata while it’s all fresh in my head what exactly this book is about. Sometimes this involves deleting or changing tags. In the rarer cases where key stuff is missing, like publication dates, I download that from within Calibre’s metadata download. I prefer the version that’s made for doing it in bulk because it allows me to pick and choose what it changes. Way too often the data is wildly inconsistent depending on the source. Then I flip the “metadata fixed” column. That way, when I’m done with the books I’m doing in real time, I can go back and fix the ones I read a long time ago.

Calibre has a pretty decent reader built in. I’d probably only ever use it on my laptop, as it would be pretty rare for me to read at my desktop. However, I think there’s an interesting and subtle difference between a maximized Calibre reader and a full screen Calibre reader. First, here’s how it looks at the default size it opens to on my computer:

Calibre Ebook Reader Default size on my computer

Now here it is maximized (I hit the up arrow on the right):

Calibre Ebook Reader Maximized

And here it is full screen:

Calibre Ebook Reader full screen mode

This has to do with the fact that a “screen page” has nothing to do with an Ebook page. Each of these will scroll to a different spot, but the top left page number will be a consistent way of referring to the page I’m on. This is why you hear that one of the advantages of Ebooks is that you can arbitrarily resize the text size and the book will reformat itself. I am not sure of the psychology around it, but even though there’s a smaller information density, I find the full screen formatting easier on my eyes than the maximized version.

The power of the searches is pretty self-explanatory, I think. If I come up with some creative search, I’ll make a blog post about it. What’s nice about the way Calibre does things is that you can save a search as a virtual library. So if you DO have a complex search, you don’t have to remember it every time.
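As one illustration of what such a search might look like, the sketch below builds a Calibre search expression for "books tagged dragon that I haven't read" and wraps it in a `calibredb list` call. Several things here are assumptions rather than a description of my actual setup: the custom column's lookup name is taken to be `#read`, it is taken to be a yes/no column, and the helper names are invented. The same expression could be pasted into Calibre's search box and saved as a virtual library.

```python
import json
import subprocess


def unread_with_tag_query(tag, read_column="#read"):
    """Build a Calibre search expression for unread books carrying a tag.

    `="..."` asks for an exact tag match. `not ...:yes` is used instead of
    `...:no` so that books whose yes/no column was never set still match.
    """
    return f'tags:"={tag}" and not {read_column}:yes'


def build_list_command(query, library_path):
    """Return the calibredb command line as a list, without running it."""
    return [
        "calibredb", "list",
        "--with-library", library_path,
        "--search", query,
        "--fields", "title,authors",
        "--for-machine",  # ask calibredb for JSON output
    ]


def run_query(query, library_path):
    """Run the query; requires calibredb to be installed and on the PATH."""
    done = subprocess.run(
        build_list_command(query, library_path),
        check=True, capture_output=True, text=True,
    )
    return json.loads(done.stdout)
```

Keeping the command builder separate from the function that runs it makes it easy to print the command and try it by hand before trusting the script with a real library.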

Of course, Calibre also has some awesome plugins. Epubsplit allowed me to take a Scalzi bundle I bought from Subterranean Press, in which they’d put all the books into one EPUB file, and separate them out into different books. That makes it easier for me to jump around and skip the Lock Out and Old Man’s War stuff so I don’t get spoiled. There’s also a plugin that’ll take content that updates daily and make an EPUB, like a custom newspaper to read on the train every day. Since I don’t ride a train to work, I haven’t used it. But if they have a template for the short fiction published for free on Tor.com, I may have to make use of that in the future.

One of the plugins I find a lot of fun is the one that generates a catalog of all my books. Here’s what it looks like:

Calibre Ebook Reader - My Catalog

Although the advent of smart phones and apps makes it possible to have books from Barnes and Noble, Google Books, and Kindle all on the same device, I’m not as much of a fan of reading on the smart phone. The biggest reason (and the one least likely to be solved in the short term) is the low battery life vs how long it takes to read books. Reading on a phone on a long flight is a great way not to have a working phone when you land. Although newer fonts help, I’m also not really into reading backlit text for a long time. That said, it is nice that I always have my phone with me and I’ve used it when I’ve wanted to start reading a new book and didn’t have my Nook with me. Benefit goes to the Kindle app over my Nook in that it’s very easy to highlight words for looking up when you’re using a touch-screen. It’s painfully slow on my Nook.

So things have picked up quite a bit since 2010 and I see myself enjoying Ebooks for a long time.
