Small HTML parser I used from GitHub.

Yeah. I'm needing something that will allow me to modify or completely remove certain attributes. I also strip all <FONT> tags from ebook source docs as well as others that cause an ePub to fail epubcheck.

Note: Those that do fail work just fine but Barnes & Nobel will refuse to put a title in their store if it fails. Apple is about as bad.

// @kdfrawg

Clean syntax. Very straightforward.

// @kdfrawg

This isn't the first time I have heard this about "Go". I suppose I'll take a look.

// @kdfrawg

It's been years since I had a good Doom deathmatch.

//

"Why is my Google maps doing this? Do I have a virus. There's like a little Pac Man thing and the word ready"

Yes, I got that call this morning.

i thought they went out of business years ago :/

Assuming you mean F.W. Woolworth.

@kdfrawg It's a couple of lint/sanity tools for XML and HTML sources that have to make in-place changes to the documents.

You'd think that'd be simple but it isn't.

Part of what I use now is an ugly mess of regex and sed but this, naturally, restricts me to making those changes on a Linux box or a Mac with macports installed. I spend most of my time on a Windows box these days.

Note: It's possible under windows too but GnuWin makes my skin crawl. Has ever since an early release played good citizen and wiped my temp folder on exit….Problem was, the default setting for /tmp was C:\…I kid you not. I was not amused.

Also…sed?!

What was I thinking?!

@kdfrawg We have one particular copy-editor. Every document she ever touches becomes a mass of SPAN tags when exported to HTML (for ebook generation). We have another, also using Word, who tends to markup things more. Her's are relatively clean.

It's a mystery.

@kdfrawg Working with DOM. The Java code is just SO much cleaner.