
Thanks for the helpful advice. I'm trying to apply it to Mac OS 8.6 and FineReader 5. Microsoft Word 4.0 seems to be a possibility. More later. Dick Adicks
From: "Jeffrey Kraus-yao" <krausyaoj@ameritech.net> Reply-To: Project Gutenberg Volunteer Discussion <gutvol-d@lists.pglaf.org> Date: Mon, 23 May 2005 20:22:15 -0500 To: "'Project Gutenberg Volunteer Discussion'" <gutvol-d@lists.pglaf.org> Subject: RE: [gutvol-d] PG for Mac users
I have produced a number of ebooks using a Macintosh. The key step is converting the MacRoman text to Unicode UTF-8 text. One option for this step is Cyclone available at http://free.abracode.com/cyclone/ I also use a program that I wrote using Symantec C++ as it is compatible with Mac OS 7 while Cyclone requires Mac OS 8.6.
The tools and process I use for creating ebooks are listed below.
Hardware: Macintosh Quadra 840AV, Color OneScanner 1200/30, Compaq Deskpro 667, and an RCA REB1200 ebook reader
Software: Mac OS 7.6, Xerox TextBridge Pro 8, Microsoft Word 5.1, custom programs written using Symantec C++ 8, Windows 2000, Gemstar eBook Publisher, and REB Librarian
Steps: 1. Scan book using TextBridge. This creates text files encoded using MacRoman and does not create image files. 2. First round of spell check and proofing using Word. 3. Dehyphenate using custom program. This program creates a dictionary of all words in the file and uses an external dictionary to decide which words to remove their hyphens. This can also be done manually using Word. 4. Convert MacRoman text file to UTF-8 text file. Also convert Macintosh line breaks, CR, to Windows line breaks, CR-LF. 5. Add minimal HTML markup which includes paragraph tags and HTML entities for non-ASCII characters. This is a separate file than the UTF-8 file and uses ASCII encoding. 6. Copy text files to computer running Windows 2000. 7. Add additional HTML markup for character styles, headers, and other formatting using printed book as source. 8. Convert HTML file to IMP format using Gemstar eBook Publisher. 9. Copy to REB1200 for proofing using REB Librarian. Make corrections to both text and HTML files. 10. Convert UTF-8 text file to ASCII encoding.
The final results are three files: ASCII text file, UTF-8 text file, and an HTML file.
Jeffrey Kraus-Yao
-----Original Message----- From: gutvol-d-bounces@lists.pglaf.org [mailto:gutvol-d-bounces@lists.pglaf.org] On Behalf Of Dick Adicks Sent: 20 May, 2005 20:15 To: Gutvol-d Subject: [gutvol-d] PG for Mac users
Is there a sub-group to help Mac users to create e-texts that will communicate successfully with PCs? Are there individual volunteers who use Macs and can advise others?
Dick Adicks
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d