II. PROJECT SCOPE (Click here to Save and Edit this Section)
The focus of the Project is to create and market the D4KQS products/services. To achieve that, we have to complete the following scope of works:
1. Create an Extended Coded Character Set representing over 30,000 Chinese characters to be accepted as a defacto standard for digital publishing of Chinese classics. This Extended Coded Character Set will utilize the areas of (1) CJK Unified Ideographs (20,902 characters), (2) CJK Unified Ideographs Extension A (6,585) and (3) Private Use Area (6,500) in the Basic Multilingual Plane in ISO/IEC 10646-1. In addition, associated input methods and Gaiji True-Type Font of the characters will be created as part of the Extended Coded Character Set for the use with computers.
2. Create Digitization Programs to automatically scan the text pages as image files, clean and separate the image files (pre-OCR program), optically recognize and encode the characters (OCR engine), proof-read the recognized characters and correct the mistakes (proof-reading and correction program).
3. Create the D4KQS products/services (see Product Specifications in Section IV and Product Creation in Section V)
4. Market the D4KQS products/services (see Marketing in Section VI)
5. Maintain, improve and upgrade the D4KQS products/services
6. Create and maintain the virtual community of users for the D4KQS products/services