User Tools

Site Tools


scanning

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
scanning [2015/12/02 07:21] sbwscanning [2015/12/02 07:35] sbw
Line 37: Line 37:
 Result of OCR Result of OCR
  
-the lap of luxury. +  the lap of luxury. 
-IYOu know", said Frank, "a tourist's life won't be bad." "No", said Snow, "just on the tarn." +  IYOu know", said Frank, "a tourist's life won't be bad." "No", said Snow, "just on the tarn." 
-But we were all looking forward to it!+  But we were all looking forward to it!
  
 2. Black and white – text only setting 2. Black and white – text only setting
Line 45: Line 45:
 Result of OCR Result of OCR
  
-the lap of luxury. +  the lap of luxury. 
-"You know", said Frank, "a tourist's life won't be bad." "No", said Snow, "just on the turn." +  "You know", said Frank, "a tourist's life won't be bad." "No", said Snow, "just on the turn." 
-But we were all looking forward to it!+  But we were all looking forward to it!
  
  
  
-The margins are not even, so take this into account when you are scanning, and shift the pages one way or the other, otherwise you may lose text off the sides. This is primarily an issue for older magazines which were printed on quarto paper, which is wider than A4. Newer ones are A4 and the text should fit on an A4 page regardless.+{{:processed_0688.jpg?nolink&200|}}{{:processed_0689.jpg?nolink&200|}} The margins are not even, so take this into account when you are scanning, and shift the pages one way or the other, otherwise you may lose text off the sides. This is primarily an issue for older magazines which were printed on quarto paper, which is wider than A4. Newer ones are A4 and the text should fit on an A4 page regardless.
  
 I suggest scanning the various cover sections (front cover, inside front cover, back cover, inside back cover) separately. This makes it cheaper and easier to OCR. For the cover (and any advertisement pages), I use a low density setting, as otherwise it comes out very dark. Again, some trial and error may be needed to get good results. I suggest scanning the various cover sections (front cover, inside front cover, back cover, inside back cover) separately. This makes it cheaper and easier to OCR. For the cover (and any advertisement pages), I use a low density setting, as otherwise it comes out very dark. Again, some trial and error may be needed to get good results.
Line 58: Line 58:
  
 Save the magazine with the following names: Save the magazine with the following names:
-main part - <yyyymm>_<issuenumber>.pdf eg 195401_230.pdf for January 1954, issue number 230. +  * main part - <yyyymm>_<issuenumber>.pdf eg 195401_230.pdf for January 1954, issue number 230. 
-front cover - eg 195401_230_cover.pdf+  front cover - eg 195401_230_cover.pdf
  
 In a lot of cases, the same inside covers and back cover are used over a year. You only need to scan it once. Name them: In a lot of cases, the same inside covers and back cover are used over a year. You only need to scan it once. Name them:
-1954_inside_cover.pdf +  * 1954_inside_cover.pdf 
-1954_back_cover.pdf +  1954_back_cover.pdf 
-1954_inside_back_cover.pdf+  1954_inside_back_cover.pdf
  
-More recent magazines may not have issue numbers, in which case you can leave it out +More recent magazines may not have issue numbers, in which case you can leave it out eg 198701.pdf
-eg 198701.pdf+
  
 I can compile the various sections back into a magazine for the website. If you make a mistake (eg a page scans badly, page out of order) make a note of it and just keep scanning. For badly scanned pages, either rescan the page straight away, or scan it separately afterwards. I have software which can manipulate/insert/remove pages after the fact. This is usually easier than starting again. I can compile the various sections back into a magazine for the website. If you make a mistake (eg a page scans badly, page out of order) make a note of it and just keep scanning. For badly scanned pages, either rescan the page straight away, or scan it separately afterwards. I have software which can manipulate/insert/remove pages after the fact. This is usually easier than starting again.
  
-Zip sets of PDF files together (max of about 10MB per zip file) and email them to me at tomb@ozultimate.com. Include a note explaining any errors that need to be corrected or pages rearranged.+Zip sets of PDF files together (max of about 10MB per zip file) and email them. Include a note explaining any errors that need to be corrected or pages rearranged.
  
 ===== 5. Finishing up ===== ===== 5. Finishing up =====
  
 Insert the magazines back inside the springback binder. Don't restaple them – we'll look for a better method of archiving that doesn't mark the magazines. Insert the magazines back inside the springback binder. Don't restaple them – we'll look for a better method of archiving that doesn't mark the magazines.
- 
- 
scanning.txt · Last modified: 2023/08/16 13:22 by sbw

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki