Send CathInfo's owner Matthew a gift from his Amazon wish list:
https://www.amazon.com/hz/wishlist/ls/25M2B8RERL1UO

Author Topic: Copying text from archive.org books?  (Read 869 times)

0 Members and 1 Guest are viewing this topic.

Offline Cryptinox

  • Full Member
  • ***
  • Posts: 1149
  • Reputation: +248/-91
  • Gender: Male
Copying text from archive.org books?
« on: February 21, 2022, 11:15:57 PM »
  • Thanks!0
  • No Thanks!0
  • I am curious if there is a way I can just get all the text of an Archive.org book on one and just copy it so I don't have to flip through every single page of the book. I want to to do this because I want to copy How Christ Said the First Mass so that I can make an audio version to listen to using a tolerable text to speech program.


    Offline Mark 79

    • Supporter
    • *****
    • Posts: 9526
    • Reputation: +6247/-940
    • Gender: Male
    Re: Copying text from archive.org books?
    « Reply #1 on: February 21, 2022, 11:19:36 PM »
  • Thanks!1
  • No Thanks!0
  • https://archive.org/details/howchristsaidfir00meag

    Scroll down, right sidebar "Download options," pick your choice.


    Offline Cryptinox

    • Full Member
    • ***
    • Posts: 1149
    • Reputation: +248/-91
    • Gender: Male
    Re: Copying text from archive.org books?
    « Reply #2 on: February 22, 2022, 12:08:19 AM »
  • Thanks!0
  • No Thanks!0
  • https://archive.org/details/howchristsaidfir00meag

    Scroll down, right sidebar "Download options," pick your choice.
    Thanks. Exactly what I was looking for.

    Offline B from A

    • Full Member
    • ***
    • Posts: 1106
    • Reputation: +687/-128
    • Gender: Female
    Re: Copying text from archive.org books?
    « Reply #3 on: February 22, 2022, 07:39:10 AM »
  • Thanks!0
  • No Thanks!0
  • so that I can make an audio version to listen to using a tolerable text to speech program.

    It looks like a photographed book.  Is there an easy way to change that to text, or straight to speech from photo?  Or am I wrong about it being a photo? 


    Offline Marion

    • Full Member
    • ***
    • Posts: 1867
    • Reputation: +759/-1134
    • Gender: Male
    • sedem ablata
    Re: Copying text from archive.org books?
    « Reply #4 on: February 22, 2022, 07:58:25 AM »
  • Thanks!1
  • No Thanks!0
  • It looks like a photographed book.  Is there an easy way to change that to text, or straight to speech from photo?  Or am I wrong about it being a photo?

    The PDF contains both page images and text.
    That meaning of the sacred dogmas is ever to be maintained which has once been declared by holy mother church. (Dei Filius)


    Offline Mithrandylan

    • Hero Member
    • *****
    • Posts: 4452
    • Reputation: +5061/-436
    • Gender: Male
    Re: Copying text from archive.org books?
    « Reply #5 on: February 22, 2022, 10:54:01 AM »
  • Thanks!0
  • No Thanks!0
  • It looks like a photographed book.  Is there an easy way to change that to text, or straight to speech from photo?  Or am I wrong about it being a photo?
    .
    You will need to invest in OCR software. Then you can process the image text files into searchable text files. Adobe is one option. If you prefer non SaaS options, ABBY Reader is very good. I think they still provide a perpetual license. 
    "Be kind; do not seek the malicious satisfaction of having discovered an additional enemy to the Church... And, above all, be scrupulously truthful. To all, friends and foes alike, give that serious attention which does not misrepresent any opinion, does not distort any statement, does not mutilate any quotation. We need not fear to serve the cause of Christ less efficiently by putting on His spirit". (Vermeersch, 1913).

    Offline Cryptinox

    • Full Member
    • ***
    • Posts: 1149
    • Reputation: +248/-91
    • Gender: Male
    Re: Copying text from archive.org books?
    « Reply #6 on: February 22, 2022, 11:06:50 AM »
  • Thanks!0
  • No Thanks!0
  • .
    You will need to invest in OCR software. Then you can process the image text files into searchable text files. Adobe is one option. If you prefer non SaaS options, ABBY Reader is very good. I think they still provide a perpetual license.
    Archive.org has the text file available for download