I'll do the text conversion program for you. I've had a sourceforge account for a long time, but I haven't actually used it yet; what should I do to start working on this?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Wow. I was expecting E-mail.
I forgot to check the Forums.
Go to the File download area, and
look at the text for the first book
of the Oz series.
What we want is to get the data so it is one
sentence per line. As you can tell, I have started
doing some of this by hand.
I plan to upload the rest of the first 14 Oz books
to the files area as well.
Ultimately, the idea is that we want to have a web page connecting the text of the books to a separate page for each sentence, stating the facts that we can derive about OZ from that sentence.
I don't know what to do about quoted sentences.
Usually they involve sentence fragments, but they may have several sentences. What I have done
currently is put them all on one line.
Perhaps the linked 'content' page will have the
sentences broken down and then have those sentences point to new pages that have the meaning of them. That makes sense from a linguistic point of view, because people say things that aren't necessary currently true, but which they think are true.
Anyway, I hope you haven't gotten a new project
by now, I definitely appreciate your help...
David
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I'll do the text conversion program for you. I've had a sourceforge account for a long time, but I haven't actually used it yet; what should I do to start working on this?
Wow. I was expecting E-mail.
I forgot to check the Forums.
Go to the File download area, and
look at the text for the first book
of the Oz series.
What we want is to get the data so it is one
sentence per line. As you can tell, I have started
doing some of this by hand.
I plan to upload the rest of the first 14 Oz books
to the files area as well.
Ultimately, the idea is that we want to have a web page connecting the text of the books to a separate page for each sentence, stating the facts that we can derive about OZ from that sentence.
I don't know what to do about quoted sentences.
Usually they involve sentence fragments, but they may have several sentences. What I have done
currently is put them all on one line.
Perhaps the linked 'content' page will have the
sentences broken down and then have those sentences point to new pages that have the meaning of them. That makes sense from a linguistic point of view, because people say things that aren't necessary currently true, but which they think are true.
Anyway, I hope you haven't gotten a new project
by now, I definitely appreciate your help...
David