
Hi Paulo, Am 30.07.2009 um 06:40 schrieb Paulo Levi:
But i am reading the rdf into a (file) database. That is more or less what Lucene is. What i am filtering is just what insert into the database, so that its creation is faster / searches only on the fields that interest me.
Sure its a lot of code, that will break if the format changes, but it If you program modularely that would be no problem
reduced the creation step from 5 minutes or so to 40 seconds (this on If you are filtering and just get a factor of 13 I said it your system that is slow. If I remember correctly you are just requestÃng the certain information so somebody else is doing the work!
a fast dual-core computer - i shudder to think what would happen if a user tried to re-index in a 1000 hertz machine). Let's see. My Mac SE was a 1 Mega Hertz machine. That was twenty years ago. It would handle something like this in about ten minutes. I do not know what dbase system I was using.
The index is at about 33.5 mb, and should compress into < 10mb. Probably enough to be included into the application.
Hardcoding data of that size into the program is not feasible. Though most newer computers can load it into memory quite quickly. gives you a factor of 100 if everything is in memory that is why perl is so fast. regard Keith