Visualizations for 50th PG birthday
Hi, socialmedia team. Laura & Joshua have collaborated towards some wonderful illustrations. These will be worthwhile to highlight in our social media feed. See links in the attached email. Three other pending items, for July 4: - A blog post at Distributed Proofreaders - A new page in the About section of www.gutenberg.org (with a link from the main page) - Our monthly newsletter, which I Hope to send on July 4 Thanks for helping to celebrate the 50th year of Project Gutenberg! Greg
I'll put the charts up on our Facebook page on July 4, but the link to the Over Time chart resolves to a page doesn't exist error. Paul For personally curated ebook/elibrary/epublishing news by the former Editor-in-Chief of TeleRead follow me on Twitter @paulkbiba; Editor, Project Gutenberg's Facebook page: https://www.facebook.com/project.gutenberg On Thu, Jul 1, 2021 at 11:27 AM Greg Newby via socialmedia < socialmedia@lists.pglaf.org> wrote:
Hi, socialmedia team. Laura & Joshua have collaborated towards some wonderful illustrations. These will be worthwhile to highlight in our social media feed. See links in the attached email.
Three other pending items, for July 4: - A blog post at Distributed Proofreaders - A new page in the About section of www.gutenberg.org (with a link from the main page) - Our monthly newsletter, which I Hope to send on July 4
Thanks for helping to celebrate the 50th year of Project Gutenberg! Greg
---------- Forwarded message ---------- From: Joshua Preston <photopartner@gmail.com> To: Laura N <lnatal55@gmail.com> Cc: Greg Newby <gbnewby@pglaf.org> Bcc: Date: Wed, 30 Jun 2021 21:59:36 -0400 Subject: Re: 50th PG birthday Thx Laura - I updated the catalog charts to let users go straight to a book!
At the link are the final two charts ready for the public.
*Project Gutenberg at 50*
*Project Gutenberg ebooks Issued Over Time*
p.s. My wife surprised me and whisked me away on holiday, which is why I've been slow to respond. I hope these two charts can help the 50th bday festivities and give PG exposure on social media.
I'd still like to tackle the map project when the data is available. We can pick back up anytime you'd like after this week.
cheers. Josh
On Sat, Jun 26, 2021 at 12:51 PM Laura N <lnatal55@gmail.com> wrote:
Great work, liked the data with LoCC!!
All URLs have the following format:
https://www.gutenberg.org/ebooks/#ebook
#ebook is available in our catalogue, does it suit you?
Laura
Em sáb., 26 de jun. de 2021 às 17:44, Joshua Preston < photopartner@gmail.com> escreveu:
The category graphic could drive ebook downloads if the url for each title was available. I'd snatch up some H.G. Wells right now!
[image: image.png]
On Sat, Jun 26, 2021 at 12:30 PM Joshua Preston <photopartner@gmail.com> wrote:
The LoCC data is perfect for many types of analysis. I took the Wikipedia version of the LoCC since it was "clean" and fairly complete.
Check out the two timeline-based charts at the link (toggle with the tabs at the top. I will keep all the data graphics in this one spot until we publish and then separate them).
https://public.tableau.com/app/profile/joshpreston/viz/PGtest1_shelvesheatma...
Keep the data coming!
On Sat, Jun 26, 2021 at 11:45 AM Greg Newby <gbnewby@pglaf.org> wrote:
Thanks again for all this great work!
For your questions #2 below: No, we don't retain statistics on downloads (it's actually part of our privacy policy). Definitely nothing that ties country of download to individual book titles.
It would be possible to harvest past top-100 pages from the wayback machine, though.
Best, Greg
On Sat, Jun 26, 2021 at 04:30:22PM +0100, Laura N wrote:
Excellent Josh! Let's keep this last one as a general overview.
1. I wonder if you could prepare another tableau with the LoCC
column. > > The subject areas are the following: > > https://www.loc.gov/catdir/cpso/lcco/ > > And the complete classification may be found here: > > https://www.loc.gov/aba/publications/FreeLCC/freelcc.html > > 2. Another approach could be the top authors in the last 30 days; > > 3. Top 100 ebooks in the last 30 days. > > Perhaps, in the future, we could work with items 2&3 over one year period. > I'll start to save these files once a month and see what can be done. > > Since our deadline is too short, let's work with the available data. > > Thanks a lot!! > > Laura > > Em sáb., 26 de jun. de 2021 às 02:23, Joshua Preston < photopartner@gmail.com> > escreveu: > > > Thanks! good night. An update: > > https://public.tableau.com/app/profile/joshpreston/viz/PGtest1_shelvesheatmap/Dashboard2 > > > > > > [image: image.png] > > > > On Fri, Jun 25, 2021 at 9:05 PM Laura N <lnatal55@gmail.com> wrote: > > > >> That’s why I’ll try to clean the data from the authors’ column in our > >> catalogue. As soon as I get some results I'll let you know ( I work in > >> European time zone). > >> > >> On Sat, Jun 26, 2021 at 01:51 Joshua Preston < photopartner@gmail.com> > >> wrote: > >> > >>> Got it! I'll be drilling down into the book classifications. Really god > >>> stuff. > >>> > >>> I meant to ask, I wanted to get more details on this idea you > >>> mentioned: Josh I wonder if you could work with authors on a new timeline. > >>> > >>> On Fri, Jun 25, 2021 at 8:30 PM Laura N <lnatal55@gmail.com> wrote: > >>> > >>>> In addition I am converting David’s list into a csv file in order to > >>>> try to make a cross reference with our catalogue’s data. The way the > >>>> authors are listed there doesn’t allow to get some statistics since all > >>>> collaborators/translators/illustrators are in the same category as the main > >>>> author. Let’s see what we can do since PG’s server only keeps data from the > >>>> last 30 days. > >>>> > >>>> On Sat, Jun 26, 2021 at 01:03 Laura N <lnatal55@gmail.com> wrote: > >>>> > >>>>> In the above link just click in Classification in order to see all > >>>>> subject areas and the listing of all subclasses. Hope it helps in someway… > >>>>> > >>>>> On Sat, Jun 26, 2021 at 00:56 Joshua Preston < photopartner@gmail.com> > >>>>> wrote: > >>>>> > >>>>>> Thx Laura! I meant to ask, can you expand on this? > >>>>>> > >>>>>> Josh I wonder if you could work with authors on a new timeline. > >>>>>> > >>>>>> On Fri, Jun 25, 2021 at 7:44 PM Laura N <lnatal55@gmail.com> wrote: > >>>>>> > >>>>>>> Your answer for item 1 may be found below: > >>>>>>> > >>>>>>> https://en.m.wikipedia.org/wiki/Library_of_Congress_Classification > >>>>>>> > >>>>>>> > >>>>>>> On Sat, Jun 26, 2021 at 00:02 Joshua Preston < photopartner@gmail.com> > >>>>>>> wrote: > >>>>>>> > >>>>>>>> I'm starting to see the contours of the datasets and understand > >>>>>>>> them better. I wanted to be conservative with data requests until I got > >>>>>>>> more of an understanding of what is available. > >>>>>>>> Below is what I hope is available so I can focus on the > >>>>>>>> timeline-based and map-based data graphics. I can focus my efforts there so > >>>>>>>> we can have something 'wow' for social media before next weekend. > >>>>>>>> > >>>>>>>> 1- is there a key for the subjects that spells out what they are? > >>>>>>>> These are the subject codes, correct? > >>>>>>>> > >>>>>>>> e.g. > >>>>>>>> > >>>>>>>> LoCC > >>>>>>>> AC > >>>>>>>> AC; B > >>>>>>>> AC; BL > >>>>>>>> AC; DA > >>>>>>>> AC; E300 > >>>>>>>> AC; F1001 > >>>>>>>> AC; KF > >>>>>>>> AC; PA > >>>>>>>> AC; PL > >>>>>>>> AC; PN > >>>>>>>> AC; PR > >>>>>>>> AC; PS > >>>>>>>> AC; Q; QH > >>>>>>>> AC; QA > >>>>>>>> AC; TJ > >>>>>>>> AC; TX > >>>>>>>> 2a- is there a data file of lifetime downloads of each book/story > >>>>>>>> title? > >>>>>>>> 2b- by month/year or year? > >>>>>>>> 2c- by country? > >>>>>>>> > >>>>>>>> 3- Would it be possible to get the top 100 downloads by month from > >>>>>>>> the last 12 months for each country and for each state in the U.S.? > >>>>>>>> > >>>>>>>> > >>>>>>>> thanks in advance. > >>>>>>>> > >>>>>>>> Josh > >>>>>>>> > >>>>>>>> > >>>>>>>> On Fri, Jun 25, 2021 at 6:25 PM Greg Newby < gbnewby@pglaf.org> > >>>>>>>> wrote: > >>>>>>>> > >>>>>>>>> These are all great - thanks! > >>>>>>>>> > >>>>>>>>> The bookshelves have mostly not been updated in years, > >>>>>>>>> unfortunately... we do have subject codes in the CSV I sent, which are all > >>>>>>>>> up-to-date. The subject codes might be another way to categorize. > >>>>>>>>> - Greg > >>>>>>>>> > >>>>>>>>> On Fri, Jun 25, 2021 at 06:37:30PM +0100, Laura N wrote: > >>>>>>>>> > Awesome, thanks! In the meantime, I've been working with the > >>>>>>>>> pg_catalogue > >>>>>>>>> > and I made this plot: > >>>>>>>>> > > >>>>>>>>> > [image: image.png] > >>>>>>>>> > > >>>>>>>>> > I also found a very nice plot by continent which I intended to > >>>>>>>>> make since > >>>>>>>>> > the beginning, please take a look. It's very straightforward and > >>>>>>>>> I can use > >>>>>>>>> > the data I collected from the top downloads in the last 30 days > >>>>>>>>> and make a > >>>>>>>>> > similar pic. > >>>>>>>>> > > >>>>>>>>> > https://geediting.com/most-iconic-book-set-in-every-country/ > >>>>>>>>> > > >>>>>>>>> > Any additional ideas? Josh I wonder if you could work with > >>>>>>>>> authors on a new > >>>>>>>>> > timeline. Do you think it is possible? > >>>>>>>>> > > >>>>>>>>> > Regards > >>>>>>>>> > > >>>>>>>>> > Laura > >>>>>>>>> > > >>>>>>>>> > Em sex., 25 de jun. de 2021 às 03:49, Joshua Preston < > >>>>>>>>> photopartner@gmail.com> > >>>>>>>>> > escreveu: > >>>>>>>>> > > >>>>>>>>> > > 50 years! congrats. I missed that... > >>>>>>>>> > > > >>>>>>>>> > > Below is a link to a time-based chart to start off with. > >>>>>>>>> > > > >>>>>>>>> > > (If the catalog has lifetime downloads, that would make for a > >>>>>>>>> great > >>>>>>>>> > > visual.) > >>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> https://public.tableau.com/views/PGtest1_shelvesheatmap/Dashboard2?:language=en-US&:display_count=n&:origin=viz_share_link:showVizHome=no > >>>>>>>>> > > > >>>>>>>>> > > On Thu, Jun 24, 2021 at 7:46 PM Greg Newby < gbnewby@pglaf.org> > >>>>>>>>> wrote: > >>>>>>>>> > > > >>>>>>>>> > >> Hi, Josh. Responses: > >>>>>>>>> > >> > >>>>>>>>> > >> On Thu, Jun 24, 2021 at 06:31:36PM -0400, Joshua Preston > >>>>>>>>> wrote: > >>>>>>>>> > >> > Hey Greg and Laura (aka birthday planners) > >>>>>>>>> > >> > > >>>>>>>>> > >> > Ya'll are really delivering on the data – geographic > >>>>>>>>> fields, date > >>>>>>>>> > >> fields, > >>>>>>>>> > >> > download counts, etc. Thanks for the good clean datasets. > >>>>>>>>> It puts us in > >>>>>>>>> > >> a > >>>>>>>>> > >> > good position. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Laura - I'll continue with your inspiration for a map-based > >>>>>>>>> option. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Greg - Thanks for the sneak peek and access to what I am > >>>>>>>>> assuming is the > >>>>>>>>> > >> > entire ebook library? Good stuff. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Question: what is the 'Issued' date field? I'm trying to > >>>>>>>>> reconcile the > >>>>>>>>> > >> 25th > >>>>>>>>> > >> > bday and how there are some dates that are older than 25 > >>>>>>>>> years. > >>>>>>>>> > >> > >>>>>>>>> > >> This is the 50th Birthday. The first eBook was July 4, 1971. > >>>>>>>>> > >> > >>>>>>>>> > >> The "Issued" date is the release date: the date the eBook > >>>>>>>>> first appeared > >>>>>>>>> > >> in the PG catalog. Clearly this could be used as the basis of > >>>>>>>>> a timeline > >>>>>>>>> > >> graphic. > >>>>>>>>> > >> > >>>>>>>>> > >> > > >>>>>>>>> > >> > For the group effort - both datasets include the 'primary > >>>>>>>>> key' or > >>>>>>>>> > >> unique ID > >>>>>>>>> > >> > for each book. PLEASE keep including that as you send any > >>>>>>>>> more data. > >>>>>>>>> > >> That's > >>>>>>>>> > >> > key (pun) to stitching all the data sources together. > >>>>>>>>> > >> > >>>>>>>>> > >> Yes. That's the eBook number, a.k.a. accession number. It is > >>>>>>>>> our unique > >>>>>>>>> > >> key, always. > >>>>>>>>> > >> > >>>>>>>>> > >> > I'll start sending bare-bones prototypes your way. As we > >>>>>>>>> get closer to > >>>>>>>>> > >> > final charts I'll start to consider the design aesthetic. > >>>>>>>>> The first > >>>>>>>>> > >> thing > >>>>>>>>> > >> > to do is design on the basic presentation of the data. > >>>>>>>>> Feedback is > >>>>>>>>> > >> > appreciated. > >>>>>>>>> > >> > >>>>>>>>> > >> You might also like this very thorough timeline: > >>>>>>>>> > >> https://archive.org/details/history-ebooks > >>>>>>>>> > >> > >>>>>>>>> > >> It was created by Marie Lebert, who also wrote some of the > >>>>>>>>> articles in > >>>>>>>>> > >> the About section of www.gutenberg.org > >>>>>>>>> > >> > >>>>>>>>> > >> Excellent! Thanks again. > >>>>>>>>> > >> Greg > >>>>>>>>> > >> > >>>>>>>>> > >> > >>>>>>>>> > >> > If at some point, a proper chat is in order, my cell is > >>>>>>>>> below. > >>>>>>>>> > >> > > >>>>>>>>> > >> > cheers. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Josh > >>>>>>>>> > >> > 678.231.0787 > >>>>>>>>> > >> > > >>>>>>>>> > >> > On Thu, Jun 24, 2021 at 3:31 PM Laura N < lnatal55@gmail.com> > >>>>>>>>> wrote: > >>>>>>>>> > >> > > >>>>>>>>> > >> > > This is great Greg! I’ll take a look and see what else I > >>>>>>>>> can do with > >>>>>>>>> > >> these > >>>>>>>>> > >> > > data. > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > Regards and thanks for your help > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > Laura > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > On Thu, Jun 24, 2021 at 20:21 Greg Newby < > >>>>>>>>> gbnewby@pglaf.org> wrote: > >>>>>>>>> > >> > > > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Hi, Josh. In case it's of interest, we have a brand-new > >>>>>>>>> CSV metadata > >>>>>>>>> > >> > >> extract: > >>>>>>>>> > >> > >> > >>>>>>>>> https://www.gutenberg.org/cache/epub/feeds/pg_catalog.csv.zip > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> This is not currently advertised anywhere, because there > >>>>>>>>> are still > >>>>>>>>> > >> some > >>>>>>>>> > >> > >> other fields we are trying to add. It's not > >>>>>>>>> automatically updated. > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Thanks again for looking into some nice visualization > >>>>>>>>> products for > >>>>>>>>> > >> > >> Project Gutenberg! > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Best, > >>>>>>>>> > >> > >> Greg > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> On Mon, Jun 21, 2021 at 07:36:59PM -0400, Joshua Preston > >>>>>>>>> wrote: > >>>>>>>>> > >> > >> > Super. This sounds promising. Josh > >>>>>>>>> > >> > >> > > >>>>>>>>> > >> > >> > On Mon, Jun 21, 2021 at 4:38 PM Laura N < > >>>>>>>>> lnatal55@gmail.com> > >>>>>>>>> > >> wrote: > >>>>>>>>> > >> > >> > > >>>>>>>>> > >> > >> > > Hi Joshua: > >>>>>>>>> > >> > >> > > I am preparing a worksheet with the most downloaded > >>>>>>>>> book per > >>>>>>>>> > >> country > >>>>>>>>> > >> > >> and I > >>>>>>>>> > >> > >> > > just finished Europe for instance. I’ll convert my > >>>>>>>>> file in a csv > >>>>>>>>> > >> > >> format and > >>>>>>>>> > >> > >> > > send it to you. > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Glad you enjoyed the idea, looks promising for the > >>>>>>>>> PG’s > >>>>>>>>> > >> birthday. The > >>>>>>>>> > >> > >> idea > >>>>>>>>> > >> > >> > > is to disseminate these infographics to our social > >>>>>>>>> media. > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Regards > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Laura > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > On Mon, Jun 21, 2021 at 21:26 Joshua Preston < > >>>>>>>>> > >> photopartner@gmail.com> > >>>>>>>>> > >> > >> > > wrote: > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > >> Hey Laura - neat idea. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> If the book data is available in a json or csv > >>>>>>>>> format, it might > >>>>>>>>> > >> be > >>>>>>>>> > >> > >> > >> possible to show the archive in a graphical form. > >>>>>>>>> The map idea > >>>>>>>>> > >> is > >>>>>>>>> > >> > >> only > >>>>>>>>> > >> > >> > >> feasible if there is geographic data (e.g. country > >>>>>>>>> name for each > >>>>>>>>> > >> > >> author). I > >>>>>>>>> > >> > >> > >> like the map infographics, but the Tableau software > >>>>>>>>> can't > >>>>>>>>> > >> replicate > >>>>>>>>> > >> > >> the > >>>>>>>>> > >> > >> > >> imagery those maps use. (Tableau is not a > >>>>>>>>> Geographical > >>>>>>>>> > >> Information > >>>>>>>>> > >> > >> System > >>>>>>>>> > >> > >> > >> with that type of capability). > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> I can see a timeline of sorts as well. e.g. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >>>>>>>>> https://public.tableau.com/app/profile/bo.mccready8742/viz/FilmGenrePopularity-1910-2018/GenreRelativePopularity > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> The easiest place to start is to figure out on your > >>>>>>>>> end if your > >>>>>>>>> > >> web > >>>>>>>>> > >> > >> > >> developers can export all the book data into the > >>>>>>>>> json or csv > >>>>>>>>> > >> format > >>>>>>>>> > >> > >> for > >>>>>>>>> > >> > >> > >> Tableau to read and then I can give you a better > >>>>>>>>> idea of what I > >>>>>>>>> > >> > >> might be > >>>>>>>>> > >> > >> > >> able to do. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> thanks for thinking of me. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> cheers. Josh > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > > -- > >>>>>>>>> > >> > >> > > Laura Natal Rodrigues > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Dr. Gregory B. Newby > >>>>>>>>> > >> > >> Chief Executive and Director > >>>>>>>>> > >> > >> Project Gutenberg Literary Archive Foundation > >>>>>>>>> www.gutenberg.org > >>>>>>>>> > >> > >> A 501(c)(3) not-for-profit organization with EIN > >>>>>>>>> 64-6221541 > >>>>>>>>> > >> > >> gbnewby@pglaf.org > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > > -- > >>>>>>>>> > >> > > Laura Natal Rodrigues > >>>>>>>>> > >> > > > >>>>>>>>> > >> > >>>>>>>>> > > > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> -- > >>>>>>> Laura Natal Rodrigues > >>>>>>> > >>>>>> -- > >>>>> Laura Natal Rodrigues > >>>>> > >>>> -- > >>>> Laura Natal Rodrigues > >>>> > >>> -- > >> Laura Natal Rodrigues > >> > >
_______________________________________________ socialmedia mailing list socialmedia@lists.pglaf.org https://lists.cpunks.org/mailman/listinfo/socialmedia
Paul, try these ones: *Project Gutenberg at 50* https://public.tableau.com/views/ProjectGutenbergat50/Dashareachartbysubject?:language=en-US&:display_count=n&:origin=viz_share_link&:showVizHome=no *Project Gutenberg ebooks Issued Over Time* https://public.tableau.com/views/ProjectGutenbergebooksIssuedOverTime/Dashheatmap?:language=en-US&:display_count=n&:origin=viz_share_link:showVizHome=no cheers Laura PS: I'll also post them @gutenberg_org on July 4. Em qui., 1 de jul. de 2021 às 18:28, Paul Biba via socialmedia < socialmedia@lists.pglaf.org> escreveu:
I'll put the charts up on our Facebook page on July 4, but the link to the Over Time chart resolves to a page doesn't exist error.
Paul
For personally curated ebook/elibrary/epublishing news by the former Editor-in-Chief of TeleRead follow me on Twitter @paulkbiba; Editor, Project Gutenberg's Facebook page: https://www.facebook.com/project.gutenberg
On Thu, Jul 1, 2021 at 11:27 AM Greg Newby via socialmedia < socialmedia@lists.pglaf.org> wrote:
Hi, socialmedia team. Laura & Joshua have collaborated towards some wonderful illustrations. These will be worthwhile to highlight in our social media feed. See links in the attached email.
Three other pending items, for July 4: - A blog post at Distributed Proofreaders - A new page in the About section of www.gutenberg.org (with a link from the main page) - Our monthly newsletter, which I Hope to send on July 4
Thanks for helping to celebrate the 50th year of Project Gutenberg! Greg
---------- Forwarded message ---------- From: Joshua Preston <photopartner@gmail.com> To: Laura N <lnatal55@gmail.com> Cc: Greg Newby <gbnewby@pglaf.org> Bcc: Date: Wed, 30 Jun 2021 21:59:36 -0400 Subject: Re: 50th PG birthday Thx Laura - I updated the catalog charts to let users go straight to a book!
At the link are the final two charts ready for the public.
*Project Gutenberg at 50*
*Project Gutenberg ebooks Issued Over Time*
p.s. My wife surprised me and whisked me away on holiday, which is why I've been slow to respond. I hope these two charts can help the 50th bday festivities and give PG exposure on social media.
I'd still like to tackle the map project when the data is available. We can pick back up anytime you'd like after this week.
cheers. Josh
On Sat, Jun 26, 2021 at 12:51 PM Laura N <lnatal55@gmail.com> wrote:
Great work, liked the data with LoCC!!
All URLs have the following format:
https://www.gutenberg.org/ebooks/#ebook
#ebook is available in our catalogue, does it suit you?
Laura
Em sáb., 26 de jun. de 2021 às 17:44, Joshua Preston < photopartner@gmail.com> escreveu:
The category graphic could drive ebook downloads if the url for each title was available. I'd snatch up some H.G. Wells right now!
[image: image.png]
On Sat, Jun 26, 2021 at 12:30 PM Joshua Preston <photopartner@gmail.com> wrote:
The LoCC data is perfect for many types of analysis. I took the Wikipedia version of the LoCC since it was "clean" and fairly complete.
Check out the two timeline-based charts at the link (toggle with the tabs at the top. I will keep all the data graphics in this one spot until we publish and then separate them).
https://public.tableau.com/app/profile/joshpreston/viz/PGtest1_shelvesheatma...
Keep the data coming!
On Sat, Jun 26, 2021 at 11:45 AM Greg Newby <gbnewby@pglaf.org> wrote:
Thanks again for all this great work!
For your questions #2 below: No, we don't retain statistics on downloads (it's actually part of our privacy policy). Definitely nothing that ties country of download to individual book titles.
It would be possible to harvest past top-100 pages from the wayback machine, though.
Best, Greg
On Sat, Jun 26, 2021 at 04:30:22PM +0100, Laura N wrote: > Excellent Josh! Let's keep this last one as a general overview. > > 1. I wonder if you could prepare another tableau with the LoCC column. > > The subject areas are the following: > > https://www.loc.gov/catdir/cpso/lcco/ > > And the complete classification may be found here: > > https://www.loc.gov/aba/publications/FreeLCC/freelcc.html > > 2. Another approach could be the top authors in the last 30 days; > > 3. Top 100 ebooks in the last 30 days. > > Perhaps, in the future, we could work with items 2&3 over one year period. > I'll start to save these files once a month and see what can be done. > > Since our deadline is too short, let's work with the available data. > > Thanks a lot!! > > Laura > > Em sáb., 26 de jun. de 2021 às 02:23, Joshua Preston < photopartner@gmail.com> > escreveu: > > > Thanks! good night. An update: > > https://public.tableau.com/app/profile/joshpreston/viz/PGtest1_shelvesheatma... > > > > > > [image: image.png] > > > > On Fri, Jun 25, 2021 at 9:05 PM Laura N <lnatal55@gmail.com> wrote: > > > >> That’s why I’ll try to clean the data from the authors’ column in our > >> catalogue. As soon as I get some results I'll let you know ( I work in > >> European time zone). > >> > >> On Sat, Jun 26, 2021 at 01:51 Joshua Preston < photopartner@gmail.com> > >> wrote: > >> > >>> Got it! I'll be drilling down into the book classifications. Really god > >>> stuff. > >>> > >>> I meant to ask, I wanted to get more details on this idea you > >>> mentioned: Josh I wonder if you could work with authors on a new timeline. > >>> > >>> On Fri, Jun 25, 2021 at 8:30 PM Laura N <lnatal55@gmail.com> wrote: > >>> > >>>> In addition I am converting David’s list into a csv file in order to > >>>> try to make a cross reference with our catalogue’s data. The way the > >>>> authors are listed there doesn’t allow to get some statistics since all > >>>> collaborators/translators/illustrators are in the same category as the main > >>>> author. Let’s see what we can do since PG’s server only keeps data from the > >>>> last 30 days. > >>>> > >>>> On Sat, Jun 26, 2021 at 01:03 Laura N <lnatal55@gmail.com> wrote: > >>>> > >>>>> In the above link just click in Classification in order to see all > >>>>> subject areas and the listing of all subclasses. Hope it helps in someway… > >>>>> > >>>>> On Sat, Jun 26, 2021 at 00:56 Joshua Preston < photopartner@gmail.com> > >>>>> wrote: > >>>>> > >>>>>> Thx Laura! I meant to ask, can you expand on this? > >>>>>> > >>>>>> Josh I wonder if you could work with authors on a new timeline. > >>>>>> > >>>>>> On Fri, Jun 25, 2021 at 7:44 PM Laura N <lnatal55@gmail.com> wrote: > >>>>>> > >>>>>>> Your answer for item 1 may be found below: > >>>>>>> > >>>>>>> https://en.m.wikipedia.org/wiki/Library_of_Congress_Classification > >>>>>>> > >>>>>>> > >>>>>>> On Sat, Jun 26, 2021 at 00:02 Joshua Preston < photopartner@gmail.com> > >>>>>>> wrote: > >>>>>>> > >>>>>>>> I'm starting to see the contours of the datasets and understand > >>>>>>>> them better. I wanted to be conservative with data requests until I got > >>>>>>>> more of an understanding of what is available. > >>>>>>>> Below is what I hope is available so I can focus on the > >>>>>>>> timeline-based and map-based data graphics. I can focus my efforts there so > >>>>>>>> we can have something 'wow' for social media before next weekend. > >>>>>>>> > >>>>>>>> 1- is there a key for the subjects that spells out what they are? > >>>>>>>> These are the subject codes, correct? > >>>>>>>> > >>>>>>>> e.g. > >>>>>>>> > >>>>>>>> LoCC > >>>>>>>> AC > >>>>>>>> AC; B > >>>>>>>> AC; BL > >>>>>>>> AC; DA > >>>>>>>> AC; E300 > >>>>>>>> AC; F1001 > >>>>>>>> AC; KF > >>>>>>>> AC; PA > >>>>>>>> AC; PL > >>>>>>>> AC; PN > >>>>>>>> AC; PR > >>>>>>>> AC; PS > >>>>>>>> AC; Q; QH > >>>>>>>> AC; QA > >>>>>>>> AC; TJ > >>>>>>>> AC; TX > >>>>>>>> 2a- is there a data file of lifetime downloads of each book/story > >>>>>>>> title? > >>>>>>>> 2b- by month/year or year? > >>>>>>>> 2c- by country? > >>>>>>>> > >>>>>>>> 3- Would it be possible to get the top 100 downloads by month from > >>>>>>>> the last 12 months for each country and for each state in the U.S.? > >>>>>>>> > >>>>>>>> > >>>>>>>> thanks in advance. > >>>>>>>> > >>>>>>>> Josh > >>>>>>>> > >>>>>>>> > >>>>>>>> On Fri, Jun 25, 2021 at 6:25 PM Greg Newby < gbnewby@pglaf.org> > >>>>>>>> wrote: > >>>>>>>> > >>>>>>>>> These are all great - thanks! > >>>>>>>>> > >>>>>>>>> The bookshelves have mostly not been updated in years, > >>>>>>>>> unfortunately... we do have subject codes in the CSV I sent, which are all > >>>>>>>>> up-to-date. The subject codes might be another way to categorize. > >>>>>>>>> - Greg > >>>>>>>>> > >>>>>>>>> On Fri, Jun 25, 2021 at 06:37:30PM +0100, Laura N wrote: > >>>>>>>>> > Awesome, thanks! In the meantime, I've been working with the > >>>>>>>>> pg_catalogue > >>>>>>>>> > and I made this plot: > >>>>>>>>> > > >>>>>>>>> > [image: image.png] > >>>>>>>>> > > >>>>>>>>> > I also found a very nice plot by continent which I intended to > >>>>>>>>> make since > >>>>>>>>> > the beginning, please take a look. It's very straightforward and > >>>>>>>>> I can use > >>>>>>>>> > the data I collected from the top downloads in the last 30 days > >>>>>>>>> and make a > >>>>>>>>> > similar pic. > >>>>>>>>> > > >>>>>>>>> > https://geediting.com/most-iconic-book-set-in-every-country/ > >>>>>>>>> > > >>>>>>>>> > Any additional ideas? Josh I wonder if you could work with > >>>>>>>>> authors on a new > >>>>>>>>> > timeline. Do you think it is possible? > >>>>>>>>> > > >>>>>>>>> > Regards > >>>>>>>>> > > >>>>>>>>> > Laura > >>>>>>>>> > > >>>>>>>>> > Em sex., 25 de jun. de 2021 às 03:49, Joshua Preston < > >>>>>>>>> photopartner@gmail.com> > >>>>>>>>> > escreveu: > >>>>>>>>> > > >>>>>>>>> > > 50 years! congrats. I missed that... > >>>>>>>>> > > > >>>>>>>>> > > Below is a link to a time-based chart to start off with. > >>>>>>>>> > > > >>>>>>>>> > > (If the catalog has lifetime downloads, that would make for a > >>>>>>>>> great > >>>>>>>>> > > visual.) > >>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> https://public.tableau.com/views/PGtest1_shelvesheatmap/Dashboard2?:language=en-US&:display_count=n&:origin=viz_share_link:showVizHome=no > >>>>>>>>> > > > >>>>>>>>> > > On Thu, Jun 24, 2021 at 7:46 PM Greg Newby < gbnewby@pglaf.org> > >>>>>>>>> wrote: > >>>>>>>>> > > > >>>>>>>>> > >> Hi, Josh. Responses: > >>>>>>>>> > >> > >>>>>>>>> > >> On Thu, Jun 24, 2021 at 06:31:36PM -0400, Joshua Preston > >>>>>>>>> wrote: > >>>>>>>>> > >> > Hey Greg and Laura (aka birthday planners) > >>>>>>>>> > >> > > >>>>>>>>> > >> > Ya'll are really delivering on the data – geographic > >>>>>>>>> fields, date > >>>>>>>>> > >> fields, > >>>>>>>>> > >> > download counts, etc. Thanks for the good clean datasets. > >>>>>>>>> It puts us in > >>>>>>>>> > >> a > >>>>>>>>> > >> > good position. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Laura - I'll continue with your inspiration for a map-based > >>>>>>>>> option. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Greg - Thanks for the sneak peek and access to what I am > >>>>>>>>> assuming is the > >>>>>>>>> > >> > entire ebook library? Good stuff. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Question: what is the 'Issued' date field? I'm trying to > >>>>>>>>> reconcile the > >>>>>>>>> > >> 25th > >>>>>>>>> > >> > bday and how there are some dates that are older than 25 > >>>>>>>>> years. > >>>>>>>>> > >> > >>>>>>>>> > >> This is the 50th Birthday. The first eBook was July 4, 1971. > >>>>>>>>> > >> > >>>>>>>>> > >> The "Issued" date is the release date: the date the eBook > >>>>>>>>> first appeared > >>>>>>>>> > >> in the PG catalog. Clearly this could be used as the basis of > >>>>>>>>> a timeline > >>>>>>>>> > >> graphic. > >>>>>>>>> > >> > >>>>>>>>> > >> > > >>>>>>>>> > >> > For the group effort - both datasets include the 'primary > >>>>>>>>> key' or > >>>>>>>>> > >> unique ID > >>>>>>>>> > >> > for each book. PLEASE keep including that as you send any > >>>>>>>>> more data. > >>>>>>>>> > >> That's > >>>>>>>>> > >> > key (pun) to stitching all the data sources together. > >>>>>>>>> > >> > >>>>>>>>> > >> Yes. That's the eBook number, a.k.a. accession number. It is > >>>>>>>>> our unique > >>>>>>>>> > >> key, always. > >>>>>>>>> > >> > >>>>>>>>> > >> > I'll start sending bare-bones prototypes your way. As we > >>>>>>>>> get closer to > >>>>>>>>> > >> > final charts I'll start to consider the design aesthetic. > >>>>>>>>> The first > >>>>>>>>> > >> thing > >>>>>>>>> > >> > to do is design on the basic presentation of the data. > >>>>>>>>> Feedback is > >>>>>>>>> > >> > appreciated. > >>>>>>>>> > >> > >>>>>>>>> > >> You might also like this very thorough timeline: > >>>>>>>>> > >> https://archive.org/details/history-ebooks > >>>>>>>>> > >> > >>>>>>>>> > >> It was created by Marie Lebert, who also wrote some of the > >>>>>>>>> articles in > >>>>>>>>> > >> the About section of www.gutenberg.org > >>>>>>>>> > >> > >>>>>>>>> > >> Excellent! Thanks again. > >>>>>>>>> > >> Greg > >>>>>>>>> > >> > >>>>>>>>> > >> > >>>>>>>>> > >> > If at some point, a proper chat is in order, my cell is > >>>>>>>>> below. > >>>>>>>>> > >> > > >>>>>>>>> > >> > cheers. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Josh > >>>>>>>>> > >> > 678.231.0787 > >>>>>>>>> > >> > > >>>>>>>>> > >> > On Thu, Jun 24, 2021 at 3:31 PM Laura N < lnatal55@gmail.com> > >>>>>>>>> wrote: > >>>>>>>>> > >> > > >>>>>>>>> > >> > > This is great Greg! I’ll take a look and see what else I > >>>>>>>>> can do with > >>>>>>>>> > >> these > >>>>>>>>> > >> > > data. > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > Regards and thanks for your help > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > Laura > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > On Thu, Jun 24, 2021 at 20:21 Greg Newby < > >>>>>>>>> gbnewby@pglaf.org> wrote: > >>>>>>>>> > >> > > > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Hi, Josh. In case it's of interest, we have a brand-new > >>>>>>>>> CSV metadata > >>>>>>>>> > >> > >> extract: > >>>>>>>>> > >> > >> > >>>>>>>>> https://www.gutenberg.org/cache/epub/feeds/pg_catalog.csv.zip > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> This is not currently advertised anywhere, because there > >>>>>>>>> are still > >>>>>>>>> > >> some > >>>>>>>>> > >> > >> other fields we are trying to add. It's not > >>>>>>>>> automatically updated. > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Thanks again for looking into some nice visualization > >>>>>>>>> products for > >>>>>>>>> > >> > >> Project Gutenberg! > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Best, > >>>>>>>>> > >> > >> Greg > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> On Mon, Jun 21, 2021 at 07:36:59PM -0400, Joshua Preston > >>>>>>>>> wrote: > >>>>>>>>> > >> > >> > Super. This sounds promising. Josh > >>>>>>>>> > >> > >> > > >>>>>>>>> > >> > >> > On Mon, Jun 21, 2021 at 4:38 PM Laura N < > >>>>>>>>> lnatal55@gmail.com> > >>>>>>>>> > >> wrote: > >>>>>>>>> > >> > >> > > >>>>>>>>> > >> > >> > > Hi Joshua: > >>>>>>>>> > >> > >> > > I am preparing a worksheet with the most downloaded > >>>>>>>>> book per > >>>>>>>>> > >> country > >>>>>>>>> > >> > >> and I > >>>>>>>>> > >> > >> > > just finished Europe for instance. I’ll convert my > >>>>>>>>> file in a csv > >>>>>>>>> > >> > >> format and > >>>>>>>>> > >> > >> > > send it to you. > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Glad you enjoyed the idea, looks promising for the > >>>>>>>>> PG’s > >>>>>>>>> > >> birthday. The > >>>>>>>>> > >> > >> idea > >>>>>>>>> > >> > >> > > is to disseminate these infographics to our social > >>>>>>>>> media. > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Regards > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Laura > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > On Mon, Jun 21, 2021 at 21:26 Joshua Preston < > >>>>>>>>> > >> photopartner@gmail.com> > >>>>>>>>> > >> > >> > > wrote: > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > >> Hey Laura - neat idea. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> If the book data is available in a json or csv > >>>>>>>>> format, it might > >>>>>>>>> > >> be > >>>>>>>>> > >> > >> > >> possible to show the archive in a graphical form. > >>>>>>>>> The map idea > >>>>>>>>> > >> is > >>>>>>>>> > >> > >> only > >>>>>>>>> > >> > >> > >> feasible if there is geographic data (e.g. country > >>>>>>>>> name for each > >>>>>>>>> > >> > >> author). I > >>>>>>>>> > >> > >> > >> like the map infographics, but the Tableau software > >>>>>>>>> can't > >>>>>>>>> > >> replicate > >>>>>>>>> > >> > >> the > >>>>>>>>> > >> > >> > >> imagery those maps use. (Tableau is not a > >>>>>>>>> Geographical > >>>>>>>>> > >> Information > >>>>>>>>> > >> > >> System > >>>>>>>>> > >> > >> > >> with that type of capability). > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> I can see a timeline of sorts as well. e.g. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >>>>>>>>> https://public.tableau.com/app/profile/bo.mccready8742/viz/FilmGenrePopulari... > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> The easiest place to start is to figure out on your > >>>>>>>>> end if your > >>>>>>>>> > >> web > >>>>>>>>> > >> > >> > >> developers can export all the book data into the > >>>>>>>>> json or csv > >>>>>>>>> > >> format > >>>>>>>>> > >> > >> for > >>>>>>>>> > >> > >> > >> Tableau to read and then I can give you a better > >>>>>>>>> idea of what I > >>>>>>>>> > >> > >> might be > >>>>>>>>> > >> > >> > >> able to do. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> thanks for thinking of me. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> cheers. Josh > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > > -- > >>>>>>>>> > >> > >> > > Laura Natal Rodrigues > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Dr. Gregory B. Newby > >>>>>>>>> > >> > >> Chief Executive and Director > >>>>>>>>> > >> > >> Project Gutenberg Literary Archive Foundation > >>>>>>>>> www.gutenberg.org > >>>>>>>>> > >> > >> A 501(c)(3) not-for-profit organization with EIN > >>>>>>>>> 64-6221541 > >>>>>>>>> > >> > >> gbnewby@pglaf.org > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > > -- > >>>>>>>>> > >> > > Laura Natal Rodrigues > >>>>>>>>> > >> > > > >>>>>>>>> > >> > >>>>>>>>> > > > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> -- > >>>>>>> Laura Natal Rodrigues > >>>>>>> > >>>>>> -- > >>>>> Laura Natal Rodrigues > >>>>> > >>>> -- > >>>> Laura Natal Rodrigues > >>>> > >>> -- > >> Laura Natal Rodrigues > >> > >
_______________________________________________ socialmedia mailing list socialmedia@lists.pglaf.org https://lists.cpunks.org/mailman/listinfo/socialmedia
_______________________________________________ socialmedia mailing list socialmedia@lists.pglaf.org https://lists.cpunks.org/mailman/listinfo/socialmedia
These two pages are scheduled to be posted on our Facebook page on July 4 at 0001 and 0002. If there are any changes to their urls please let me know so I can correct the scheduling page. For personally curated ebook/elibrary/epublishing news by the former Editor-in-Chief of TeleRead follow me on Twitter @paulkbiba; Editor, Project Gutenberg's Facebook page: https://www.facebook.com/project.gutenberg On Thu, Jul 1, 2021 at 11:27 AM Greg Newby via socialmedia < socialmedia@lists.pglaf.org> wrote:
Hi, socialmedia team. Laura & Joshua have collaborated towards some wonderful illustrations. These will be worthwhile to highlight in our social media feed. See links in the attached email.
Three other pending items, for July 4: - A blog post at Distributed Proofreaders - A new page in the About section of www.gutenberg.org (with a link from the main page) - Our monthly newsletter, which I Hope to send on July 4
Thanks for helping to celebrate the 50th year of Project Gutenberg! Greg
---------- Forwarded message ---------- From: Joshua Preston <photopartner@gmail.com> To: Laura N <lnatal55@gmail.com> Cc: Greg Newby <gbnewby@pglaf.org> Bcc: Date: Wed, 30 Jun 2021 21:59:36 -0400 Subject: Re: 50th PG birthday Thx Laura - I updated the catalog charts to let users go straight to a book!
At the link are the final two charts ready for the public.
*Project Gutenberg at 50*
*Project Gutenberg ebooks Issued Over Time*
p.s. My wife surprised me and whisked me away on holiday, which is why I've been slow to respond. I hope these two charts can help the 50th bday festivities and give PG exposure on social media.
I'd still like to tackle the map project when the data is available. We can pick back up anytime you'd like after this week.
cheers. Josh
On Sat, Jun 26, 2021 at 12:51 PM Laura N <lnatal55@gmail.com> wrote:
Great work, liked the data with LoCC!!
All URLs have the following format:
https://www.gutenberg.org/ebooks/#ebook
#ebook is available in our catalogue, does it suit you?
Laura
Em sáb., 26 de jun. de 2021 às 17:44, Joshua Preston < photopartner@gmail.com> escreveu:
The category graphic could drive ebook downloads if the url for each title was available. I'd snatch up some H.G. Wells right now!
[image: image.png]
On Sat, Jun 26, 2021 at 12:30 PM Joshua Preston <photopartner@gmail.com> wrote:
The LoCC data is perfect for many types of analysis. I took the Wikipedia version of the LoCC since it was "clean" and fairly complete.
Check out the two timeline-based charts at the link (toggle with the tabs at the top. I will keep all the data graphics in this one spot until we publish and then separate them).
https://public.tableau.com/app/profile/joshpreston/viz/PGtest1_shelvesheatma...
Keep the data coming!
On Sat, Jun 26, 2021 at 11:45 AM Greg Newby <gbnewby@pglaf.org> wrote:
Thanks again for all this great work!
For your questions #2 below: No, we don't retain statistics on downloads (it's actually part of our privacy policy). Definitely nothing that ties country of download to individual book titles.
It would be possible to harvest past top-100 pages from the wayback machine, though.
Best, Greg
On Sat, Jun 26, 2021 at 04:30:22PM +0100, Laura N wrote:
Excellent Josh! Let's keep this last one as a general overview.
1. I wonder if you could prepare another tableau with the LoCC
column. > > The subject areas are the following: > > https://www.loc.gov/catdir/cpso/lcco/ > > And the complete classification may be found here: > > https://www.loc.gov/aba/publications/FreeLCC/freelcc.html > > 2. Another approach could be the top authors in the last 30 days; > > 3. Top 100 ebooks in the last 30 days. > > Perhaps, in the future, we could work with items 2&3 over one year period. > I'll start to save these files once a month and see what can be done. > > Since our deadline is too short, let's work with the available data. > > Thanks a lot!! > > Laura > > Em sáb., 26 de jun. de 2021 às 02:23, Joshua Preston < photopartner@gmail.com> > escreveu: > > > Thanks! good night. An update: > > https://public.tableau.com/app/profile/joshpreston/viz/PGtest1_shelvesheatmap/Dashboard2 > > > > > > [image: image.png] > > > > On Fri, Jun 25, 2021 at 9:05 PM Laura N <lnatal55@gmail.com> wrote: > > > >> That’s why I’ll try to clean the data from the authors’ column in our > >> catalogue. As soon as I get some results I'll let you know ( I work in > >> European time zone). > >> > >> On Sat, Jun 26, 2021 at 01:51 Joshua Preston < photopartner@gmail.com> > >> wrote: > >> > >>> Got it! I'll be drilling down into the book classifications. Really god > >>> stuff. > >>> > >>> I meant to ask, I wanted to get more details on this idea you > >>> mentioned: Josh I wonder if you could work with authors on a new timeline. > >>> > >>> On Fri, Jun 25, 2021 at 8:30 PM Laura N <lnatal55@gmail.com> wrote: > >>> > >>>> In addition I am converting David’s list into a csv file in order to > >>>> try to make a cross reference with our catalogue’s data. The way the > >>>> authors are listed there doesn’t allow to get some statistics since all > >>>> collaborators/translators/illustrators are in the same category as the main > >>>> author. Let’s see what we can do since PG’s server only keeps data from the > >>>> last 30 days. > >>>> > >>>> On Sat, Jun 26, 2021 at 01:03 Laura N <lnatal55@gmail.com> wrote: > >>>> > >>>>> In the above link just click in Classification in order to see all > >>>>> subject areas and the listing of all subclasses. Hope it helps in someway… > >>>>> > >>>>> On Sat, Jun 26, 2021 at 00:56 Joshua Preston < photopartner@gmail.com> > >>>>> wrote: > >>>>> > >>>>>> Thx Laura! I meant to ask, can you expand on this? > >>>>>> > >>>>>> Josh I wonder if you could work with authors on a new timeline. > >>>>>> > >>>>>> On Fri, Jun 25, 2021 at 7:44 PM Laura N <lnatal55@gmail.com> wrote: > >>>>>> > >>>>>>> Your answer for item 1 may be found below: > >>>>>>> > >>>>>>> https://en.m.wikipedia.org/wiki/Library_of_Congress_Classification > >>>>>>> > >>>>>>> > >>>>>>> On Sat, Jun 26, 2021 at 00:02 Joshua Preston < photopartner@gmail.com> > >>>>>>> wrote: > >>>>>>> > >>>>>>>> I'm starting to see the contours of the datasets and understand > >>>>>>>> them better. I wanted to be conservative with data requests until I got > >>>>>>>> more of an understanding of what is available. > >>>>>>>> Below is what I hope is available so I can focus on the > >>>>>>>> timeline-based and map-based data graphics. I can focus my efforts there so > >>>>>>>> we can have something 'wow' for social media before next weekend. > >>>>>>>> > >>>>>>>> 1- is there a key for the subjects that spells out what they are? > >>>>>>>> These are the subject codes, correct? > >>>>>>>> > >>>>>>>> e.g. > >>>>>>>> > >>>>>>>> LoCC > >>>>>>>> AC > >>>>>>>> AC; B > >>>>>>>> AC; BL > >>>>>>>> AC; DA > >>>>>>>> AC; E300 > >>>>>>>> AC; F1001 > >>>>>>>> AC; KF > >>>>>>>> AC; PA > >>>>>>>> AC; PL > >>>>>>>> AC; PN > >>>>>>>> AC; PR > >>>>>>>> AC; PS > >>>>>>>> AC; Q; QH > >>>>>>>> AC; QA > >>>>>>>> AC; TJ > >>>>>>>> AC; TX > >>>>>>>> 2a- is there a data file of lifetime downloads of each book/story > >>>>>>>> title? > >>>>>>>> 2b- by month/year or year? > >>>>>>>> 2c- by country? > >>>>>>>> > >>>>>>>> 3- Would it be possible to get the top 100 downloads by month from > >>>>>>>> the last 12 months for each country and for each state in the U.S.? > >>>>>>>> > >>>>>>>> > >>>>>>>> thanks in advance. > >>>>>>>> > >>>>>>>> Josh > >>>>>>>> > >>>>>>>> > >>>>>>>> On Fri, Jun 25, 2021 at 6:25 PM Greg Newby < gbnewby@pglaf.org> > >>>>>>>> wrote: > >>>>>>>> > >>>>>>>>> These are all great - thanks! > >>>>>>>>> > >>>>>>>>> The bookshelves have mostly not been updated in years, > >>>>>>>>> unfortunately... we do have subject codes in the CSV I sent, which are all > >>>>>>>>> up-to-date. The subject codes might be another way to categorize. > >>>>>>>>> - Greg > >>>>>>>>> > >>>>>>>>> On Fri, Jun 25, 2021 at 06:37:30PM +0100, Laura N wrote: > >>>>>>>>> > Awesome, thanks! In the meantime, I've been working with the > >>>>>>>>> pg_catalogue > >>>>>>>>> > and I made this plot: > >>>>>>>>> > > >>>>>>>>> > [image: image.png] > >>>>>>>>> > > >>>>>>>>> > I also found a very nice plot by continent which I intended to > >>>>>>>>> make since > >>>>>>>>> > the beginning, please take a look. It's very straightforward and > >>>>>>>>> I can use > >>>>>>>>> > the data I collected from the top downloads in the last 30 days > >>>>>>>>> and make a > >>>>>>>>> > similar pic. > >>>>>>>>> > > >>>>>>>>> > https://geediting.com/most-iconic-book-set-in-every-country/ > >>>>>>>>> > > >>>>>>>>> > Any additional ideas? Josh I wonder if you could work with > >>>>>>>>> authors on a new > >>>>>>>>> > timeline. Do you think it is possible? > >>>>>>>>> > > >>>>>>>>> > Regards > >>>>>>>>> > > >>>>>>>>> > Laura > >>>>>>>>> > > >>>>>>>>> > Em sex., 25 de jun. de 2021 às 03:49, Joshua Preston < > >>>>>>>>> photopartner@gmail.com> > >>>>>>>>> > escreveu: > >>>>>>>>> > > >>>>>>>>> > > 50 years! congrats. I missed that... > >>>>>>>>> > > > >>>>>>>>> > > Below is a link to a time-based chart to start off with. > >>>>>>>>> > > > >>>>>>>>> > > (If the catalog has lifetime downloads, that would make for a > >>>>>>>>> great > >>>>>>>>> > > visual.) > >>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> https://public.tableau.com/views/PGtest1_shelvesheatmap/Dashboard2?:language=en-US&:display_count=n&:origin=viz_share_link:showVizHome=no > >>>>>>>>> > > > >>>>>>>>> > > On Thu, Jun 24, 2021 at 7:46 PM Greg Newby < gbnewby@pglaf.org> > >>>>>>>>> wrote: > >>>>>>>>> > > > >>>>>>>>> > >> Hi, Josh. Responses: > >>>>>>>>> > >> > >>>>>>>>> > >> On Thu, Jun 24, 2021 at 06:31:36PM -0400, Joshua Preston > >>>>>>>>> wrote: > >>>>>>>>> > >> > Hey Greg and Laura (aka birthday planners) > >>>>>>>>> > >> > > >>>>>>>>> > >> > Ya'll are really delivering on the data – geographic > >>>>>>>>> fields, date > >>>>>>>>> > >> fields, > >>>>>>>>> > >> > download counts, etc. Thanks for the good clean datasets. > >>>>>>>>> It puts us in > >>>>>>>>> > >> a > >>>>>>>>> > >> > good position. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Laura - I'll continue with your inspiration for a map-based > >>>>>>>>> option. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Greg - Thanks for the sneak peek and access to what I am > >>>>>>>>> assuming is the > >>>>>>>>> > >> > entire ebook library? Good stuff. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Question: what is the 'Issued' date field? I'm trying to > >>>>>>>>> reconcile the > >>>>>>>>> > >> 25th > >>>>>>>>> > >> > bday and how there are some dates that are older than 25 > >>>>>>>>> years. > >>>>>>>>> > >> > >>>>>>>>> > >> This is the 50th Birthday. The first eBook was July 4, 1971. > >>>>>>>>> > >> > >>>>>>>>> > >> The "Issued" date is the release date: the date the eBook > >>>>>>>>> first appeared > >>>>>>>>> > >> in the PG catalog. Clearly this could be used as the basis of > >>>>>>>>> a timeline > >>>>>>>>> > >> graphic. > >>>>>>>>> > >> > >>>>>>>>> > >> > > >>>>>>>>> > >> > For the group effort - both datasets include the 'primary > >>>>>>>>> key' or > >>>>>>>>> > >> unique ID > >>>>>>>>> > >> > for each book. PLEASE keep including that as you send any > >>>>>>>>> more data. > >>>>>>>>> > >> That's > >>>>>>>>> > >> > key (pun) to stitching all the data sources together. > >>>>>>>>> > >> > >>>>>>>>> > >> Yes. That's the eBook number, a.k.a. accession number. It is > >>>>>>>>> our unique > >>>>>>>>> > >> key, always. > >>>>>>>>> > >> > >>>>>>>>> > >> > I'll start sending bare-bones prototypes your way. As we > >>>>>>>>> get closer to > >>>>>>>>> > >> > final charts I'll start to consider the design aesthetic. > >>>>>>>>> The first > >>>>>>>>> > >> thing > >>>>>>>>> > >> > to do is design on the basic presentation of the data. > >>>>>>>>> Feedback is > >>>>>>>>> > >> > appreciated. > >>>>>>>>> > >> > >>>>>>>>> > >> You might also like this very thorough timeline: > >>>>>>>>> > >> https://archive.org/details/history-ebooks > >>>>>>>>> > >> > >>>>>>>>> > >> It was created by Marie Lebert, who also wrote some of the > >>>>>>>>> articles in > >>>>>>>>> > >> the About section of www.gutenberg.org > >>>>>>>>> > >> > >>>>>>>>> > >> Excellent! Thanks again. > >>>>>>>>> > >> Greg > >>>>>>>>> > >> > >>>>>>>>> > >> > >>>>>>>>> > >> > If at some point, a proper chat is in order, my cell is > >>>>>>>>> below. > >>>>>>>>> > >> > > >>>>>>>>> > >> > cheers. > >>>>>>>>> > >> > > >>>>>>>>> > >> > Josh > >>>>>>>>> > >> > 678.231.0787 > >>>>>>>>> > >> > > >>>>>>>>> > >> > On Thu, Jun 24, 2021 at 3:31 PM Laura N < lnatal55@gmail.com> > >>>>>>>>> wrote: > >>>>>>>>> > >> > > >>>>>>>>> > >> > > This is great Greg! I’ll take a look and see what else I > >>>>>>>>> can do with > >>>>>>>>> > >> these > >>>>>>>>> > >> > > data. > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > Regards and thanks for your help > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > Laura > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > > >>>>>>>>> > >> > > On Thu, Jun 24, 2021 at 20:21 Greg Newby < > >>>>>>>>> gbnewby@pglaf.org> wrote: > >>>>>>>>> > >> > > > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Hi, Josh. In case it's of interest, we have a brand-new > >>>>>>>>> CSV metadata > >>>>>>>>> > >> > >> extract: > >>>>>>>>> > >> > >> > >>>>>>>>> https://www.gutenberg.org/cache/epub/feeds/pg_catalog.csv.zip > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> This is not currently advertised anywhere, because there > >>>>>>>>> are still > >>>>>>>>> > >> some > >>>>>>>>> > >> > >> other fields we are trying to add. It's not > >>>>>>>>> automatically updated. > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Thanks again for looking into some nice visualization > >>>>>>>>> products for > >>>>>>>>> > >> > >> Project Gutenberg! > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Best, > >>>>>>>>> > >> > >> Greg > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> On Mon, Jun 21, 2021 at 07:36:59PM -0400, Joshua Preston > >>>>>>>>> wrote: > >>>>>>>>> > >> > >> > Super. This sounds promising. Josh > >>>>>>>>> > >> > >> > > >>>>>>>>> > >> > >> > On Mon, Jun 21, 2021 at 4:38 PM Laura N < > >>>>>>>>> lnatal55@gmail.com> > >>>>>>>>> > >> wrote: > >>>>>>>>> > >> > >> > > >>>>>>>>> > >> > >> > > Hi Joshua: > >>>>>>>>> > >> > >> > > I am preparing a worksheet with the most downloaded > >>>>>>>>> book per > >>>>>>>>> > >> country > >>>>>>>>> > >> > >> and I > >>>>>>>>> > >> > >> > > just finished Europe for instance. I’ll convert my > >>>>>>>>> file in a csv > >>>>>>>>> > >> > >> format and > >>>>>>>>> > >> > >> > > send it to you. > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Glad you enjoyed the idea, looks promising for the > >>>>>>>>> PG’s > >>>>>>>>> > >> birthday. The > >>>>>>>>> > >> > >> idea > >>>>>>>>> > >> > >> > > is to disseminate these infographics to our social > >>>>>>>>> media. > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Regards > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > Laura > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > > On Mon, Jun 21, 2021 at 21:26 Joshua Preston < > >>>>>>>>> > >> photopartner@gmail.com> > >>>>>>>>> > >> > >> > > wrote: > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > >> Hey Laura - neat idea. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> If the book data is available in a json or csv > >>>>>>>>> format, it might > >>>>>>>>> > >> be > >>>>>>>>> > >> > >> > >> possible to show the archive in a graphical form. > >>>>>>>>> The map idea > >>>>>>>>> > >> is > >>>>>>>>> > >> > >> only > >>>>>>>>> > >> > >> > >> feasible if there is geographic data (e.g. country > >>>>>>>>> name for each > >>>>>>>>> > >> > >> author). I > >>>>>>>>> > >> > >> > >> like the map infographics, but the Tableau software > >>>>>>>>> can't > >>>>>>>>> > >> replicate > >>>>>>>>> > >> > >> the > >>>>>>>>> > >> > >> > >> imagery those maps use. (Tableau is not a > >>>>>>>>> Geographical > >>>>>>>>> > >> Information > >>>>>>>>> > >> > >> System > >>>>>>>>> > >> > >> > >> with that type of capability). > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> I can see a timeline of sorts as well. e.g. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >>>>>>>>> https://public.tableau.com/app/profile/bo.mccready8742/viz/FilmGenrePopularity-1910-2018/GenreRelativePopularity > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> The easiest place to start is to figure out on your > >>>>>>>>> end if your > >>>>>>>>> > >> web > >>>>>>>>> > >> > >> > >> developers can export all the book data into the > >>>>>>>>> json or csv > >>>>>>>>> > >> format > >>>>>>>>> > >> > >> for > >>>>>>>>> > >> > >> > >> Tableau to read and then I can give you a better > >>>>>>>>> idea of what I > >>>>>>>>> > >> > >> might be > >>>>>>>>> > >> > >> > >> able to do. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> thanks for thinking of me. > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > >> cheers. Josh > >>>>>>>>> > >> > >> > >> > >>>>>>>>> > >> > >> > > -- > >>>>>>>>> > >> > >> > > Laura Natal Rodrigues > >>>>>>>>> > >> > >> > > > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > >> Dr. Gregory B. Newby > >>>>>>>>> > >> > >> Chief Executive and Director > >>>>>>>>> > >> > >> Project Gutenberg Literary Archive Foundation > >>>>>>>>> www.gutenberg.org > >>>>>>>>> > >> > >> A 501(c)(3) not-for-profit organization with EIN > >>>>>>>>> 64-6221541 > >>>>>>>>> > >> > >> gbnewby@pglaf.org > >>>>>>>>> > >> > >> > >>>>>>>>> > >> > > -- > >>>>>>>>> > >> > > Laura Natal Rodrigues > >>>>>>>>> > >> > > > >>>>>>>>> > >> > >>>>>>>>> > > > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> -- > >>>>>>> Laura Natal Rodrigues > >>>>>>> > >>>>>> -- > >>>>> Laura Natal Rodrigues > >>>>> > >>>> -- > >>>> Laura Natal Rodrigues > >>>> > >>> -- > >> Laura Natal Rodrigues > >> > >
_______________________________________________ socialmedia mailing list socialmedia@lists.pglaf.org https://lists.cpunks.org/mailman/listinfo/socialmedia
participants (3)
-
Greg Newby
-
Laura N
-
Paul Biba