rhu: (Default)
[personal profile] rhu
Oh, [livejournal.com profile] tahnan, you're not the only one who can have fun:

When did certain science terms come into play?
Facebook vs. LiveJournal (were you blogging in the 19th century?)
Nice to see which one wins, but again, what's with the tech term in the 19th century? Bad data tagging?
A propos of my recent question in the NYT On Language column
Something is very wrong here

Science!

(no subject)

Date: 2010-12-22 04:01 pm (UTC)
From: [identity profile] kirisutogomen.livejournal.com
The 'livejournal' thing seems to be some sort of mistake, but the 'google' stuff is genuine: Lots of 19th century googling.

Science!

(no subject)

Date: 2010-12-22 04:37 pm (UTC)
From: [identity profile] danchall.myvidoop.com (from livejournal.com)
Neat "scratch v scrap." The doubling of appearances of "scrap paper" since 1960 might have something to do with recycling.

(no subject)

Date: 2010-12-22 05:54 pm (UTC)
From: [personal profile] jadelennox
Google Books' metadata is notoriously bad. You can find librarians ranting about it if you search over the last couple of years, but the ngram viewer has made it all the more apparent.

Their OCR is pretty hideous, too, but that doesn't produce false positives in the same way, except for the funny ones like best/beft, and honestly you can't really expect OCR to do a great job on long S.

But their metadata is appalling.

(That last problem is easy, though. The dataset is case sensitive. Here, fixed it.)
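For anyone wondering why case sensitivity matters here, a minimal sketch in Python. The rows and their counts are invented for illustration and are not real Ngram data, and the two helper functions are hypothetical names, not part of any Google tool.

```python
# Hypothetical (ngram, count) rows, made up for illustration only.
rows = [("Science", 120), ("science", 300), ("SCIENCE", 15), ("scrap", 40)]

def exact_count(rows, term):
    """Case-sensitive lookup: only rows spelled exactly like `term` count."""
    return sum(count for ngram, count in rows if ngram == term)

def folded_count(rows, term):
    """Case-insensitive lookup: sum every capitalization of `term`."""
    key = term.casefold()
    return sum(count for ngram, count in rows if ngram.casefold() == key)

print(exact_count(rows, "science"))   # counts only the lowercase spelling
print(folded_count(rows, "science"))  # counts all three capitalizations
```

A query typed in the wrong case can look like the word barely exists, when the counts are simply filed under a different capitalization.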
Edited Date: 2010-12-22 05:57 pm (UTC)

(no subject)

Date: 2010-12-22 06:03 pm (UTC)
From: [identity profile] 530nm330hz.livejournal.com
It even says "case sensitive" in bold. *Sigh*

"Follows Instructions" = "Needs Improvement"

Thanks!

Profile

rhu: (Default)
Andrew M. Greene
