Joho the Bloglibraries Archives - Joho the Blog

August 11, 2015

1M copyright free images ready for viewing and tagging

The British Library has posted one million public domain images — images not subject to any copyright restrictions — at Flickr. (They did this at least a year ago, but it’s still worth noting, isn’t it?)

The public can view them, copy them, and reuse them freely in every regard. An article in Quartz by Anne Quito reports:


So far, these images, which range from Restoration-era cartoons to colonial explorers’ early photographs, have been used on rugs, album covers, gift tags, a mapping project, and an art installation at the Burning Man festival in Nevada, among other things.

The Library posted them not only so they could be enjoyed and reused, but so the public would do what the Library is not staffed to do all by itself: add tags. Says Quartz:

to date, the collection has garnered over 267 million views, and over 400,000 tags have been added to images on Flickr by users. Through a “tagathon” with the Wikimedia UK community, the Library discovered over 50,000 maps in the collection, which they are now in the process of fitting into a modern map.

I can’t figure out how to search within a collection at Flickr, but this view at least does some clustering.

1 Comment »

August 2, 2015

[2b2k][liveblog] Wayne Wiegand: Libraries beyond information

Wayne Wiegand is giving the lunchtime talk at the Library History Seminar XIII at Simmons College. He’s talking about his new book Part of Our Lives: A People’s History of the American Public Library.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.


He introduces himself as a humanist, which brings with it a curiosity about what it means to be a human in the world. He is flawed, born into a flawed culture. He exercises his curiosity in the field of library history. [He’s also the author of the best biography of Melvil Dewey.]


People love libraries, he says, citing the Pew Internet 2013 survey that showed that almost all institutions except libraries and first responders have fallen in public esteem. His new book traces the history of the public library by listening to people who have used them since the middle of the 19th century, a bottom-up perspective. He did much of his research by searching newspaper archives, finding letters to the editors as well as articles. =People love their libraries for (1) the info they make accessible, (2) the public space, and (3) the stories they circulate that make sense of their world.

Thomas Edison spent as much time as possible in the library. The Wright Brothers came upon an ornithology book that kindled their interest in flight. HS Truman cited the library as influential. Lilly Tomlin, too. Bill Clinton, too, especially loving books about native Americans. Barack Obama, too. “The first place I wanted to be was a library,” he said when he returned from overseas. He was especially interested in Kenya, the home of his father.


For most of its history, library info science discourse has focused on what was “useful knowledge” in the 19th century, “best books” in the 20th century, or what we now call “information.” Because people don’t have to use libraries (unlike, say, courts) users have greatly influenced the shape of libraries.


“To demonstrate library as place, let me introduce you to Ricky,” he says as he starts a video. She is an adult student who does her homework in the library. When she was broke, it was a warm place where she could apply for jobs.” She has difficulty working through her emotions to express how much the library means to her.

Wayne reads a librarian’s account of the very young MLK’s regular attendance at his public library. James Levine learned to play piano there. In 1969 the Gary Indiana held a talent conference; the Jackson brothers didn’t win, but Michael became a local favorite. [Who won???] In another library, a homeless man–Mr. Conrad– came in and set up a chess board. People listened and learned from him.


“To categorize these activities as information gathering fails to appreciate the richness” of the meaning of the library for these places.


Wayne plays another video. Maria is 95 years old. She started using the library when was 12 or 13 after her family had immigrated from Russia. “That library was everything to me.” Her family could not afford to buy books “and there were some many other servicces, it was library library library all the time.” “I have seen many ugly things. You can’t live all the time with the bad.” The library was something beautiful.


Pete Seeger remembered all his life stories he read in the library.


The young Ronald Reagan read a popular Christian novel, declared himself saved, and had himself baptized. He went to his public library twice a week, mainly reading adventure stories.


Oprah Winfrey’s library taught her that there was a better world and that she could be a part of it.


Sonia Sotamayor buried herself in reading in the public library after her father died when she was nine. Nancy Drew was formative: paying attention, finding clues, reaching logical conclusions.


Wayne plays a video of Danny, a young man who learned about music from CDs in the library, and found a movie that “dropped an emotional anchor down so I didn’t feel like I was floundering” in his sexuality.


Public libraries have always played a role in making stories accessible to everyone. Communities insist that libraries stock a set of stories that the community responds to. Stories stimulate imagination, construct community through shared reading, and make manifest moral weightings.


In his book, Wayne gives story, people, and place equal weight. “Stories and libraries as place has been as important, and for many people, more important than information.” We need to look at how these activities product human subjectivity as community-based. We lack a research base to comprehend the many ways libraries are used.


The death of libraries has been pronounced too early. In 2012, the US has more libraries than ever. Attendance in 2012 dipped because the hours libraries are open went down that year, but for the decade it was up 28%. [May have gotten the number wrong a bit.] In 2012, libraries circulated 2.2B items, up 28% from 2003. And more. [Too fast to capture.] The prophets of doom have too narrow a view of what libraries do and are. “We have to expand the boundaries of our professional discourse beyond information.”


Libraries fighting against budget cuts too often replicate the stereotypes. “Public libraries no longer are warehouses of book” gives credence to the falsehood that libraries ever were that.

He ends by introducing Dawn Logsdon who is working on a film for 2017 titled Free for All: Inside the Public Library. (She’s been taping people at the conference and assures the audience that whatever doesn’t make into the film will be available online.) She shows a few minutes of a prior documentary of hers: Faubourg Treme.

1 Comment »

June 1, 2015

[misc][liveblog] Alex Wright: The secret history of hypertext

I’m in Oslo for Kunnskapsorganisasjonsdagene, which my dear friend Google Translate tells me is Knowledge Organization Days. I have been in Oslo a few times before — yes, once in winter, which was as cold as Boston but far more usable — and am always re-delighted by it.

Alex Wright is keynoting this morning. The last time I saw him was … in Oslo. So apparently Fate has chosen this city as our Kismet. Also coincidence. Nevertheless, I always enjoy talking with Alex, as we did last night, because he is always thinking about, and doing, interesting things. He’s currently at Etsy , which is a fascinating and inspiring place to work, and is a professor interaction design,. He continues to think about the possibilities for design and organization that led him to write about Paul Otlet who created what Alex has called an “analog search engine”: a catalog of facts expressed in millions of index cards.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

Alex begins by telling us that he began as a librarian, working as a cataloguer for six years. He has a library degree. As he works in the Net, he finds himself always drawn back to libraries. The Net’s fascination with the new brings technologists to look into the future rather than to history. Alex asks, “How do we understand the evolution of the Web and the Net in an historical context?” We tend to think of the history of the Net in terms of computer science. But that’s only part of the story.

A big part of the story takes us into the history of libraries, especially in Europe. He begins his history of hypertext with the 16th century Swiss naturalist Conrad Gessner who created a “universal bibliography” by writing each entry on a slip of paper. Leibniz used the same technique, writing notes on slips of paper and putting them in an index cabinet he had built to order.

In the 18th century, the French started using playing cards to record information. At the beginning of the 19th, the Jacquard loom used cards to guide weaving patterns, inspiring Charles Babbage to create what many [but not me] consider to be the first computer.

In 1836, Isaac Adams created the steam powered printing press. This, along with economic and social changes, enabled the mass production of books, newspapers, and magazines. “This is when the information explosion truly started.”

To make sense of this, cataloging systems were invented. They were viewed as regimented systems that could bring efficiencies … a very industrial concept, Alex says.

“The mid-19th century was also a period of networking”: telegraph systems, telephones, internationally integrated postal systems. “Goods, people, and ideas were flowing across national borders in a way they never had before.” International journals. International political movements, such as Marxism. International congresses (conferences). People were optimistic about new political structures emerging.

Alex lists tech from the time that spread information: a daily reading of the news over copper wires, pneumatic tubes under cities (he references Molly Wright Steenson‘s great work on this), etc.

Alex now tells us about Paul Otlet, a Belgian who at the age of 15 started designing his own cataloging system. He and a partner, Henri La Fontaine, started creating bibliographies of disciplines, starting with the law. Then they began a project to create a universal bibliography.

Otlet thought libraries were focused on the wrong problem. Getting readers to the right book isn’t enough. People also need access to the information in the books. At the 1900 [?] world’s fair in Paris, Otlet and La Fontaine demonstrated their new system. They wanted to provide a universal language for expressing the connections among topics. It was not a top-down system like Dewey’s.

Within a few years, with a small staff (mainly women) they had 15 million cards in their catalog. You could buy a copy of the catalog. You could send a query by telegraphy, and get a response telegraphed back to you, for a fee.

Otlet saw this in a bigger context. He and La Fontaine created the Union of International Associations, an association of associations, as the governing body for the universal classification system. The various associations would be responsible for their discpline’s information.

Otlet met a Scotsman named Patrick Geddes who worked against specialization and the fracturing of academic disciplines. He created a camera obscura in Edinburgh so that people could see all of the city, from the royal areas and the slums, all at once. He wanted to stitch all this information together in a way that would have a social effect. [I’ve been there as a tourist and had no idea!] He also used visual forms to show the connections between topics.

Geddes created a museum, the Palais Mondial, that was organized like hypertext., bringing together topics in visually rich, engaging displays. The displays are forerunners of today’s tablet-based displays.

Another collaborator, Hendrik Christian Andersen, wanted to create a world city. He went deep into designing it. He and Otlet looked into getting land in Belgium for this. World War I put a crimp in the idea of the world joining in peace. Otlet and Andersen were early supporters of the idea of a League of Nations.

After the War, Otlet became a progressive activist, including for women’s rights. As his real world projects lost momentum, in the 1930s he turned inward, thinking about the future. How could the new technologies of radio, television, telephone, etc., come together? (Alex shows a minute from the documentary, The Man who wanted to Classify the World.”) Otlet imagines a screen and television instead of books. All the books and info are in a separate facility, feeding the screen. “The radiated library and the televised book.” 1934.

So, why has no one ever heard of Otlet? In part because he worked in Belgium in the 1930s. In the 1940s, the Nazis destroyed his work. They replaced his building, destrooying 70 tons of materials, with an exhibit of Nazi art.

Although there are similarities to the Web, how Otlet’s system worked was very different. His system was a much more controlled environment, with a classification system, subject experts, etc. … much more a publishing system than a bottom-up system. Linked Data and the Semantic Web are very Otlet-ish ideas. RDF triples and Otlet’s “auxiliary tables” are very similar.

Alex now talks about post-Otlet hypertext pioneers.

H.G. Wells’ “World Brain” essay from 1938. “The whole human memory can be, and probably in a shoirt time will be, made accessibo every individual.” He foresaw a complete and freely avaiable encyclopedia. He and Otlet met at a conference.

Emanuel Goldberg wanted to encode punchcard-style information on microfilm for rapid searching.

Then there’s Vannevar Bush‘s Memex that would let users create public trails between documents.

And Liklider‘s idea that different types of computers should be able to share infromation. And Engelbart who in 1968’s “Mother of all Demos” had a functioning hypertext system.

Ted Nelson thought computer scientists were focused on data computation rather than seeing computers as tools of connection. He invnted the term “hypertext,” the Xanadu web, and “transclusion” (embedding a doc in another doc). Nelson thought that links always should be two way. Xanadu= “intellectual property” controls built into it.

The Internet is very flat, with no central point of control. It’s self-organizing. Private corporations are much bigger on the Net than Otlet, Engelbart, and Nelson envisioned “Our access to information is very mediated.” We don’t see the classification system. But at sites like Facebook you see transclusion, two-way linking, identity management — needs that Otlet and others identified. The Semantic Web takes an Otlet-like approach to classification, albeit perhaps by algorithms rather than experts. Likewise, the Google “knowledge vaults” project tries to raise the ranking of results that come from expert sources.

It’s good to look back at ideas that were left by the wayside, he concludes, having just decisively demonstrated the truth of that conclusion :)

Q: Henry James?

A: James had something of a crush on Anderson, but when he saw the plan for the World City told him that it was a crazy idea.

[Wonderful talk. Read his book.]

2 Comments »

April 30, 2015

A UN museum?

I got to spend yesterday with an awesome group of about twenty people at the United Nations, brainstorming what a UN museum might look like. This was under the auspices of the UN Live project which (I believe) last week was endorsed by UN Secretary General Ban Ki-moon.

Some of the people at the meeting
Some of us

Although it was a free-ranging discussion from many points of view, there seemed to be general implicit agreement about a few points. (What the UN Live group does with this discussion is up to them, of course.)

Security Council
Where we did not meet

First, there was no apparent interest in constructing a museum that takes telling the UN’s story as its focus. Rather, the discussion was entirely about ways in which the values of the UN could be furthered by enabling people to connect with one another around the world.

Second, No one even considered the possibility that it might be only a physical museum. Physical elements were part of many of the ideas, but primarily to enable online services.

Here are some of the ideas that I particularly liked, starting (how rude!) with mine.

I stole it directly from a Knight Foundation proposal by my friend Nate Hill at Chattanooga Public Library. He proposed setting up 4K displays in a few libraries that have gigabit connections, to enable local residents to interact with one another. At the meeting yesterday I suggested (crediting Nate, but probably too fast for anyone to hear me, so I’m clear, right?) that the Museum be distributed via “magic mirrors” – Net-connected video monitors – that connect citizens globally. These would go into libraries and other safe spaces where there can be facilitators. (We’re all local people, so we need help talking globally.) Where possible, there might be two screens so that people can see themselves and the group they’re talking with. (For some reason, I like the idea of the monitors being circular. More like portals.)

These magic mirrors would be a platform for activities to be invented. For example:

  • Kids could play together. Virtual Jenga? Keep a virtual ball afloat? (Assume Kinect-like sensors.) Collaborative virtual jigsaw puzzle of a photo of one of their home towns? Or maybe each group is working collaboratively on one puzzle, but each team’s pieces are part of the image of the other’s team’s home. A simple mirror imitation game where each kid mimics the other’s movements? It’s a platform, so it’d be open to far better ideas than these.

  • Kids could create together. Collaborative drawing? Collaborative crazy machines a la Rube Goldberg?

  • Real-time, video AMAs: “We’re Iranian parents. AUA [ask us anything] at 10am EDT.”

  • Listings for other activities, including those proposed below.

Someone suggested that the UN create pop-up museums by bringing in a shipping container stocked with media tools. (Technically, a plop-down museum, it seems to me.) The local community would be invited to tell its story, perhaps in 100 images (borrowing the British Museum’s “A History of the World in 100 Objects”), or perhaps by providing a StoryCorps-style recording booth. Or send the kids out with video cameras. (There might have to be someone who could help with the media.) The community would be able to tell its story to the world. The world could react and interact. (These containers could contain magic mirrors.)

Another idea: Facilitate local people coming together virtually to share solutions to common problems, building on the multiple and admirable efforts to do this already.

Another idea: One group pointed out that museums typically face backwards in time. So suppose the UN museum instead constructed itself in real time as significant events occurred. E.g., as an earthquake disaster unrolls, the UN Museum would track it live, presenting its consequences intimately to the world, recording it for posterity, and facilitating relief efforts.

There was general agreement, I believe, that all of the UN Museum’s content should be openly available through APIs.

There were many, many more ideas, many of which I find exciting. I don’t know if any of the ideas discussed are going to make it past the cool-way-to-spend-an-afternoon phase, but I am thrilled by the general prospect of a UN Museum that takes as its mission not just the curation of artifacts that tell a story but advancing the UN’s mission by connecting people globally around common concerns, shared interests, and a desire to help and delight one another.

Now go ahead and be cynical and snarky.

2 Comments »

February 23, 2015

The library-sized hole in the Internet

Sarah Bartlett of OCLC interviewed me at some length about the future of libraries. You can read it here.

At some point I will write up the topic of my talk at the OCLC’s EMEA Regional Council Meeting in Florence: libraries as community centers…of meaning.

2 Comments »

February 8, 2015

Libraries as community centers…of meaning

Well, it’s snowing in Boston and I’m in Florence. Italy. (I’m SO sorry, Ann!) I’m here to keynote an OCLC EMEA (Europe, Middle East, Africa) conference about libraries.

After three major revisions, I believe that on Tuesday I’m going to propose thinking about libraries as community centers. But not the usual sort where local people gather, work, socialize, play, learn … all good things, for sure. In addition, I’m going to suggest that they view themselves as community centers of meaning.

I know it sounds silly, and I’m open to better phrases, but I think it’s not entirely pointless. (The idea arose in a conversation with Robert Fleming, executive director of the Emerson College library. I’m teaching a course at Emerson this semester.)

The idea is simple. It used to be that once a user checked a book out of the library, the library was out of the loop. The user read it at home, talked about it with friends or a Significant Other, maybe spent an evening with a book club discussing it. The library might be slightly in the loop if they enabled users to review or rate books, or if they have an awesome Awesome Box. But even so, the pickings were pretty slim.

Now, of course, users are likely to talk on line about what they’re reading. At least as important, the library has tons of metadata that it can use to gauge how relevant an item is to its community, and even get a glimmer of what makes it relevant. Of course, much of this information is private, but there are ways to use it without violating anyone’s privacy.

If a community can be made more aware of what it’s finding meaningful and relevant, it can learn from itself, push its own boundaries, unearth new ideas, and find ever-better disagreements.

Note that I am not suggesting that libraries curate community meaning. Rather, libraries can provide services to facilitate the development of community meaning, making the community aware of itself. And this is of course an additional opportunity for librarians to contribute their own expertise at contextualizing and expanding our understanding.

Who is currently the custodian of community meaning? No one. Who is in the best position to be that custodian and facilitator? Your local library.

2 Comments »

February 2, 2015

Future of libraries, Kenya style

This video will remind you, if you happen to have forgotten, what libraries mean to much of the world:

Internet, mesh, people eager to learn, the same people eager to share. A future for libraries.

You can contribute here.

Be the first to comment »

January 7, 2015

Harvard Library adopts LibraryCloud

According to a post by the Harvard Library, LibraryCloud is now officially a part of the Library toolset. It doesn’t even have the word “pilot” next to it. I’m very happy and a little proud about this.

LibraryCloud is two things at once. Internal to Harvard Library, it’s a metadata hub that lets lots of different data inputs be normalized, enriched, and distributed. As those inputs change, you can change LibraryCloud’s workflow process once, and all the apps and services that depend upon those data can continue to work without making any changes. That’s because LibraryCloud makes the data that’s been input available through an API which provides a stable interface to that data. (I am overstating the smoothness here. But that’s the idea.)

To the Harvard community and beyond, LibraryCloud provides open APIs to access tons of metadata gathered by Harvard Library. LibraryCloud already has metadata about 18M items in the Harvard Library collection — one of the great collections — including virtually all the books and other items in the catalog (nearly 13M), a couple of million of images in the VIA collection, and archives at the folder level in Harvard OASIS. New data can be added relatively easily, and because LibraryCloud is workflow based, that data can be updated, normalized and enriched automatically. (Note that we’re talking about metadata here, not the content. That’s a different kettle of copyrighted fish.)

LibraryCloud began as an idea of mine (yes, this is me taking credit for the idea) about 4.5 years ago. With the help of the Harvard Library Innovation Lab, which I co-directed until a few months ago, we invited in local libraries and had a great conversation about what could be done if there were an open API to metadata from multiple libraries. Over time, the Lab built an initial version of LibraryCloud primarily with Harvard data, but with scads of data from non-Harvard sources. (Paul Deschner, take many many bows. Matt Phillips, too.) This version of LibraryCloud — now called lilCloud — is still available and is still awesome.

With the help of the Library Lab, a Harvard internal grant-giving group, we began a new version based on a workflow engine and hosted in the Amazon cloud. (Jeffrey Licht, Michael Vandermillen, Randy Stern, Paul Deschner, Tracey Robinson, Robin Wendler, Scott Wicks, Jim Borron, Mary Lee Kennedy, and many more, take bows as well. And we couldn’t have done it without you, Arcardia Foundation!) (Note that I suffer from Never Gets a List Right Syndrome, so if I left you out, blame my brain and let me know. Don’t be shy. I’m ashamed already.)

The Harvard version of LibraryCloud is a one-library implementation, although that one library comprises 73 libraries. Thus the LibraryCloud Harvard has adopted is a good distance from the initial vision of a single API for accessing multiple libraries. But it’s a big first step. It’s open source code [documentation]. Who knows?

I think it’s impressive that Harvard Library has taken this step toward adopting a platform architecture, and it’s cool beyond cool that this architecture is further opening up Harvard Library’s metadata riches to any developer or site that wants to use it. (This also would not have happened without Harvard Library’s enlightened Open Metadata policy.)

1 Comment »

November 18, 2014

[2b2k] Four things to learn in a learning commons

Last night I got to give a talk at a public meeting of the Gloucester Education Foundation and the Gloucester Public School District. We talked about learning commons and libraries. It was awesome to see the way that community comports itself towards its teachers, students and librarians, and how engaged they are. Truly exceptional.

Afterwards there were comments by Richard Safier (superintendent), Deborah Kelsey (director of the Sawyer Free Library), and Samantha Whitney (librarian and teacher at the high school), and then a brief workshop at the attendees tables. The attendees included about a dozen of Samantha’s students; you can see in the liveliness of her students and the great questions they asked that Samantha is an inspiring teacher.

I came out of these conversations thinking that if my charter were to establish a “learning commons” in a school library, I’d ask what sort of learning I want to be modeled in that space. I think I’d be looking for four characteristics:

1. Students need to learn the basics (and beyond!) of online literacy: not just how to use the tools, but, more important, how to think critically in the networked age. Many schools are recognizing that, thankfully. But it’s something that probably will be done socially as often as not: “Can I trust a site?” is a question probably best asked of a network.

2. Old-school critical thinking was often thought of as learning how to sift claims so that only that which is worth believing makes it through. Those skills are of course still valuable, but on a network we are almost always left with contradictory piles of sifted beliefs. Sometimes we need to dispute those other beliefs because they are simply wrong. But on a network we also need to learn to live with difference — and to appreciate difference — more than ever. So, I would take learning to love difference to be an essential skill.

3. It kills me that most people have never clicked on a Wikipedia “Talk” page to see the discussion that resulted in the article they’re reading. If we’re going to get through this thing — life together on this planet — we’re really going to have to learn to be more meta-aware about what we read and encounter online. The old trick of authority was to erase any signs of what produced the authoritative declaration. We can’t afford that any more. We need always to be aware the what we come across resulted from humans and human processes.

4. We can’t rely on individual brains. We need brains that are networked with other brains. Those networks can be smarter than any of their individual members, but only if the participants learn how to let the group make them all smarter instead of stupider.

I am not sure how these skills can be taught — excellent educators and the communities that support them, like those I met last night, are in a better position to figure it out — but they are four skills that seem highly congruent with a networked learning commons.

1 Comment »

October 27, 2014

[liveblog] Christine Borgmann

Christine Borgman, chair of Info Studies at UCLA, and author of the essential Scholarship in the Digital Age, is giving a talk on The Knowledge Infrastructure of Astronomy. Her new book is Big Data, Little Data, No Data: Scholarship in the Networked World, but you’ll have to wait until January. (And please note that precisely because this is a well-organized talk with clearly marked sections, it comes across as choppy in these notes.)

NOTE: Live-blogging. Getting things wrong. Missing points.Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

Her new book draws on 15 yrs of studying various disciplines and 7-8 years focusing on astronomy as a discipline. It’s framed around the change to more data-intensive research across the sciences and humanities plus, the policy push for open access to content and to data. (The team site.)

They’ve been looking at four groups:

The world thinks that astronomy and genomics have figured out how to do data intensive science, she says. But scientists in these groups know that it’s not that straightforward. Christine’s group is trying to learn from these groups and help them learn from one another

Knowledge Infrastructures are “string and baling wire.” Pieces pulled together. The new layered on top of the old.

The first English scientific journal began almost 350 yrs ago. (Philosophical Transactions of the Royal Academy.) We no longer think of the research object as a journal but as a set of articles, objects, and data. People don’t have a simple answer to what is their data. The raw files? The tables of data? When they’re told to share their data, they’re not sure what data is meant.”Even in astronomy we don’t have a single, crisp idea of what are our data.”

It’s very hard to find and organize all the archives of data. Even establishing a chronology is difficult. E.g., “Yes, that project has that date stamp but it’s really a transfer from a prior project twenty years older than that.” It’s hard to map the pieces.

Seamless Astronomy: ADS All Sky Survey, mapping data onto the sky. Also, they’re trying to integrate various link mappings, e.g., Chandra, NED, Simbad, WorldWide Telescope, Arxiv.org, Visier, Aladin. But mapping these collections doesn’t tell you why they’re being linked, what they have in common, or what are their differences. What kind of science is being accomplished by making those relationships? Christine hopes her project will help explain this, although not everyone will agree with the explanations.

Her group wants to draw some maps and models: “A Christmas Tree of Links!” She shows a variety of maps, possible ways of organizing the field. E.g., one from 5 yrs ago clusters services, repositories, archives and publishers. Another scheme: Publications, Objects, Observations; the connection between pubs (citations) and observations is the most loosely coupled. “The trend we’re seeing is that astronomy is making considerable progress in tying together the observations, publications, and data.” “Within astronomy, you’ve built many more pieces of your infrastructure than any other field we’ve looked at.”

She calls out Chris Erdmann [sitting immediately in front of me] as a leader in trying to get data curation and custodianship taken up by libraries. Others are worrying about bit-rot and other issues.

Astronomy is committed to open access, but the resource commitments are uneven.

Strengths of astronomy:

  • collaboration and openness.

  • International coordination.

  • Long term value of data.

  • Agreed standards.

  • Shared resources.

Gaps of astronomy:


  • Investment in data sstewardship: varies by mission and by type of research. E.g., space-based missions get more investment than the ground-based ones. (An audience member says that that’s because the space research was so expensive that there was more insistence on making the data public and usable. A lively discussion ensues…)


  • The access to data varies.


  • Curation of tools and technologies


  • International coordination. Sould we curate existing data? But you don’t get funding for using existing data. So, invest in getting new data from new instruments??


Christine ends with some provocative questions about openness. What does it mean exactly? What does it get us?


Q&A


Q: As soon as you move out of the Solar System to celestial astronomy, all the standards change.


A: When it takes ten years to build an instrument, it forces you to make early decisions about standards. But when you’re deploying sensors in lakes, you don’t always note that this is #127 that Eric put the tinfoil on top of because it wasn’t working well. Or people use Google Docs and don’t even label the rows and columns because all the readers know what they mean. That makes going back to it is much harder. “Making it useful for yourself is hard enough.” It’s harder still to make it useful for someone in 5 yrs, and harder still to make it useful for an unknown scientist in another country speaking another language and maybe from another discipline.


Q: You have to put a data management plan into every proposal, but you can’t make it a budget item… [There is a lively discussion of which funders reasonably fund this]


Q: Why does Europe fund ground-based data better than the US does?


A: [audience] Because of Riccardo Giacconi.

A: [Christine] We need to better fund the invisible workforce that makes science work. We’re trying to cast a light on this invisible infrastructure.

1 Comment »

Next Page »