The Countway Library at Harvard Medical School today held a forum/seminar on what they’re working on. What they’re working on is pretty great.
NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other peoples ideas and words. You are warned, people.
Griffin Weber – “Discovering Expertise using research networking websites”
In 2008 Countway built Catalyst, a site with faculty profiles. Passive networks, shown on the right, are the existing networks. Active networks, shown on the right, grow over time. It provides a set of visualizations, including of co-authors out two degrees, topics, people with similar interests, physical neighbors, etc.
Last year they extended this to the entire university with the Faculty Finder. It’s a search site with links back to the faculty members school or departmental website.
Then they decided to link up the instances of this open source software being used at 30 universities, via federated search.
The data in the system can be used to do interesting visualizations of various relations; Griffin shows some examples.
Internally it uses the VIVO ontology.
Emily Gustainis, Head of Collection Services for Center for the History of Medicine – “Collaborative Content Building Using Omeka”
Omeka, from George Mason University, has had a big effect. It’s a “free, flexible, open source web-publishing platform for the display of library, museum, archives and scholarly collections and exhibitions.” It combines the cataloging and exhibiting of content, enabling users to self-curate their collections.
They are collaborating on the Our Marathon site that Northeastern is building, as well as with other institutions on other projects, including a collection of historical embryo photos. They’re also working on the Harvard Library Interoperability Initiative [yay!] on making cross-institutional collections available through Omeka without requiring local instances of all the content. (Here are some of the collections.)
Jonathan Kennedy: ASHE (Automatic Subject Heading Extraction)
Jon works with Countway Library and CBMI (Center for Biomedical Informatics) on semantic technologies. They’re working on semantic search, so that searches for, e.g., cancer return results about tumors, neoplasms, etc..
ASHE uses automated processes to try to generate the sort of subject headings for medical articles that a human would apply. They developed a tool and tested it on 50 books that had already been categorized by humans so they’d have something to compare the algorithmic results to. The results have been very encouraging. The system’s top ten suggested headings quite consistently contain the human-generated headings, and about 25% of the time the human-generated headings are way towards the top of the suggested ones. Also, ASHE does a good job supplying secondary headings.
They would like to expand beyond the medical domain. Criteria: Well-supported ontologies that provide dictionaries of synonymous terms, with parent to child relationships. Also, the granularity has to be right. The Library of Congress Subject Headings generally aren’t hierarchical enough for the project. But the Getty Thesaurus might be good, as well as an astrophysics ontology.
Julia Whelan: “Research in Medical Education: A bibliometric study of scholarship”
Harvard Medical School and the University of Pittsburgh Medical School are partnering to address a set of questions including: Is scholarship about medical education growing? Which journals publish it? What’s it growth compared to other medical topics? Which topics in medical ed are covered? etc. They use MeSH headings (Medical Subject Headings) to track studies and articles. They looked at 72.5K articles from 1960-2010 in 3,869 different journals. They saw growth in the number of articles and substantial growth in the number of journals. These grew faster than other medical articles and journals. They’ve also analyzed topic coverage over time, and which journals publish the most on particular topics. E.g., 80% of articles in medical education are not published in medical education journals.
Future projects: studying topics by the gender of the authors, medical specialities, medical school culture, promotion criteria, and making data available to historians on the Web. Here’s a paper on this project.
David Osterberg: “Strategic Planning at Countway: Innovation and Collaboration from the Bottom Up”
(David gives a highly condensed version of his talk because a tour of Countway is about to start.) Countway has a very flat organization. The staff is small enough to meet in one room, which they do every month. At one meeting, they brainstormed what they can do to make Countway better, and lots of great ideas arose. They formed working groups on everything from the use of space to a set of in-depth training videos to e-special collections that pull together info from all across the spectrum…