Library Technology

Developing technologies to support libraries.

Stephen Abram (bloglines)

Props to “Stephen Abram MLS —Vice President Innovations, SirsiDynix” for his mentions of bloglines all throughout this podcast:





“25 Technologies in 50 Minutes



Nov 14, 2006 | 01:00:00



Lots of technologies are due consideration for our library portals. Which would be on your top 25 list? We can’t do it all at once but we should be trying more than a few out to learn about them. Stephen Abram, SirsiDynix’s Vice President of Innovation lists a technology application every 120 seconds in this roller coaster ride of what’s out there in LibraryLand and which one’s are worth playing with and seeing if they’ll be useful to your library and your community of users. Join us for the cook’s tour of what’s in the front of the pack for the 2.0 Community Portal. Maybe I’ll use an egg timer dinger for every 2 minutes!



More Info | Listen Now | Podcast”

http://www.sirsidynixinstitute.com/archive.php

Charleston Acquisitions Conference 2006 XVI Friday Concurrent 1: Scatter and Decay E-journal usage patterns

Scatter and Decay E-journal usage patterns

Carol Tanopir

Abundant Data – too much?

Easy access to use of ejournal collections
 
There is a need to gather more data, because as good as it is now we have to gather info from users to find out what they use and why and what other things they use (web, subject repositories) to get a fuller picture.

Max Data funded by ilms funded to get data about data about selected libraries and look at whole picture.
They want to look at the story behind the data, what can we conclude with confidence about the different methods of data collection.

Ultimate outcome will be a cost benefit model for the different methods of data collection. What is worthwhile to use for the different things you want to find.

Goals to help libraries make the best use of data, they are sharing what they have collected today.

3 teams
Dave Norris London looking at deep log data, logs from all of Ohio link libraries, they isolated 4 libraries that they also surveyed.
1    Research intensive, 1 research extensive and 2 masters heavy.

Log analysis of ejournal usage only

With survey you can better identify survey base, you can separate out by rank and degree.

Negative thing about surveys is that it is self reported, but you can find out why they did things and what the values

Gail from Tennessee survey reports

 2 findings for today: scatter and decay. Looking at how you measure use in libraries.

Scatter: relative dispersion of points on a graph in respect to a mean value, (used loosely today) want to look at how reading varies by subject discipline.

Looked at how many journals used in last month multiplied it by 12 to get a year.

Medical and health students and faculty have read many more articles 434 for faculty and 222 students, many more than other disciplines. Variation in number of articles or reading done found.

Downloads by subject in Ohio University, found by logs
Medical journals first followed by other sciences. Looked at subject are of journal to come up with this distribution.

Used counter data and combined it to come up with a list of titles and used info from link resolver to come up with subject headings. Medicine did not come out on top this way. Medical school located in Memphis, and they use separate databases from Knoxville location, this would account for the discrepancy.

The Arts/humanities section looks high because link resolver put new/fashion entertainment into that category.

File had 17,000 unique titles. Log report had 7,000.

In addition to counter reports they also had other info from aggregators which came through here.

Pointed out that some titles had multiple subjects.

Couple titles in top 20 use, billboard and rolling stone, history of rock class is very popular there (Tenn.)

You have to look behind stats to find out where information is coming from. In the future it would be good to pull out scholarly titles and look at the distribution.

David looked at downloads by subject staff and faculty vs. student use.

Student use is much higher to online articles. This might be because Faculty has other sources for info, personal sub-reading rooms-colleagues.

Survey: how many minutes did you spend on the last article you read. Found that engineers spent twice as much time on medical articles.

Found medical articles are more widely read but they have been read through more quickly.

Average number of seconds of view per article was gleaned from the log data.

135 seconds time read by sciences viewers. Assumed that article is scanned over and decision is made to print or download.

Looking at the Counter data to see the entire collection. Found that 10% of titles counted for about 80% of use.

Decided to look at titles alone, and combined data on like titles, wasn’t fun matching on titles or unstandard ISSN. Ended up with 17000 titles and 11.5% counted for 80% of usage.

Highest use titles: science with over 8000 downloads. Nature 4,000, Tetrahedron, AMS archives (older chem. Data), USA today

Data from Ohio link looks somewhat different because they were looking at more scholarly titles (presumably)

–David –
Decay: process of gradually becoming inferior, gradual decrease. Or reading fails by age of articles

Did a lot of work on all 6000 Ohio link titles, good background on contextual use. There has been an increase in use of older material, because it is much more visible than it previously was.

Evidence shows that people who use search engines find older material.

Decay of article readings.
Carol asked people about when the last article they read was.

Looking at downloads

Looking at how data is triangulated and what does it mean?

Citation: Journal of the American society for

When we actually look at dowloads. We get a different picture

Conclusion:
No one perfect method for data collection. Need to look at sub discipline of reader and for items. Found so far that student assignments can lead to skewed data.
 
When looking at stories behind data there is a big difference based on sub discipline. Caution about log data.

Student reader skewed towards students, because faculty has other way to get info

We need to examine what are value is compared to our costs.

They will continue to look at possibilities for log collection. More work the more cost and more of your time is used.

The more effort you put in the greater story.

Charleston Acquitisions conference 2006 Thursday SEssion 2

After the Dinosaur Killer: Adaptation and Survival

Michael Pelikan – Information Sciences and Technology Librarian, Penn University

This is a Recap of a talk from 3 years ago

Standards

Elements

Vocabulary

DRM-predicted that it would be solved at network level

10 years ago there was a forecast by F.W. Lancaster that

Libraries will be seen as a drain on the academic community

Who is the dinosaur?

Unintended consequences – king kong vs dino (world view of both similar)

Universities will bypass traditional publishing model.

Business model gives publishers money to print

Patron’s worldview

World is in their hand=ambient findability

Searchable information where they need it

Problem-you can only search for words/not ideas

Academon – pays for student papers and sells them, giving royalties back to the author.

CiteSeer- (experimental project)

Google Scholar inspired by citeseer

Their acquisition of jotspot (wiki hosting) building of communties shows beginning of business model totally supported by advertising.

Most valuable information on the web today=your clicks (click forensics—traffic)

Click throughs and web logs gives you 100% sample rate that is accessible through machine analysis.

Nemesis (book) describes the problem with inherited knowledge, and not creating new ideas (instead just re-harvesting previously created ideas.)

Charleston 2006 Thursday Second Keynote

Massive Scale Librarianship

R.David Lankes IST 676

www.davidlankes.org

http://drew.syr.edu/MSL

big picture=every mile of roadway will generate a gigabyte of information a day.

As soon as info exists people will want to use it

Geometric progression—gladwell tipping point.

Predictable change

In the near term we will no longer have to choose what to keep because there will be so much storage at our fingertips.

Storage rate is exceeding Moore’s Law

Issues

  1. privacy
  2. ethics of involvement
  3. commercialization and scarcity
  4. unintended consequences
    1. AOL
  5. security
  6. preservation –data sitting on tape drive on unsupported proprietary system that is broken.
  7. culture and control
  8. cognition and perceptual scarcity
    1. more and more information produced but no more time to read it all
  9. scalability and sustainability
    1. power required to cool storage machines

Options?

Cataloging the information doesn’t work

Not every bit of info at there deserves this kind of attention

Ignoring the info is not an option

Other areas are replicating librarianship to deal with data

Is role of library to be curated collection of high quality work? That doesn’t work when everything is freely accessible.

Only true option is too embrace the problem and for libraries to assert themselves to the situation.

Participatory Librarianship

Iis.syr.edu

Requires shift in thinking

Our job is knowledge

Gather/sort/create

Facilitate knowledge creation of community

-Socratic method

-book groups

-collection development

Hidden item focus on grand unified theory of libraries

Bibliofundamentalism –Karen Schneider

-Role of library as a haven of order and quality

Participatory Librarianship

Constant evolution

Taken catalog and user data to create catalogs similar to amazon

Bringing all the info together in a mashup kind of a situation.

=true catalog (metadata based)+ community library

Anyone can move add or delete data

Conversations

Local, network, web, institutional, massive—DATA

Internet Librarian Day 2 Tuesday Track A session 3: Mashup Applications: Blyberg, Deweese

REST- accessed via url
makes it very easy to access data in a logical way
you dont have to form a post
just formulate url in such a way that the data is transferred.

PatREST=patron rest (developed by Blyberg)

he found that there was nothing out of the box that was friendly enough to share with the patrons

XSLT=xml style sheet-could have used to transform database info into XML-but even those seemed to diificult.

The expectations that patrons should understand marc is wrong
John wanted something easy to understand.

patrest allows users to access the catalog directly

Here is an example where the most popular books from 11-20 are returned:
www.aadl.org/rest/top/books/10/2

stuff you can do-

*top books=electronic signage application writen in apples quartz composer (visual in nature)
runs in 40 inch plasma display in library

*superpatron wall of books-book cover links to web opac

google widget/gadget using google api
google provides functionality.
Ajax data calls prewritten by google.
blyberg.net

Why let patrons have access?

  1. library stewardship-if users are part of building process they turn into strong advocates of library itself.
  2. taps into knowledge and expertise in community (allows for outreach)
  3. encourages innovation
  4. benefits other libraries. if your library had patrest then the applications they have built then they will work for your community as well.
  5. solicits high quality feedback
  6. promote this as a service to users in order to justify existence.
  7. lets us be part of a growing force in the web community, sharing data and letting other people use it.
  8. blurs the distinction of where the edge of the library is–

***
Chris Deweese
MASHUP google style
-putting your data on the map

one of the easiest APIs to use.

1. get a google maps api key-reuiqres url where you are going to host it.
2. get the “hello world” example
3. copy that text into a new HTML file (replace “key=abcdefg” with key=”what you get from ggogle”
4. lets add some controls “(zoom/pan)
5. latitutde and longitude coord. -geocoding service available-feed it an address and it gives you the coord)
6. create a new point -glat long
7.save your work

for plotting many points google has included a way to import specially formatted XML file-simple loop in javascript allows all you points to go on map.

google maps is like a puzzle to find hints to put everything together

feed it a static xml file or feed it url

Plotting dynamic data

data stored in mysql database
using asp.net and a custom api to output data in xml format for google maps api

in this example the tool was used to study delivery routes of library

 

Internet Librarian Day 2 Tuesday Track A session 1: Mashup: Fichter

What is a mashup and why would i want one
Darlene Fichter

The origin of the word mashup comes from the music industry where one track is “mashed” “up” with another


IBM Executive Declares Web 2.0 Technology to Drive New Business Applications”
http://www-03.ibm.com/press/us/en/pressrelease/19821.wss

There is a rise of citizen mapping projects

fun v. business

view of mashups as todays playground

some of the initial projects might look frivilous.
but in the future –see trend of blogging

DEFINITION
website or web pplication that takes content from one or more sources ro create a new service

most common tools
api or rss feed

mashup ecosystem
http://web2.wsj2.com/the_web_20_mashup_ecosystem_ramps_up.htm

things that need to exist
-open data (usage statements, copyright info)
-open services. programmatic way to draw down access
-small pieces loosley joined

culture of trust-nobody really knows what you are doing with the data

think of it as a lego brick that you can use to build other things

*housingmaps site is example of early mashup

putting info in a useful context
data store from zip code data

photos links and news in one place–daily mashups

newsmap uses google news to crunch data and analyze it
different colors represent news stories

book carousel is a pretty mashup-ajax–moves across book covers to show top 25 books in libraries

book covers coming from syndetics

full disclosure to say not responsible for info data comning from a diff source

hetemeel.com/einsteinform.php

-placeopedia connecting wikipedia articles with map data

mashup the things you want in ine place. =life aggregator tools

-flickr, blogs, del.icio.us,digg,43things etc..

blogging librarian blogger map– www.frappr.com/blogginglibrarians

liveplasma=mining amazon and searching for relationships between things
*these could all link back to library

bashr=putting flickr together with wikipedia articles and links from del.icio.us

weather bon=weather info and access from webcams and info from weather services

interspersing info into websites==bookburro

facts:1105 mashups (programmableweb)

types- mostly mapping

mashup matrix lets you see all the diff sites with apoi or data and see how they have been combines.

Typology of the mashup

  • presentation mashups (superficial)
  • client-side data mashup (application dowloaded and does some work)

Where to start

  • point click paster and publish
  • “cloning” simple source code edits
  • get an idea
  • get a developer token (so they know who is making queries)
  • read the fine print
  • create your first mashup

Internet Librarian Monday Session 6 C106 Gadgets3

Well I went to this presentation, but I seem to have lost my notes. So Here is a colelction of links to other bloggers hwo have posted about this presentation.

my 2cents: fun fluff. They did have some pretty cool trivia prizes though.

http://librarianinblack.typepad.com/librarianinblack/2006/10/il_2006_gadgets.html
http://www.adventuresinlibraryschool.com/2006/10/30/il06-gadgets-gadgets-gadgets/
http://www.infotodayblog.com/2006/10/gadgets-gadgets-gadgets.shtml
http://davidsrandomstuff.blogspot.com/2006/10/il2006-gadgets-gadgets-gadgets.html

Internet Librarian Monday Session 5 C105 Second Life Lori Bell, Michael Sauers, Tom Peters

Alliance Second Life
Lori Bell Michael Sauers Tom Peters
Second Life Library: Going to where the users are.

LaunchGraphic in SL

info island library is truly a collaborative effort.
every continent represented. global project.

try SL for free. virtual world. not really a game. over a million users. lots of businesses are going in there. important that libraries be there as well. SL is not really a game , it has a point. Amazon, reuters, wired ..different business getting in on it.

1. create an avatar.
*pornography and gambling typically lead tech efforts.

changing it to SL library 3D

dedicate whatever time you have to volunteer.

INFO ISLAND
-they have found that avatars are tired of sex and gambling and are looking for more to do. Their information needs can be fulfilled here.

Parvenu Tower

2 main traditional libraries.

-what kinds of collections, resources should be offerred is one question they constantly wrestle with.

every floor in tower contains diff info. with links to resources.

-They found that users like it when the library is staffed.

exhibits-promote digital collections.

lectures /programs once or twice a week.

They are working on APIs to connect SL with resources on the NET.

The Virtual library is still about books and information

They have a monthly book discussion group

They are still trying to find out what collections work best etc..doing a lot of experimentation.

Reality Check

minimal requirements
800 mhz pentium 3

recc requirements
1.6ghz P4

-if you don’t have the right hardware it can be a frustrating experience.
-most of cummunication is chat and instant message
-you don’t have to spend real money but it helps. curreny $1=$190 Linden dollars
for example every page in a power point presentation will cost you about 10 linden dollars.
-the greater the number of people involved in an event the greater the lag time.
-your boss and coworkers will probably not view this as work. so plan on doing this at home
-you might get sucked in.
-people are generally polite, but they still fall on your head.(when entering SL)
-keeping track of acquaintances based on their appearance can be difficult (avatars change)
-updates for SL -you can’t login until you download and install the update
-system problems – grey goo: grey colored virtual goo oozing over things – you might buy an item but when you log in next time it has dissappeared.
-sometimes it just doesn’t work – logging fails.

“what we are learning”
-how to build an island -terraforming
-staffing
-govt and management
-collection-do we have to have it and what should it contain.
-services:what does ref service mean in SL
-exhibits and events-live events are a big draw in SL
- privacy concerns-

Internet Librarian Monday Session 4 D104 Cool Tools and Mashups for Webmasters Frank Cervone, Darlene Fichter

Cool tools and mashups for webmasters
Frank Cervone
Darlene Fichter

*font tester-see how fonts are going to look online
*meta tag expert- utility that allows you to have a standard data entry form to create Dublin core. at the end you end up with code you can cut and paster into web pages
*URL investigator-link check site and lets you see data about site. Check out someone’s URL
*link popularity check- lets you put in a URL and lets you see how it rates against other sites. Compare hits between sites
*eXactmapper lite- keeping site map up to date. Lets you create a number of views of sitemap for your site.
*google site map builder-much more effective google view of your site. upload file to website. overview visually of how these things workout
*wink – allows you to create flash applications lets you create tutorial sort of products
*powerbullet presenter- allows you to create a stand alone exe file
*PHP editor- good for a number of programming languages
*PHP expert debugger-can set it up in 2 different way 1. locally or 2. on server.

Mashups in a nutshell

new breed of web applications. using info from more than one data source. information comes from 3rd parties . mix it up with your local data.

APIs/web services

step 1 get idea
step 2 sign up for developer token
step 3 read the fine print
Step 4 create your 1st mashup

-learn from others. copy how other people do things.
-online tutorials
-point and click sites
-developer toolkits
-program it

*IDEA* create google map for branch libraries.-pictures+info

point and click mashups

digital life aggregators. – suite of tools that let you bring different things together in one place. pulling in all those RSS feeds into one place.

mashups for anyone. – communitywalk.com –show a route–

make different kinds of tours. (one example- a book that has different landmarks etc..)

Frappr – blogging librarians

a map of different communities

other google map builders

yourgmap.com
mapbuilder.net (google or yahoo users)
googlemapbuilder (not free)

Book ideas
-lots of options
reviews, book covers,

-book carousel–
click through popular new fictions by book cover

yahoo user interface library (goldmine)

FLICKR toys –
-captioner! (add captions to photos)
-colr pickr search flickr for photos of a certain color

web gallery creator
tool that allows you to go through directories on your computer and create web pages with all the photos in there. creates thumbnails as well.
-BMP lite – takes photos and compresses them in batch. uses a couple of different formats
-Gliffy- quickly drawing and sharing docs on the web.
-community toolbar from conduit. use wizard to create toolbar (previously known as effective Brands)
-Firefox extensions:
1.duplicate tab-new page
2. tab x-any tab you can close
3. tabs export
4. snapper*:create snapshot
5.Browster- preview a site before you go there.
6.google notebook
7. measureit:measure dimensions of image
8.linkification:converts text links into genuine clickable links

Zotero- research , note taking,

-DFincBackup: backup program to local hard drive or network automatically.
-CCleaner: cleans up crap in windows
gets rid of uneccesary system files, registry entries. *makes things run faster
-a-squared HiJackFree: manage the autoruns on your system (similar to msconfig?) helps you find out what processes are running, what their properties are etc…
-Retrievr:sketch or upload photo to see what photos match.

Internet Librarian Monday Session 3 C103 Jesse Andrews (Flock, Book Burro)

Jesse Andrews
Flock, Book Burro

Greasemonkey
Userscripts

makes the web your playground.

Book Burro-greasemonkey example that he later made into an extension

browser 101
URLs

1st part resolves to IP address. your web browser makes a request from that IP to get a document

other people might have information about this URL

(cached version, older versions, tags etc…)
so you might also care about other bits of errata about URL

browsers

you can render the information from a web page any way we need to.

you can write more info about

-example:greasemonkey

idea:once page is loaded a script is executed to get more information about the content
add data, hide adds. -filters/corrects a few things.

scripts can enhance functionality. lets you tinker with it

example of user script. -
1. Shows where script should run.
2. If on certain page download info

de-xeni
boing boing gets risque
enables you to filter out a certain users postings

userscripts.org is a greasemonkey repository
(built using ruby on rails(take idea to execution in a couple of days if you know the tech))

you can go in and look at them and modify them. For example modify them to look at your library instead

On march 28 2005 a new version of greasemonkey was made available giving users the ability to make requests outside their local domains.

This change provided the catalyst to build bookburro

the bookburro extension tries to find an ISBN for a book on any page you are on and then
pops open window and searches a select group of databases to find prices.

His recommendation is to use screenscraping from sites that don’t yet have open data.

The next step he took was to integrate bookburro w/ worldcat

After bookburro he decided that he wanted to take these same principles to other projects.

He thought it would be a good idea to create a repository of libraries, and how to access their info–but then found out talis was doing this (silkworm)

A mashup of Book Burro w/ Book Mooch (p2p for books) added the ability to say *this is a book I want -or- * this is a book I am willing to trade.

He would love to hear new ideas or for vendors to work with him to make it easier to access their data.

FLOCK (newest release by the end of the year)(available on all platforms) Cardinal is carnality in release

based on firefox but with the following features:
-photo sharing
-blogs
-RSS

their goal is to make the other tasks you do on the web simple and fast.

Here is a rundown of some of the available functionalities:

1. flickr- button on browser toggle section of browser to make it a way to consume photos.
uploader button makes it easy to drag and drop. You can take a photo in the browser and drag and drop it to a text area and it converts to a link (integrate to photobucket as well)
2. Blogging-selection-BlogThis- to make blogpost. tries to make the correct sitelink. drag photo from bar into blogpost and it will show up. regular text area with advanced features. (personally he actually blogs in VI in the text editor and stores posts in txt file.
3, News reader bar. Integration w/ blog

These aren’t just firefox extensions, because they are trying to integrate all of these features into a comprehensive user environment.

*haven’t quite figured out how to integrate creative commons in a good way

4. Full text search. -searching for sites you have already been to. added full text search indexer. any time you visit any page it indexes that occurrence. you are linked to the history cache on your machine so that anytime you delete the history this information goes away. Yahoo is the only search provider currently with a dynamic feature.

5. del.icio.us as a way to share bookmarks. they have a bookmarklet that can be integrated into the firefox toolbar. Flock takes it one step further. keyword completion across feature set. upload things you choose to share and download. -so it works between your computer instances.

*search elsewhere features in upcoming versions

still working on how to build a web browser

overstimulate.com