Library Technology

Developing technologies to support libraries.

Archive for the ‘il2006’


Internet Librarian Day 2 Tuesday Track A session 3: Mashup Applications: Blyberg, Deweese

REST- accessed via url
makes it very easy to access data in a logical way
you dont have to form a post
just formulate url in such a way that the data is transferred.

PatREST=patron rest (developed by Blyberg)

he found that there was nothing out of the box that was friendly enough to share with the patrons

XSLT=xml style sheet-could have used to transform database info into XML-but even those seemed to diificult.

The expectations that patrons should understand marc is wrong
John wanted something easy to understand.

patrest allows users to access the catalog directly

Here is an example where the most popular books from 11-20 are returned:
www.aadl.org/rest/top/books/10/2

stuff you can do-

*top books=electronic signage application writen in apples quartz composer (visual in nature)
runs in 40 inch plasma display in library

*superpatron wall of books-book cover links to web opac

google widget/gadget using google api
google provides functionality.
Ajax data calls prewritten by google.
blyberg.net

Why let patrons have access?

  1. library stewardship-if users are part of building process they turn into strong advocates of library itself.
  2. taps into knowledge and expertise in community (allows for outreach)
  3. encourages innovation
  4. benefits other libraries. if your library had patrest then the applications they have built then they will work for your community as well.
  5. solicits high quality feedback
  6. promote this as a service to users in order to justify existence.
  7. lets us be part of a growing force in the web community, sharing data and letting other people use it.
  8. blurs the distinction of where the edge of the library is–

***
Chris Deweese
MASHUP google style
-putting your data on the map

one of the easiest APIs to use.

1. get a google maps api key-reuiqres url where you are going to host it.
2. get the “hello world” example
3. copy that text into a new HTML file (replace “key=abcdefg” with key=”what you get from ggogle”
4. lets add some controls “(zoom/pan)
5. latitutde and longitude coord. -geocoding service available-feed it an address and it gives you the coord)
6. create a new point -glat long
7.save your work

for plotting many points google has included a way to import specially formatted XML file-simple loop in javascript allows all you points to go on map.

google maps is like a puzzle to find hints to put everything together

feed it a static xml file or feed it url

Plotting dynamic data

data stored in mysql database
using asp.net and a custom api to output data in xml format for google maps api

in this example the tool was used to study delivery routes of library

 

Internet Librarian Day 2 Tuesday Track A session 1: Mashup: Fichter

What is a mashup and why would i want one
Darlene Fichter

The origin of the word mashup comes from the music industry where one track is “mashed” “up” with another


IBM Executive Declares Web 2.0 Technology to Drive New Business Applications”
http://www-03.ibm.com/press/us/en/pressrelease/19821.wss

There is a rise of citizen mapping projects

fun v. business

view of mashups as todays playground

some of the initial projects might look frivilous.
but in the future –see trend of blogging

DEFINITION
website or web pplication that takes content from one or more sources ro create a new service

most common tools
api or rss feed

mashup ecosystem
http://web2.wsj2.com/the_web_20_mashup_ecosystem_ramps_up.htm

things that need to exist
-open data (usage statements, copyright info)
-open services. programmatic way to draw down access
-small pieces loosley joined

culture of trust-nobody really knows what you are doing with the data

think of it as a lego brick that you can use to build other things

*housingmaps site is example of early mashup

putting info in a useful context
data store from zip code data

photos links and news in one place–daily mashups

newsmap uses google news to crunch data and analyze it
different colors represent news stories

book carousel is a pretty mashup-ajax–moves across book covers to show top 25 books in libraries

book covers coming from syndetics

full disclosure to say not responsible for info data comning from a diff source

hetemeel.com/einsteinform.php

-placeopedia connecting wikipedia articles with map data

mashup the things you want in ine place. =life aggregator tools

-flickr, blogs, del.icio.us,digg,43things etc..

blogging librarian blogger map– www.frappr.com/blogginglibrarians

liveplasma=mining amazon and searching for relationships between things
*these could all link back to library

bashr=putting flickr together with wikipedia articles and links from del.icio.us

weather bon=weather info and access from webcams and info from weather services

interspersing info into websites==bookburro

facts:1105 mashups (programmableweb)

types- mostly mapping

mashup matrix lets you see all the diff sites with apoi or data and see how they have been combines.

Typology of the mashup

  • presentation mashups (superficial)
  • client-side data mashup (application dowloaded and does some work)

Where to start

  • point click paster and publish
  • “cloning” simple source code edits
  • get an idea
  • get a developer token (so they know who is making queries)
  • read the fine print
  • create your first mashup

Internet Librarian Monday Session 6 C106 Gadgets3

Well I went to this presentation, but I seem to have lost my notes. So Here is a colelction of links to other bloggers hwo have posted about this presentation.

my 2cents: fun fluff. They did have some pretty cool trivia prizes though.

http://librarianinblack.typepad.com/librarianinblack/2006/10/il_2006_gadgets.html
http://www.adventuresinlibraryschool.com/2006/10/30/il06-gadgets-gadgets-gadgets/
http://www.infotodayblog.com/2006/10/gadgets-gadgets-gadgets.shtml
http://davidsrandomstuff.blogspot.com/2006/10/il2006-gadgets-gadgets-gadgets.html

Internet Librarian Monday Session 5 C105 Second Life Lori Bell, Michael Sauers, Tom Peters

Alliance Second Life
Lori Bell Michael Sauers Tom Peters
Second Life Library: Going to where the users are.

LaunchGraphic in SL

info island library is truly a collaborative effort.
every continent represented. global project.

try SL for free. virtual world. not really a game. over a million users. lots of businesses are going in there. important that libraries be there as well. SL is not really a game , it has a point. Amazon, reuters, wired ..different business getting in on it.

1. create an avatar.
*pornography and gambling typically lead tech efforts.

changing it to SL library 3D

dedicate whatever time you have to volunteer.

INFO ISLAND
-they have found that avatars are tired of sex and gambling and are looking for more to do. Their information needs can be fulfilled here.

Parvenu Tower

2 main traditional libraries.

-what kinds of collections, resources should be offerred is one question they constantly wrestle with.

every floor in tower contains diff info. with links to resources.

-They found that users like it when the library is staffed.

exhibits-promote digital collections.

lectures /programs once or twice a week.

They are working on APIs to connect SL with resources on the NET.

The Virtual library is still about books and information

They have a monthly book discussion group

They are still trying to find out what collections work best etc..doing a lot of experimentation.

Reality Check

minimal requirements
800 mhz pentium 3

recc requirements
1.6ghz P4

-if you don’t have the right hardware it can be a frustrating experience.
-most of cummunication is chat and instant message
-you don’t have to spend real money but it helps. curreny $1=$190 Linden dollars
for example every page in a power point presentation will cost you about 10 linden dollars.
-the greater the number of people involved in an event the greater the lag time.
-your boss and coworkers will probably not view this as work. so plan on doing this at home
-you might get sucked in.
-people are generally polite, but they still fall on your head.(when entering SL)
-keeping track of acquaintances based on their appearance can be difficult (avatars change)
-updates for SL -you can’t login until you download and install the update
-system problems – grey goo: grey colored virtual goo oozing over things – you might buy an item but when you log in next time it has dissappeared.
-sometimes it just doesn’t work – logging fails.

“what we are learning”
-how to build an island -terraforming
-staffing
-govt and management
-collection-do we have to have it and what should it contain.
-services:what does ref service mean in SL
-exhibits and events-live events are a big draw in SL
- privacy concerns-

Internet Librarian Monday Session 4 D104 Cool Tools and Mashups for Webmasters Frank Cervone, Darlene Fichter

Cool tools and mashups for webmasters
Frank Cervone
Darlene Fichter

*font tester-see how fonts are going to look online
*meta tag expert- utility that allows you to have a standard data entry form to create Dublin core. at the end you end up with code you can cut and paster into web pages
*URL investigator-link check site and lets you see data about site. Check out someone’s URL
*link popularity check- lets you put in a URL and lets you see how it rates against other sites. Compare hits between sites
*eXactmapper lite- keeping site map up to date. Lets you create a number of views of sitemap for your site.
*google site map builder-much more effective google view of your site. upload file to website. overview visually of how these things workout
*wink – allows you to create flash applications lets you create tutorial sort of products
*powerbullet presenter- allows you to create a stand alone exe file
*PHP editor- good for a number of programming languages
*PHP expert debugger-can set it up in 2 different way 1. locally or 2. on server.

Mashups in a nutshell

new breed of web applications. using info from more than one data source. information comes from 3rd parties . mix it up with your local data.

APIs/web services

step 1 get idea
step 2 sign up for developer token
step 3 read the fine print
Step 4 create your 1st mashup

-learn from others. copy how other people do things.
-online tutorials
-point and click sites
-developer toolkits
-program it

*IDEA* create google map for branch libraries.-pictures+info

point and click mashups

digital life aggregators. – suite of tools that let you bring different things together in one place. pulling in all those RSS feeds into one place.

mashups for anyone. – communitywalk.com –show a route–

make different kinds of tours. (one example- a book that has different landmarks etc..)

Frappr – blogging librarians

a map of different communities

other google map builders

yourgmap.com
mapbuilder.net (google or yahoo users)
googlemapbuilder (not free)

Book ideas
-lots of options
reviews, book covers,

-book carousel–
click through popular new fictions by book cover

yahoo user interface library (goldmine)

FLICKR toys –
-captioner! (add captions to photos)
-colr pickr search flickr for photos of a certain color

web gallery creator
tool that allows you to go through directories on your computer and create web pages with all the photos in there. creates thumbnails as well.
-BMP lite – takes photos and compresses them in batch. uses a couple of different formats
-Gliffy- quickly drawing and sharing docs on the web.
-community toolbar from conduit. use wizard to create toolbar (previously known as effective Brands)
-Firefox extensions:
1.duplicate tab-new page
2. tab x-any tab you can close
3. tabs export
4. snapper*:create snapshot
5.Browster- preview a site before you go there.
6.google notebook
7. measureit:measure dimensions of image
8.linkification:converts text links into genuine clickable links

Zotero- research , note taking,

-DFincBackup: backup program to local hard drive or network automatically.
-CCleaner: cleans up crap in windows
gets rid of uneccesary system files, registry entries. *makes things run faster
-a-squared HiJackFree: manage the autoruns on your system (similar to msconfig?) helps you find out what processes are running, what their properties are etc…
-Retrievr:sketch or upload photo to see what photos match.

Internet Librarian Monday Session 3 C103 Jesse Andrews (Flock, Book Burro)

Jesse Andrews
Flock, Book Burro

Greasemonkey
Userscripts

makes the web your playground.

Book Burro-greasemonkey example that he later made into an extension

browser 101
URLs

1st part resolves to IP address. your web browser makes a request from that IP to get a document

other people might have information about this URL

(cached version, older versions, tags etc…)
so you might also care about other bits of errata about URL

browsers

you can render the information from a web page any way we need to.

you can write more info about

-example:greasemonkey

idea:once page is loaded a script is executed to get more information about the content
add data, hide adds. -filters/corrects a few things.

scripts can enhance functionality. lets you tinker with it

example of user script. -
1. Shows where script should run.
2. If on certain page download info

de-xeni
boing boing gets risque
enables you to filter out a certain users postings

userscripts.org is a greasemonkey repository
(built using ruby on rails(take idea to execution in a couple of days if you know the tech))

you can go in and look at them and modify them. For example modify them to look at your library instead

On march 28 2005 a new version of greasemonkey was made available giving users the ability to make requests outside their local domains.

This change provided the catalyst to build bookburro

the bookburro extension tries to find an ISBN for a book on any page you are on and then
pops open window and searches a select group of databases to find prices.

His recommendation is to use screenscraping from sites that don’t yet have open data.

The next step he took was to integrate bookburro w/ worldcat

After bookburro he decided that he wanted to take these same principles to other projects.

He thought it would be a good idea to create a repository of libraries, and how to access their info–but then found out talis was doing this (silkworm)

A mashup of Book Burro w/ Book Mooch (p2p for books) added the ability to say *this is a book I want -or- * this is a book I am willing to trade.

He would love to hear new ideas or for vendors to work with him to make it easier to access their data.

FLOCK (newest release by the end of the year)(available on all platforms) Cardinal is carnality in release

based on firefox but with the following features:
-photo sharing
-blogs
-RSS

their goal is to make the other tasks you do on the web simple and fast.

Here is a rundown of some of the available functionalities:

1. flickr- button on browser toggle section of browser to make it a way to consume photos.
uploader button makes it easy to drag and drop. You can take a photo in the browser and drag and drop it to a text area and it converts to a link (integrate to photobucket as well)
2. Blogging-selection-BlogThis- to make blogpost. tries to make the correct sitelink. drag photo from bar into blogpost and it will show up. regular text area with advanced features. (personally he actually blogs in VI in the text editor and stores posts in txt file.
3, News reader bar. Integration w/ blog

These aren’t just firefox extensions, because they are trying to integrate all of these features into a comprehensive user environment.

*haven’t quite figured out how to integrate creative commons in a good way

4. Full text search. -searching for sites you have already been to. added full text search indexer. any time you visit any page it indexes that occurrence. you are linked to the history cache on your machine so that anytime you delete the history this information goes away. Yahoo is the only search provider currently with a dynamic feature.

5. del.icio.us as a way to share bookmarks. they have a bookmarklet that can be integrated into the firefox toolbar. Flock takes it one step further. keyword completion across feature set. upload things you choose to share and download. -so it works between your computer instances.

*search elsewhere features in upcoming versions

still working on how to build a web browser

overstimulate.com

Internet Librarian Monday Session 2 D102 Karen Coombs, Jason Clark

Well I have had some time to sleep and reorient myself with what I do at work. Here are the notes I took during the session titled “Innovative Uses of Web 2.0 Technologies”

Jason Clark
Karen Coombs
Web 2.0 tech and innovative uses
“Introducing web 2.0 into library websites”

Architectural things are done on the backend to make the web site friendlier

There is a movement towards a more active and dynamic web

Examples

social software
weblogs
folksonomies
wikis
podcasts
web APIs
AJAX

concepts
-radical decentralization: developed content management system to let people make their own changes. lets them make changes to any pages, has to be approved by that particular webpage owner.

Librarie’s should be encouraged to incorporate wikis and blogs into the library site so that the users are able to be more creative

-small pieces loosely joined: use whatever software or piece of programming that meets the needs at the time. For each problem build different modules to meet the need. This makes content reusable anywhere, and it lets faculty members grab the information they need. Aspects of the content management system can be replaced as needed. This makes the library site more flexible

-perpetual data: Deploy systems early, and make constant improvements. Bring the system up as soon as possible then get feedback and make changes as needed. A small group of library staff should do the alpha testing and then the rest of the library does beta testing. Enhancement requests an be posted on a wall and the development team can be constantly working to make improvements–mindset: won’t always work not set in stone open for improvement. Constant Change. Painful for librarian to change. Helping them change with perpetual beta idea. Gives them the sense that something new is always coming

-remixable content: intention to build API that allows incorporation into other systems. API lets others pull in all the content they want. use AJAX to add database link to any wiki blog etc.. Once functionality and APIs built everyone can use it. Doesn’t ask everyone to be a brilliant coder to work with data.

-user as contributor: users can contribute to what is going on with library website. Idea to let users have access to update and change page i.e. UTHINK. User tagging and review of content in catalog**

-rich user experience: multimedia -screencasting. how to do things etc.. Personalization and customization. Balance personalization with privacy issues. Spaces for collaboration and interaction.

University of Houston library

Further resources
John Blyberg, Paul Miller

***

Jason Clark

Social tagging and folksonomies in practice
ROJO
i.e. del.icio.us, Amazon,
ability for user to contribute and make resource their own and share knowledge with larger group.
Library setting:
-additional access points in library catalog
-user vocabulary , organize info
-communities of practice centered around book or article.
-organize group of pages for library subject guide.

PennTags. del.icio.us clone for library universe at UPENN
users use University IDs allow University community to come in and tag library resources.

(IDEA: make tagging utility for libraries with individual institutional access)

Montana: Beta electronic theses and dissertation component to library system-giving users ability to tag these resources.

-open system that doesn’t need for users to login.

social tagging :why does it work.
*adaptable
*maps and displays simple relationships
*not complex

potential problems:
-lack of precision:synonym and term drift.
–vulnerable to”gaming” of system (if group of users coordinate they can decide what terms belong to particular resource),
-some people are wrong and don’t understand what they are tagging.

Goal:
*mine data from users and build a better system for them.

zoom cloud / tag cloud

blog on tagging

tagsonomy.com

coders: PHP MySQL etc

open source options
–freetag

Blogs Wednesday Internet Librarian 2006

Walter nelson from RAND
Syndication and website content.

using blog related tools in other ways
customers don’t understand rss, so hard to feed it to community
people get webpages, so if you can put it on web page then your users can use it

moveable type-installation the hard part.
installed on server
canned formats can be used.
or you can make it your own. make it to look like any other website.

you can make moveable type into a CMS
Features:
-easy to use
-control of authors
-generates static web pages
-automatically updates rss feed.

!Think outside the blog!
-blogs are just a tech with useful features
-moveable type can be used for other purposes, you don’t have to use it just for blogging

purposes.

Feed2JS

2nd piece of puzzle
freeware from maricopa community college. use it to generate javascript and it displays your

rss as a bulleted list

allows you to propagate info broadly. you can use their server to parse your rss feed, but

not recommended.

install feed2js on your own server.

Announcements section on the RAND page is an RSS feed
-if you click on headline you will go directly to blog entry

Categories=taxonomical info

RSS feed -additional uses

add external newsfeeds
create static “link list”

category feeds
parse rss feeds into category
one blog creates multiple feeds

-set up branch libraries as categories

***
Using blogs for internal communications
Coombs

existing tech infrastructure for internal communication not well used.
it is easy for staff to create and maintain feedback
communication about projects needed.

diff kinds of blogs
-committee blogs, used to make announcements and post minutes and other committee documents, gather feedback on what the committee is working on.
-blogs for service points (access services, reference) maintained by staff who work at certain desk, announcements, active desktop on whatever computer someone goes to at the desk. great because it keeps the newest stuff on top
-working groups, groups working as a team: similar to committee blogs because they help users communicate, allows users to send important materials found that should be read.

Unresolved issues:
-blogs are private so there aren’t any rss feeds so hard for users to subscribe and keep up to date,
-don’t want to make some of them public because they don’t want some announcements to be seen generally.
-integration with existing authentication and authorization systems. yet another password to remember
- keeping up with blog permissions,: people leave accounts deleted, standard account management issue.
dynamic templating feature from moveable type:files aren’t created, relies on an older version of php, and if you upgrade php it breaks the system. -so looking at moving away from moveable type

***
Aaron Schmidt
*no one cares that you have a blog*
using blogs as a tool to put them on your website
to combat criticism–it is only a website.
it is about connecting and what they can do with website and how they can contribute to it.

lamson library-plymouth state – bisson using moveable type as opac
interaction happening at the opac.

Flickr: use uploading tool=piece of software that lets you drag and drop photos and bundle tags

flickr can be associated with web blog and you can send the photo to your blog
>flickr badge

new materials on flickr
westmount library, instead of these are the new titles they use a graphical representation and post them on flickr with notes, the notes describe the book and link it into the library opac.

meebo me–im amazing tool for weblog, allows communication in synchronous or asynchronous chat, widget.

blog elsewhere. — follow through get content into other pieces of blogosphere to drive information to you.

contribute to blogs in your community, flow will come back to you.

nationwide group project to contribute to social software web sites.

follow through-have a content plan, if you just let it sit there it will be static and wont do you any good.

?’s

what other blog platforms are you looking at-looking at wordpress new.

integrates with corporate directory signons – so that would help with authentication.

moveable type is working on the kind of functionality too.

Track D301: Wikis in libraries: Internet Librarian 2006 Wed.

1st speaker:
Introduction to wikis
Nicole Engard

*share secretary duties from a meeting, everyone who was there can add comments.

wikipedia not to be used as primary research tool
can be used for information discovery

why use wiki?

>easy to learn
>share knowledge
>cross borders
>revert back to older editions=security
>you track who has made changes

Mediawiki is free but has to be installed on your own server.

huge list of wikis
c2.com/cgi/wiki (some more to the address here to look up)

**
re-building intranet and wanted to have more collaboration.
2006 introduced intranet 2006
-more task oriented now, previously dept oriented.

*goal =radical decentralization=empower users

Nicole wrote the code herself because when she started it she trained librarians on wikipedia to get them used to it, and created a page where everyone was encourage to go in and have fun. no on edited it. so she looked to see whay they weren’t editing it. found out that syntx was the problem. wanted to develop new wiki using previosuly deployed application. –wysiwyg

-a lot of wikis dont have structure and their organization need one. (hierarchy)

icon (graphical) keys make it easy for users to edit pages

personal quick links on sidebar based on login

there is a button that allows users to submit page for checking to web team

editing tool looks like ms word for example –used wsywig pro (not free) powerful and works well with multiple editors on page.

*breadcrumbs at the top of pages–

***
Darren Chase
an agrreable wiki

-problem:working together collaboratively on projects can be difficult

*Bird problem story-

-whatever solution is they wanted to have control especially for the sake of Systems.

options
-static html
-blogs
-cms
-wiki

staff may have to meet a learning curve for the wiki

had to choose one, so considered different options

-pbwiki
-mediawiki
-twiki

-wikimatrix lets you compare all the different options

They chose twiki because it is easy to edit and it has good features, access control and versioning. Another good feature is the ability to make web pages.

building twiki:
perl module used instead of using sql database.
easy to install for systems department
-plugins

staff training and buy in important. or else the project will fail

Scholarship in Chaos

Flying high on the web or in free fall?
Rich Wiggins MSU

Microsoft Elsevier, and Google representatives on the panel

The future?

E:dynamic market, phenomenal the info available. working on federation and analysis. peer review question, publishers becoming more important? in academic market this poses some challenges:

  1. versioning
  2. quality (peer review)
  3. archiving

scirus
giving structure to the data.

M: wants to broaden question. how it is relevant to academic librarian. has broad interest and appeal. user appeals are not fitting with typical academic researcher. queries span into diff regions. Excited about content from publisher or open access repositories. making it available using Academic Live search is what they are trying to do.

Question? what kind of searches are being done?
(from audience–science searches)

M:seeing searches from a wide audience.

G: (on the question of academic searching) -sharing a story- as an undergraduate on a small campus in india trying to find faster ways to multiply small numbers on chips-went to library looked up articles followed citation sent work in and reviewers told him it was nice but where has he been for last five year (laugh)
Goal of what they are trying to do is make it possible for anyone anywhere to find research wherever it is produced throughout the world.
-Q: from Donna (LC) in the audience- in terms of scholarly publishing finds that audience is going to OA or blogs what do you have to say about scholarly publishing on blogging what should she do?-how do we capture scholarly info.
G: from goggle’s viewpoint -1st trying to make sure that formal information first (from google scholar POV)
Donna:trying to capture serious scholarly blogs that actually have scholarly info- how is google keeping up with this new trend?
G: what is the problem.

from the audience: blogs are ephemeral but they might have good academic material.
E: so what you are saying is that there are new modes of publication, I would recommend that you go to scirus because they don’t just look at traditional publication types. important to make a distinction between blogging and peer reviewed material and scirus is able to do this.

M: come up with way to identify blog as scholarly so it can be cataloged and made discoverable. =standards

Audience: how to preserve blog content

Barbara: this happens with scholarly journals as well. failure of archiving

E:challenges of new era -archiving and versioning already mentioned (see above) that is why we have peer review process

(scirus seems like a unique thing, how do you address what is on the web?)

versioning and archiving is responsibility of publisher.

Barbara: google scholar approach, struck by fact that when you cluster material you weren’t limited to types of products. do you follow into blog country to pursue the authors thoughts.

G:archiving significant problem
Barbara:frequently google’s cache returns things that have been gone
host:wife’s cat has thyroid problem and radiating the thyroid kills it and another gland takes over. google search for radioactive cat how do we find important blog articles

g: important articles will rise to top
host:what about controversy
g:if we can analyze it then the one with more people believing in it will rise to the top. we have to depend on what the people in the field talk about.

m: uses similar techniques for relevancy ranking.

host:serendipity. throwing random things into hitlist?
M:no.

e: agrees, relevancy is difficult to compute. trying to show results from different source, scirus has keyword terms to give clickable related terms.

host: theory of comprehensive search, women part of clinical setting, show to have toxic effects that were not indexed online. dr. only looked online and was not able to fond this info so lady was killed.

e: crucial question. difference between search engines and A&I databases. A&I database goes back and enters information. In terms of comprehensiveness search engine cant really do anything about this.

host: if your relative had complicated condition what databases would you search.

e: abstracting & indexing database material determine by editorial design. Would not google it.
m: focused on historical material in printed form, only a matter of time before information made available. Start with brother who is a surgeon.
G: I would search everything, every possible thing no matter where. Would not stop at google.
Barbara: contact medical librarian.

from audience: controlled vocabulary? And date searching?

g: search logs show no vocab based searches so doesn’t think it is very important, there are issues because people don’t know what problems are.

Donna: the law , when authoritativeness –when most important piece of law could mean someone going to jail or not. Working on something bringing all the different pieces of law from around the world into one place and using an authorotative language to go behind it. They help you get over asking the right question; what you don’t know doesn’t hurt you. Software and hardware.

if you type in a particular term then a sidebar could pop up that have the related terms

Donna:knowledge in thesaurus beyond what any one person may know.

e: for example if you type in heart attack you get cardiac failure as well. Thesaurus important. In scirus there is a controlled vocabulary.
m:good feature idea for navigating through information. Looking at different types of taxonomies. writing software to draw vocab from text. investing in this idea.
g:problem with concept refer across different languages and fields is complex and not easily resolved by using ontologies. you can go back and look at common use and scenarios. when a new concept is used it is described in previous terms and then it is named and referred by new term from then on. simplest thing to do is expand queries. keeping track in drifts–influence across different fields is hard to achieve but it does not solve by using ontologies to say this is solving the problem

host: if I search dna testing how are you mapping it to say “did you mean” kind of improvements.

g: collaborative fltering: no
m: no,
e: no, but it is an area where search engines might be moving to make use of the community that is using it.

host: are you capturing click throughs?

e: we don’t have log analysis on which pages are being clicked on
m: yes
g: of course

Barbara:Q? the problem with taxonomy is that they don’t solve the problem by going back in time, but what about clustering, relevance ranking in a certain field of expertise, if I want to use equity as a legal term that is one grouping but if I want to use it as a company that is another search, I would like to be able to narrow search. Northern light and ask.com do this.

host: so disambiguation

e: diff approach all documents are classified by scheme you can assure that you are only search within a specific scheme. based on training set took articles from classification they knew and used terms to reclassify terms in scirus
m:no we don’t but it is an interesting term. difficult to solve
g: cluster instances of same work, tricky to figure out right granularity for it to be good for the user, attractive and easy to use for specific or artificial examples, to do it across the board and so it is easy for the user would be quite difficult

host: date of document

e: in scirus you can,
m:you can sort by date,
host:problem is they web site doesn’t tell you when items were last updated.
g: yes, they have to be able to see if pdf and doc are same. Continuum – question -is it useful as a result, if not it shoud be removed. how to figure out date.. in some cases you can, depends on data, on academic scholarly cases it is possible because it says so. harvest full text and analyze document itself, do not rely on info provided to them. possible in some cases to do a reasonable job.

audience: do none of you search informal scholarly literature. what sources.
g: can you define informal literature – audience:blogs
m:no blogs
e: we don’t focus on blogs, but if we determine it is scientific then it is absorbed.

host: who decides if something is scholarly?

e: analyze terms in documents and based on that (vocab) they are marked as scientific, use relevancy score.. it is robotic

host:will satirical scientific journal be included
e:great trust in algorithm
g: at this point they don’t include blogs because they don’t feel confident that they can decide what is valuable.

audience:pre print server issues?
g: yes they
m: yes actively try to find OA
e:yes if it is scientific

host:self archiving issue is something to ponder

audience: what id A&I opened their info to you what would you do with it? (structured xml)
m:we would love to have high quality metadata.
e:work with constructing, medline database in scirus and use data in there
g:would take it in a heartbeat.

audience:that is the problem, what search engines are not getting is the problem. disconnect between electronic publishing systems and getting it into the search engines.

e:we work closely with publisher and we don’t agree with that. open archiving initiative is making good crossroads. we work with 10 publishers and just signed agreement with crossref-metadata and then full text, protocol developed by crossref. Estimation that there are 4000 stm publishers.
g: if a&I were able to provide data in xml then they would take it in a heartbeat. dealing with the

barbara:bottom line. services handling info needs when librarians are not there. what can we do to help you do a better job, for example open access good for everyone should we contribute to lobby for this?

G:the bills are paid by the librarians, if you want a comprehensive search , content that is licensed must be indexed, it will improve search dramatically.
m: provision of content and make sure it indexed, putting structure on content, classification of blogs for example and publishing of metadata etc..

host: interesting to see who defines standards

e:working together with librarians is key. they are the experts on what users need, way IR are licensed is a cooperation. no one disappearing.

audience:synergy between formats? (pdf/html)

m:xps,
e:no one solution pdf is dominant form right now

audience: user behavior observances?

e: people start broad, they are afraid to miss information. they want tools to help them refine, they have a few products they go to for information.
m: breadth of use interested in scholarly works
g:the only unusual thing is that they are different from normal searches, advanced search page not used very often only used to limit time period to last five years.