ALEG
Weekly Report - Week 10, 6 July 2001
What I've done
- Dataloading
- SA Women Bib: Inspected the 400 Anthologies which could not be automatcally matched to
existing works on AUSTLIT. About 100 of these were manually matched, and the rest
were used to generate new anthologies on AUSTLIT. This left the last SAWW task
to process the 6700 minor publications! The first part of this load has been
completed, resulting in 2800 new works (poems, short stories, essays, etc).
Some of these 2800 are "duplicates" due to variations in agent name and work
title, but these can only be found by manual inspection. About half of the
6700 works have a non-journal as a primary source: these sources have now been
processed. However, the other half have a journal as source.
Matching these sources reliably has proven very difficult (as it was even
when loading the original AUSTLIT data), and I spent almost 2 days writing
automated match algorithms and then manually matching what I could.
Next week, I'll create the necessary new periodical and periodical issue
works (as there are hundreds of new periodicals uncovered by the SAWW database),
and finally link the 3386 minor publications to their journal sources!
- Miscellaneous queries and adhoc data analysis reports, mainly
associated with 11,000+ works without worktype/form/genre classification.
- Removed test works and agents created during the training and since.
- Various minor maintenance user interface changes to improved efficiency
and warn of deletion of works containing many parts.
- A few more queries for Marie-Louise's aSAL presentation!
- Kerry Kilner visited Thursday/Friday which gave Annette, Marie-Louise
and I an opportunity to work through some issues related to recording
uncertain information, handling complex agent-to-agent pseud/writing names
and presentation options.
- System
Following the massive patches of last Friday, a few problems arose on
Monday related to a user file (the sendmail configuration) brazenly
overwritten by a patch, and a change in the web server log rotation
interface apparently caused by a patch to the operating system shell
command. Both were very suprising and took a while to diagnose!
As far as I know, Sun have not yet responded to our patch queries
of last week (but Fran has been away for a few days).
I've received an update to the Oracle software (8.1.7.1.0) which
claims to address some specific issues of which we are aware (security,
an internal error on compressed indices and parsing performance on
complex queries), which I'll apply the next time we need to perform
some system maintenance.
Next Week
- To Do list. Work through the (mostly) minor issues on the to-do
list provided by Annette/Kerry/Marie-Louise.
- Finalise SA Women Writers Bib load - create minor publication source periodicals and link to works
- More time on planning session on the new user interface with Marie-Louise
and Annette.
- The new thesaurus arrived on Thursday - start processing it and converting
current subject terms.
- Implement changes discussed with Kerry last Thurs/Friday
Next few weeks
- First known dates.
- Pick up the ball I dropped regarding the Z39.50 interface to NLA
holdings.