Anyone working even briefly with archives learns immediately that cataloguing is an art, not a science, and that the successful use of archives demands familiarity with its cataloguers’ idiosyncrasies. In the first part of his meticulous “Some Problems in ECCO (and ESTC),” James May provides a hard look at cataloguing problems in both ECCO and ESTC. May’s article has inspired two forthcoming roundtable discussions at EC/ASECS and ASECS and this blog, so it makes sense to provide at least a cursory review of his arguments here. For May, bibliographical problems limit ECCO “as a set of digital facsimiles” (20). Below is a list of the topics he covers.
Holdings: Identifying the copy digitized is problematic because ECCO lists only the library holding the source copy, not the shelfmark.
- If the library holds more than one copy of the digitized text, “readers can’t know what is digitized unless its identifiable from MS annotations on the copy digitized” (21).
- If the source library’s cataloguing is ambiguous, duplicate entries for the same copy can result. Among the examples citied is the case of the National Library of Scotland, which provides different shelfmarks for each volume of Edward Young’s 2-volume Poetical Works. ESTC and ECCO thus list two copies of Young’s Poetical Works in that library, though there is only one (2-vol.) copy.
- There are “editions in the ESTC (some reproduced on ECCO) that are not separate editions but only reissues of earlier editions” (23).
Attribution errors:
- Examples include the false attribution of An Account of the Two Brothers, Perseus and Demetrius, . . . Collected from the Grecian History, written by the author of Busiris, . . . the Universal Passion, Satires & c. to Edward Young. Additionally, both ESTC and ECCO “fail to list Young as the author of A Sea-Piece (Dodsley, 1755).
False claims regarding publishers and places of publication, and dates:
- Faulty information taken from title pages is often absorbed uncritically into the ESTC and ECCO. For May, “one problem with the ESTC and ECCO is that nobody surveys and edits its results” (25). He suggests that “there ought to be a way for scholars to post notes tagged to ESTC and ECCO entries for other scholars to read—a suggestion Rob Hume made . . . two years ago” in The Eighteenth-Century Intelligencer (n.s. 21.1.16).
Format errors: The format of smaller books in particular is often incorrectly listed. ECCO’s entry for The Works of Dr. Jonathan Swift, vols. 15-17 (1765), incorrectly calls those volumes 12mos rather than 18mos.
Incomplete or misrepresented works: “ECCO offers 16 of the 17 volumes of The Works of Johnathan Swift,. D.D: D.S.P.D. With Notes . . . By J. Hawksworth (Dublin: Williams, 1767-1768)” (25). Sometimes frontispieces are missing.
Legibility of ECCO’s digital images:
- The images lack crispness
- They “fail to reproduce red-lettering on title-pages” (26).
- Some pages are unreadable or incomplete. See Vol. 3 of Smollett’s Continuation of the Complete History of England, 1762 (pages 167-68, 258, 321, or 328).
- Footnotes and marginalia are sometimes obscured.
- Stains, and gutter loss are problems, with the latter obscuring “the third on all versos between pp. 14 and 24 of Swift’s A Tale of a Tub 1711: ESTC N136369, T49839, and N13640.
- “Images are sometimes distorted relative to their proportions in height and width” (26).
- Later editions are less frequently digitized, though they are often rarer and thus arguably in greater need of digitized preservation than earlier editions. Many of these subsequent editions were published in Ireland or Scotland, and thus ECCO is correspondingly weaker for a study of Scottish and Irish books and booktrade.
- First and revised editions are sometimes neglected. As May argues, “roughly half of the pre-1775 editions of Young are to be found in ECCO” (29). Only five of the nine editions of The Force of Religion appear in ECCO.
Failed Searches:
- As May points out, “one must know to exclude ‘not’ from a title search” (27)
- Sometimes searches for titles and dates fail if the work has not been tagged with the proper date
- ECCO’s searches miss “a certain percentage of words” (27).
Selectivity: Like Adam Matthews, which digitized periodicals, ECCO did not use a team of scholars in the process of selecting what should and should not be digitized. May argues that “more scholarly rigor was no doubt needed when the filming by Gale and its predecessors was done to decide which copy should be filmed” (28).
May concludes that “scholars need to provide a little noisy feedback to corporate ventures like ECCO if future projects are to benefit from their expertise” (29).
July 28, 2009 at 10:15 am |
As I mentioned in the outset of my summary, James May’s article provides a hard look at ECCO, which clearly enriches scholarship. It’s the scholar’s responsibility to be aware of the text-base’s bibliographical weaknesses. That said, precisely because both ECCO and ESTC are so promising, there is certainly room for systematic improvement of their bibliographical entries.
Particularly useful is May’s repetition of Robert Hume’s suggestion that there ought to be a way for ESTC and ECCO to allow scholars to post notes tagged to faulty entries. For this to be effective, there probably also needs to be an editorial committee charged with reviewing such notes and recommending changes. If the editors of ECCO and ESTC feel that notes tagged to entries interferes with the display of information, perhaps there could be a blog set up where scholars could flag problematic entries. Neither of these suggestions is as systematic as what is required. Are there systematic ways to address the problems May identifies?
LikeLike
July 28, 2009 at 10:20 pm |
The stumbling block to being able to flag or correct an entry by posting a note would probably not arise from concerns about disrupting the appearance of a page or a fear of cluttering it. A hyperlink/pop-up balloon, for instance, would easily address such worries. Rather, the problem in allowing access to general users would be a legitimate (in many ways) concern about quality control and vetting the information: Is the person posting the correction knowledgeable? Does the new information actually correct an error or does it lead to new ones?
Establishing an accesible online form such as the ESTC’s current password-protected form for contributing libraries, Contributing to the ESTC: Web Matching, which includes instructions for entering minor bibliographic corrections, is one possibility. But that also would require ESTC to have a sufficient staff to review these changes and verify the accuracy of the corrections. As Jim May’s 2001 article asked, James E. May, “Who Will Edit the ESTC? (And Have You Checked OCLC Lately?),” (Analytical and Enumerative Bibliography, n.s. 12 (2001), 288-304)?
As for ECCO, I am not sure if there’s really an editor–let alone a multiple editors–for this database. There may well be, but I am under the impression that the staff for this specific tool consists of the database’s project manager (Scott Dawson), area marketing reps, tech support, and people who handle permissions and the like–but no purely editorial body. Yet I may well be wrong (and I hope I am). Because Gale created this database from Research Publications’s The Eighteenth Century microfilm collection and incorporated records from ESTC, establishing an editorial staff devoted to the collection once this database was up and on the market may not have seemed necessary at the time. While I did find an advisory board for the Burney Collection (Dr. Moira Goff, Early Printed Collections, British Library; Dr. Brycchan Carey, Reader in English Literature, Kingston University, London; and Markman Ellis, Professor of Eighteenth-Century Studies, Queen Mary University, London) on the Gale-Cangage website, I did not see a board listed for ECCO.
LikeLike
July 29, 2009 at 7:17 am
It would be interesting to hear from editors at ESTC about whether a hyperlink option would be helpful. It could, of course, generate a lot of work and oversight without necessarily contributing a lot of aid. If a scholar or librarian discovered a bibliographical error on an ESTC record today, he or she could simply forward the information to an editor on the ESTC web site. But this isn’t a systematic solution to the problem.
I would like a fuller understanding of the plans for development at both ESTC and ECCO.
LikeLike
July 28, 2009 at 12:30 pm |
Thanks, Anna, for this very fine overview of Jim May’s article. Some of the issues that Jim identifies here hold relevancy to Hugh Amory’s distinction between entries (or “hits’) and “books” in his article, “Pseudodoxia Bibliographica, or When is a Book Not a Book? When It’s a Record” (The Scholar & the Database: Papers Presented on 4 November 1999 at the CERL Conference Hosted by the Royal Library, Brussels, 2 [2001]: 1-14). One recommendation Amory offers is improvements in the MARC fields:
Robin Alston’s Review of Snyder’s “History of the ESTC“ offers a richly detailed view of the conflicting opinions regarding the direction of the ESTC, decisions made, cataloguing issues, types of errors, and more. The piece shows clear divisions in approach and obviously represents Alston’s perspectives of the development of the ESTC as a database and missed opportunities. His comments that the project became shaped to serve librarians and not scholars perhaps explains some of the problems that Jim May identifies in his article and experienced by other users. Alston’s remarks near the end of his article sketch the work he sees as still needed:
The issue of correcting ESTC records by those who are not working at the BL and Riverside also raises concerns about quality control–and the problem of oversight of the corrections being made. I do know that in 1998 or so some contributing institutions (the Folger for one) were given access/authority to make adjustments, but I am not sure if that access extends now to all contributing institutions.
On a different note, I will offer a very positive experience I had using an image of a title-page from the ECCO database to verify a holding’s record. I had ordered a digital image of the title page for a particular edition of The Northern Atalantis from the Kansas Spencer Research Library to accompany an article I had written. When questions arose about whether the copy at the Spencer Library was indeed the one I wanted, I sent the ECCO image (which was taken from a copy in the BL) to the Spencer librarian to confirm that the Kansas copy was indeed the same edition I had seen at the BL (the ESTC indicated that it was). The image helped the librarian determine that the Spencer copy was not the same as the BL one and that there was an error in the Kansas catalogue (now happily corrected and sent to ESTC for correction, too).
LikeLike
July 28, 2009 at 1:43 pm |
Alston’s suggestion that it will take 25 years to address these
issues seems reasonable to me–as is his concern regarding the likelihood of sustaining support for that period.
It would be interesting to know whether the ESTC editors are considering how to strengthen MARC fields.
I like your positive narrative about the value of using an ECCO
image to correct a bibliographical entry in a Kansas catalogue.
AB
LikeLike
July 29, 2009 at 7:26 am |
It seems to me that Jim covers four categories of error within ECCO and ESTC that require attention:
1. Holdings information (including attribution errors, false claims, and format errors)
2. Legibility issues
3. Searching mechanism
4. Selectivity (including incomplete or misrepresented works)
Of these, the least urgent may be “legibility,” which for the most part seems fairly good to me. In fact, sometimes it is easier to read the black and white digitized page than the original. (For a compelling disagreement, see Diana Kichuk’s excellent “Metamorphosis: Remediation in Early English Books Online (EEBO),” Literary and Linguistic Computing 22:3 (2007), 291-303.) Kichuk provides stunningly clear instances of poor visibility in EEBO’s collection and some of her criticisms would seem to apply to ECCO. And honestly, I would not want to only read digitized pages.
Holdings, searching, and selectivity, however, seem to be both urgent and inextricably interrelated problems. Does ECCO have a scholarly editorial board charged with reviewing and correcting such problems?
LikeLike
July 29, 2009 at 12:34 pm |
As I mentioned above, I am under the impression that ECCO does not have a specific scholarly editorial board. While I view the searching mechanisms and interface as issues clearly under Gale-Cengage’s domain and control, problems with holdings information (including attribution errors, false claims, and format errors) seem to originate with ESTC information and thus were inherited by Gale. The speed and accessibility enabled by the online availability of these texts have now perhaps made such errors more noticeable–I don’t remember this type of discussion arising from the use of the Eighteenth Century Collection microfilmed by Research Publications. As for selectivity, it seems that decisions about what texts to include were really made by Research Publications when they started to microfilm eighteenth-century texts in 1982.
LikeLike
July 29, 2009 at 12:37 pm |
The following is a document prepared by Gale-Cengage that provides an overview of ECCO’s origins (and a brief history of ESTC, too):
As this history details, MARC records for ECCO will soon include a “Subject Field” supplied by the Library of Congress. Gale is incorporating this information, but it should be remembered that it is not the entity responsible for what appears in the subject field.
LikeLike
July 30, 2009 at 8:01 am |
This conversation and analysis of the impact of online bibliography on the profession could not occur if the ESTC had not become available for free on the BL website in 2006. Alston’s review of Snyder’s History makes clear the inevitable conflicts and struggles behind such an ambitious project. We owe all those involved in the ESTC’s history–past, present, and future–a great debt. That we want more in the way that James May suggests is an indication of the ESTC’s success.
We may feel freer to criticize ECCO because its cost begs the question of its value. If, in fact, there is no editorial board at ECCO, is this acceptable?
LikeLike
August 1, 2009 at 12:17 am |
I may well be wrong about ECCO not currently having a specific editorial board (I’ve written Scott Dawson to ask if this is indeed the case). Yet, I also understand why once ECCO (Part 1) was complete and ready as a product (i.e., the microfilm digitized, database prepared, search interface established, etc., etc.) that it might not have occurred to Cengage-Gale to establish an editor/editorial staff for this product. In other words, Cengage-Gale may well have initially viewed their role as primarily a delivery/access provider of a set of texts that had already been subject to complete editorial overview by Research Publications and ESTC. Cengage-Gale seems sincerely concerned about the quality of its product, so if it did not (does not) have an editorial person or persons, the need to do so–even if it is a person who would coordinate the incorporating the revisions to ESTC (as well as information already in ESTC but omitted from ECCO)–would now seem more evident.
When I approached ESTC and Gale about the EC/ASECS session, both were extremely responsive–and both were already familiar with Jim May’s article and had already been in discussion with one another about how to address some of the issues that Jim’s piece raised. As an outsider, I would think it would be important that efforts to improve ECCO be coordinated with ESTC editors and their editing efforts.
The legibility issue, search capabilities, OCR, and the like would seem to be issues that fall under technological improvements. Holdings and selectivity are issues that would require scholarly/editorial expertise to improve.
That many of these problems existed with the microfilm collection (e.g., which editions were selected to be filmed) but did not generate (to my knowledge) the discussions taking place about ECCO’s shortcomings is quite interesting. Search capability and accessibility (notwithstanding the limitations imposed by cost) have brought these texts to a wider group of scholars and graduate students–and the bibliographic issues may not be foremost in the minds of new users who are now accessing these texts. The opening portion of Stephen Tabor’s “ESTC and the Bibliographic Community” is especially relevant here.
LikeLike
August 3, 2009 at 11:22 am |
Scott Dawson at Gale-Cengage confirmed that ECCO currently does not have an editorial board for ECCO for many of the reasons I gave above. However, he noted that G-C is considering the need to create one in light of the issues raised by Jim May’s article and others–and he envisions that this matter will be part of the discussions at the EC/ASECS and ASECS sessions this fall and spring.
LikeLike
August 4, 2009 at 10:26 am |
This is a promising development. Thanks, Eleanor, for contacting Scott.
LikeLike