Digital History

Reflection on Digital History

Mirror Against A Wall

This image was taken from PublicDomainPictures.net. Photographer: Lynn Greyling. http://www.publicdomainpictures.net/view-image.php?image=59258&picture=mirror-against-a-wall This image falls under the Public Domain License.

This may sound really naïve but it never occurred to me that I would need a sound knowledge of how the internet worked for a digital history module. Being computer literate and having grown up with the internet I thought the module would be a walk in the park. But within the first week I realised I would need to research and understand how things such as the search function really worked. Because even this simple feature (which I have used without ever once wondering how it worked) can have an impact on the usefulness of a digital history project and has an even greater impact on how easy it is to use.

This brings me to my other misconception. I thought the module would consist of essay writing on historical topics using only digital history projects as research sources. I’m pleased that I was wrong. Had I not learned how to create a digital history project, I would never have thought to evaluate their usefulness for historians. Considering there are loads of digital history projects available than I initially thought, the ability to identify which project is more useful and easier to use is essential when thinking about the way historical research is carried out and how technological advances shape historians research methods.

The role technological advances have in shaping digital history projects impressed upon me the problematic nature of digital history. When weighing up the best method of digitising primary sources, the basis on which method is selected seems to be determined by expense. Discussions on the advantages and disadvantages of digitisation and representation methods highlights how historians are trapped between wanting to create a detailed and useful tool and the lack of funding to enable them to use the best digitisation methods available.

I’ve come to realise historians (and historical institutions for that matter) have moved towards social media as a way of getting around this problem. Take for instance the British Library’s Flickr account. The move to hosting their digitised images on a social media site has many advantages for the British Library. Firstly, they only had to create a page image as the images can be tagged and searched for by the tags. No need for OCR, advanced search features or even XML or API. Secondly, by placing it on a social media website there is no hosting fees, no need to update their software or buy more servers to host all this digital data. Flickr does this all for them.

Personally, I think there are issues with hosting the images on Flickr (removing the image from its original context is just one of them). But hosting the images on Flickr allows for open access to images otherwise withheld from the public domain and the encouragement of tagging by Flickr users has created a crowd sourced project. Public engagement is often a stipulation for project funding, yet the expensive nature of digital history methods can make this difficult. I think this is why so many historians have taken to Twitter and blogging to disseminate their research. I was quite surprised to see historians had a sizeable presence on social media. Who knew there were so many Twitterstorians!

An online historical community has many benefits for the academic historian. Research can be shared as easily as a retweet, it allows historians to perfect their writing skills and can give valuable feedback not only from other historians but from the general public who can often be a untapped source of knowledge. Personally, I’m pleased to know that once I have left university my involvement with academic history doesn’t have to stop. I can still interact with and follow historical research.

Bibliography of Links

‘Big Data for Dead People: Digital Readings and the Conundrums of Positivism’, Historyonics – Tim Hitchcock’s Blog, http://historyonics.blogspot.co.uk/2013/12/big-data-for-dead-people-digital.html; consulted 27th March 2014

Cohen, Daniel J and Rosenzweig, Roy, ‘Becoming Digital’, Digital History, http://chnm.gmu.edu/digitalhistory/appendix/1.php; consulted 27th March 2014

‘Historians on Twitter’, Active History, http://www.activehistory.co.uk/historians-on-twitter/; consulted 27th March 2014

‘The British Library’, Flickr, https://www.flickr.com/people/britishlibrary/; consulted 27th March 2014

‘What is Public Engagement?’, National Co-ordinating Centre for Public Engagement, http://www.publicengagement.ac.uk/what; consulted 27th March 2014

Advertisements
Standard
Digital History

Critique of the Clergy of the Church of England Database

The Clergy of the Church of England Database (CCEd) is a relational database (a database which stores information in multiple tables) which links primary sources relating to the clerical careers of the Church of England between 1540 and 1835. The creators of the database feel its contents are of use to the general public and genealogists but it will be best utilised by political and social historians, wanting to trace individual career paths, understand the structure of the Church of England or determine patterns in clerical migration.

Hompage of the CCEd

Homepage of the CCEd

The presentation of the database is simple and clear. The layout is minimal and does not distract the user with garish or numerous images.

Homepage of CCEd Evaluated for Accessibility and Design

Evaluation of Web Design for Accessibility

One of its best features is the how to use the database section. However, the navigation is a bit cumbersome and it often suggests using another section but does not link to it.

For the CCEd a web database is the most appropriate tool, permitting quick and complicated queries to be carried out from the web page. The ease of updating from any computer and the ability to link records, allow the project to create career narratives for an in-depth analysis of the sources. These narratives save historians the time and hassle in trying to plot the career of clergymen themselves and can quickly show them the major events taking place in clerical careers.

Career Narrative of William Paley

Example of Career Narrative

This simple database with limited visualisations would be relatively cheap to create and maintain compared to high-end technical supported databases. But is complex enough to hold a large amount of data (the CCEd contains 1,250,000 individual records).

A big data project (like CCEd) allows for both close reading and distant reading. Patterns and trends in the structure of the church and clerical migration can be ascertained through distant reading. However, we can lose the human element by looking at big patterns; the individual experience can challenge the overall arching trends. Engagement and imagination is an essential part of a historian’s interaction with primary sources and close reading can provide such interaction. However, direct engagement with the primary sources is not facilitated by the database.

Digitisation Methodologies

The data capture method of textual input, although time consuming, increases the accuracy of the information captured. Especially when compared to other methods, such as Optical Character Recognition, which struggles with early manuscripts and handwriting (it so renowned for its mistakes there is even a Twitter account satirising it!). However, the selection of only very specific information contained in the primary sources calls into question whether other valuable information has been missed?

Old Typed Print Scanned by OCR with Terrible Replication

Example of OCR Going Wrong. Image from A Report and Review of the Scanning Claim by the Editor at janelead.org (Link via Image)

It is understandable for a project of this magnitude to want to contain only the most essential information but a low resolution page image of the primary source would help the historian feel connected to the primary source without taking up too much storage space.

Although, a page image cannot be searched or manipulated, for the purpose of the database it would not have to be. It could simply function as a standalone feature, adding another layer of understanding to the interpretation of the sources. The image quality would not have to be high either, as long as the source retained its readability when zoomed in.

Old handwritten register scanned into a digital format and presented as a page image

Page Image of an Old Register. Image from the Wellcome Library, who retains the copyrights.

Instead the user is presented with a ‘screen format’ of the records used, giving the user no feel for the primary source and certainly no engagement with it.

Screen Format Version of Primary Sources. Information is presented in typed up tables.

Example of Screen Format of Primary Sources

To facilitate the dissemination of work interpreting the records in the database, the website has its own online journal. This is where the website uses XML to facilitate the searching of articles, although it does not provide any transparency in the use of this tool. The limited use of XML is due to the search engine within the database itself which does not have the time consuming disadvantage of having to create building blocks as XML does.

Overall, the database renders the primary sources redundant. The pre-selection of sources and of the information required from them, the presentation of their data in field format, the lack of images of the primary sources and the methods of analysis (record linkage and career narratives) seem to place an emphasis on the database as a source of historical information rather than on the primary sources.

For historians, who like to read the primary sources, the extraction of information must rub against the bone. Could there have been other contextual information that might have been contained within the primary source?

Bibliography of Links

‘Advanced’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/how-to-use-the-database/advanced/; consulted 1st March 2014

‘Big Data for Dead People: Digital Readings and the Conundrums of Positivism’, Historyonics – Tim Hitchcock’s Blog, http://historyonics.blogspot.co.uk/2013/12/big-data-for-dead-people-digital.html; consulted 1st March 2014

‘Bibliography of sources used in the Database’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/reference/bibliography-of-sources-used-in-the-database/; consulted 28th February 2014

‘Close Reading’, University of Warwick, http://www2.warwick.ac.uk/fac/arts/english/currentstudents/undergraduate/modules/fulllist/second/en227/closereading/; consulted 1st March 2014

Cohen, Daniel J and Rosenzweig, Roy, ‘Appendix – Database’, Digital History, http://chnm.gmu.edu/digitalhistory/appendix/1.php; consulted 28th February 2014

Cohen, Daniel J and Rosenzweig, Roy, ‘Becoming Digital – Digitizing Text: What Do You Want to Provide?’, Digital History, http://chnm.gmu.edu/digitalhistory/digitizing/2.php; consulted 1st March 2014

‘Contents of Database’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/about/about-the-database/content-of-database/; consulted 1st March 2014

‘Data Capture’, University of Oxford, http://digital.humanities.ox.ac.uk/methods/datacapture.aspx; consulted 1st March 2014

‘How to Use the Database’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/how-to-use-the-database/; consulted 28th February 2014

‘Information for Genealogists’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/information-for-genealogists/; consulted 28th February 2014

‘Information for General Public’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/information-for-general-pubilc/; consulted 28th February 2014

‘Interpreting Career Narratives’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/how-to-use-the-database/interpreting-career-narratives/; consulted 1st March 2014

‘Introduction to XML’, W3Schools, http://www.w3schools.com/xml/xml_whatis.asp; consulted 1st March 2014

‘Journal’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/journal/; consulted 1st March 2014

‘OCR (Optical Character Recognition)’. TechTarget, http://searchcontentmanagement.techtarget.com/definition/OCR-optical-character-recognition; consulted 1st March 2014

‘OCR Fail’, Twitter, https://twitter.com/OCRfail; consulted 1st March 2014

Schulz, Kathryn, ‘What is Distant Reading?’, The New York Times, http://www.nytimes.com/2011/06/26/books/review/the-mechanic-muse-what-is-distant-reading.html?pagewanted=all&_r=2&; consulted 1st March 2014

‘Welcome to the CCEd’, Clergy of the Church of England Database, http://theclergydatabase.org.uk/; consulted 28th February 2014

‘What are Relational Databases?’, How Stuff Works, http://computer.howstuffworks.com/question599.html; consulted 28th February 2014

‘When OCR Goes Bad: Google’s Ngram Viewer & The F-Word’, Search Engine Land, http://searchengineland.com/when-ocr-goes-bad-googles-ngram-viewer-the-f-word-59181; consulted 1st March 2014

Standard
Digital History

An Evaluation of Bob Nicholson’s Blog – Digital Victorianist

Out of the three blogs I have looked at this is my favourite blog. Nicholson has injected it with humour and his own personality and the style and layout really appeal to me. Plus it helps that I like his field of research.

I’d say Nicholson’s blog is in between the cluttered and stylised look of Trevor Owens blog and the minimalistic and simple layout of Melissa Terra’s blog. Although the layouts for all three blogs are pretty similar, Nicholson’s use of a colour scheme and background image make his blog look much more stylised and sleeker. The background image, which is just as busy as Owens, doesn’t distract or overpower the look of the blog as Owens’ does. Perhaps because it’s black and white and the rest of the blog is in black and yellow that there is not too much of a contrast between them. The colour scheme of charcoal and Dijon mustard, tone down the image and the white text is easy to read on the black background. This colour scheme is even incorporated into Nicholson’s logo of a top hat with a computer mouse sitting on it, framed by a mustard coloured cog (sun?).

Image

The use of a header which contain an About Page, a Research Page, a Tit-Bits Page and a search feature, allows the blog to have a cleaner and less cluttered look while still retaining all of Nicholson’s important information and features. His About Page is provides a detailed academic biography and has a tongue in cheek picture of Nicholson dressed up as a Victorian Gent. His research page includes his published research and his PhD thesis. One of the things I particularly like, and it’s something the other two blogs did not feature, is the front page which has a small summary of the blog, with a image and the date it was posted. This allows the user to skim through the posts when browsing for topics relevant to their individual needs. Of course once you’ve found a post you want to read, you just click on it and you’re redirected to the full post.

Nicholson’s writing style is also what makes the blog easy to read. Just like the other two blogs, Nicholson writes in the first person, in an informal and conversational manner but refrains from using colloquial language. His posts are injected with some satire and irony, take for example his joke about the NSA and Terrorists in his post about Gale. Part of the easy readability of Nicholson’s posts is the word limit he seems to stick to. Posts seem to range between 800-1500 words, with some much shorter than this. But even his longer posts don’t feel like a chore to read as he uses images and hyperlinks to break up the text. Although Melissa Terra also used images and hyperlinks to break up her text and clarify her arguments, I found her posts to be far less enjoyable then Nicholson’s. That’s not due to a bias towards his research topics but due to the length of the posts. From reading the title of Terra’s post on representations of academics in children’s books I thought I would enjoy reading it. But the post is around 3,360 words and no amount of images stopped the “I’m reading an article rather than a blog” feeling.

As I’ve already noted the features in the header I won’t reiterate them here except for the Tit-Bits Page. This page is quite fun to view as behaves as a gallery, linking the images Nicholson’s posted to his twitter account to his blog. You can even view these images as a slide show. By having links to his other social media profiles and accounts (such as his Reddit Profile, his YouTube, Twitter & Facebook accounts) as icons in the top right hand corner of the page, the right hand side of his blog is freed up for other features without looking cluttered. These features include:

  • Recent Tweets – linked to his Twitter Account (perhaps not a very important feature seeing as he already has a link to his Twitter account.
  • Recent Posts to the Blog
  • Archive
  • Categories
  • Search Bar (Again, something which is repeated and doesn’t need to feature here)
  • Links to blogs similar to his own

My favourite feature of Nicholson’s blog is the header bar which follows you down the page and the little button which pops up after you scroll which brings you back to the top of the page.

Image

I think this feature is something which could be utilised in Melissa Terra’s blog especially as she doesn’t have a front page and her blogs can be rather large.

Overall, despite using similar layouts and features, I prefer Bob Nicholson’s blog, it’s quirky name, the humour in the writing, the layout and style make it a very interesting and entertaining blog to read.

Standard
Digital History

An Evaluation of Melissa Terra’s Blog

Melissa Terra’s blog is another Point of View blog and as stated in her About Her paragraph, it is Terra’s personal blog. The purpose of her blog is similar to Trevor Owens blog, in that Terra uses it to express her opinions on issues and topics in digital humanities, digital cultural heritage and academia, but uses it also as a space for her to explore her research ideas and results.

In terms of style, again, Terra is similar to Owens, Terra uses the same layout as Owens. There is no front page with summaries of her posts, rather her posts are displayed one after the other. There is the use of a header where Terra has the title of her blog and a brief description of it, with a background of a light green block of colour.

Image

There is loads of white space making the posts and features stand out and the lack of a colour scheme means images won’t clash with the layout and style of the overall blog.

As for the posts and Terra’s writing style, this too both matches and differs from Owens. The writing style is in the first person, is conversational and does not use colloquial  language but there is still the feeling of reading an essay or article. Whether this is because the content of her posts are often about her research or projects or because her paragraphs can be a bit lengthy for a blog. The posts, overall, are a bit skimpy with the images or visual media, especially when compared to Owens. Some posts completely lack images while others are almost entirely made up of them. It seems Terra’s uses images where she thinks they would clarify her arguments. This can be a problem with some of her longer posts, one post is 2,106 words long without a single image breaking it up.

Terra’s layout, like Owens, also places the blogs features on the right hand side of the page, following the page as it goes down. Terra’s features include:

  • About Her Paragraph

Image

  • Search Bar
  • Links to her books on Amazon
  • Links to essential reading on digital humanities websites and blogs

Image

  • Links to humanities computing blogs

Image

  • A box showing who her followers are – this could be good for networking
  • Blog Archive Section

It is the simple and clean design of the blog and the limited and carefully chosen features which allows Terra’s blog to use the same layout as Owen without losing clarity and purpose. The length of the posts could be cut down as blog audiences usually are used to posts of about 1000 words and there is the danger of Terra losing her audiences attention. Though this depends on her audience, I would hazard a guess and say most of her audience are fellow academics interested in digital humanities and are used to reading long essays and articles anyway.

Standard
Digital History

An Evaluation of Trevor Owens Blog

Trevor Owens is a digital archivist and game enthusiast and could almost be described as a professional blogger. Having started blogging in 2006 for personal purposes, his blog has evolved into a quasi professional-personal blog with the emphasis on professional.

This point of view blog has become a place for him to explore issues and thoughts concerning digital archiving, digital history and online resources but it still has a personal element to it as the posts seem to relate to his personal interests as well as being shaped by his career interests.

This dual purpose can be seen in the style of his blog. Although a busy background image, the old adventure game picture, akin to the old Atari adventure games, shows the mixed purpose of Owen’s blog. It brings his two interests together, digital history and gaming, and could be a representation of cultural heritage and the history of gaming.

Image

The background picture aside, the layout of Owen’s blog is clear, I wouldn’t say it’s simple or minimalistic but the framing of his posts in a white rectangle makes the posts easier to read and limits the distraction factor of his background picture. The posts themselves are titled with a bold and large font, separating each one from the other but as Owens has a large amount of posts it would have been better if he had a front page, which gave a brief outline of each post so the user could quickly skim them for the most relevant post for their needs.

Having said that, Owen’s posts are relatively short and interspersed with images and pictures, videos, bullet points, hyperlinks and short paragraphs. The style of writing is easy to read; it uses the first person and is personal but doesn’t use colloquial language.

Image

In terms of features, this blog has it all. There is:

  • a separate About page
  • a separate CV Page
  • a section on the front page with a description of Owens (albeit it’s only a short paragraph

Image

Image

  • A link to Owen’s Stack Exchange profile
  • A section showing the recent comments on his posts
  • A box showing how many people are subscribed to his collective feeds
  • Tags allowing quick searching through his blog for specific and related posts
  • His Twitter feed showing his latest tweets, how many followers he has (3,482) and a button allowing you to follow his tweeter account from the blog
  • A short message on typos in the blog and his writing style
  • Archive section

While this is an impressive collection of features, which clearly shows Owens is very computer and internet savvy. The functions of these features can get a bit lost in the layout. I appreciate Owens wants to house his large internet presence under one roof, but some of the features become defunct due to the layout of his blog.

The search bar could easily sit up in the corner of the page, either in the white box or as part of the background picture. It doesn’t need to be kept on the side, although it is still relatively high up enough that you don’t have to scroll down most of the posts to find it.

The extra about section he has put on the front page is surplus to requirements. Owens obviously has a need to show everyone who he is but if someone really wanted to find out who he is and what he is about than they would use the About Page he put on the header of his blog. The message on typos could also have gone in the About Page.

The Archive feature he has is great for a blog which has been running for such a large amount of time. However, it was only after I had trawled through each page manually to find out how long he had been running the blog did I notice the Archive section on the right hand side of the page. Really a feature like this should be more obvious and user friendly and could have been placed on the header along with his About and CV Pages.

Image

By having the majority of his features run down the length of the blog, the blog looks cluttered and some of the features can be missed. I do like how Owens has used the blog as a way of advertising himself, not only by having a CV but also by linking to his other blogs and internet presence he is making the most out of his blog.

Standard
Digital History

Blogs and Social Media – A Comparison of Three Blogs

This is just a quick introductory post to say the next three posts are part of a series which evaluates and compares three blogs picked at random from a list provided by my Digital History module. Any criticisms of these blogs are my own and are not intended to offend but is simply my own personal preference on what I like and dislike about them.

The three blogs I have looked at are:

  • Trevor Owens – A nice general blog on all things Digital Humanist
  • Melissa Terras – A blog by the Professor of Digital Humanities, UCL
  • Digital Victorianist – Bob Nicholson’s blog about digital research into the Victorian age
Standard
Digital History

Introduction into Digital History

Hi,

I’m Sinead and I’m a final year student at the University of Hertfordshire, studying History. I’ve created this blog to accompany the digital history module I’m taking. It is my platform to examine and explore digital history projects and apply the things I’ve learnt from the module.

I’ll start the blog with a brief examination of the digital history project, the Old Bailey Online. I’ll briefly run through a few of its features.

Digitisation of Trials: Starting out with high definition page images of the trials, which were converted into GIF and JPEG files for easy uploading to the internet. The project then used a dual methodology for creating searchable text of the trials. Early trials were marked up using the method of double rekeying, while the later trials were created using OCR software and rekeying both methodologies were compared and manually resolved. While time consuming the manual typing and checking ensures the accuracy of the text and this is vital for creating a accurate and reliable body of resources and for the accuracy of searching. The inclusion of the page images of the trials with the searchable text allows for searchability while also keeping a feel of the primary source.

Searching: The search feature of the website is very useful. Firstly, it retains the Boolean search features most of us are used to using but it also has an advanced search option which offers a more structured search. A great search feature is the ability of generating statistics automatically for the user. However, this would not be possible without the use of marked up text.

Marked Up Text: Aside from the fact that the project informs you of the categories created for the use of statistics in the ‘about this project’ section, each trial has a link to the XML marked up text. This allows you to see the categories created that allows for structured searches and the creation of statistics. Again, this was carried out manually and with computer software. The great thing about using marked up text is the accessibility and speed with which a researcher can search through the 197,745 criminal trials, a daunting prospect if done manually. Yet the marked up text and the advanced search features allow the researcher to refine their results and receive them within seconds.

API: A feature I’ve only just learnt about and still need to fully understand but one that I can see the use for. The Old Bailey Online uses API to export data/statistics to Zotero (a bibliographical web tool) and Voyant Tools (a suite of tools for linguistic analysis). This software to software interface allows for data from the Old Bailey to be exported to either one of these other tools and removes the need for the research to do this manually.

Web Design: The design of the website allows for ease of use and accessibility. The headings are self explanatory, the search feature can be accessed through several links on every page of the site, the images are on a high quality for viewing digitally and the text is easily readable.

I’m sure anyone who has used this digital history project will already appreciate how good a resource it is for research into crime without me having to point out its features. Personally, I think the project is good because of its transparency in its methods of digitisation. The explanations it provides of how and why certain methods of digitisation were used is very helpful in introducing digital history to students.

Standard