Posts Tagged ‘transcript’

Amy’s blog - searching the 1911 census for Ada Maria Howard

Monday, March 15th, 2010

Findmypast’s Amy is taking her own journey back into her family’s past. Read on for her latest discoveries…

Now that I know my great-grandparents were alive, I can search to see if I can find them in the recently released 1911 census. I’ll take my great-grandmother Ada Maria Howard as an example to demonstrate how to find an ancestor.

I’ve selected the 1911 person search from findmypast’s census collection and have entered basic information to begin with, following the findmypast rule of thumb of ‘less is more’. I’ve entered Ada in the first name field (leaving the variants box ticked so that the search includes nicknames, middle names and initials) and Howard in the surname field.

Initial search - please click to enlarge

This search returned 314 results, meaning there were 314 Ada Howards living in England and Wales in 1911. Fortunately, I have a bit more information about my great-grandmother courtesy of her marriage certificate (featured in the post below) so was able to narrow these results down a bit.

I selected ‘redefine current search’ and added a birth year to the search terms. Ada was recorded as being 23 years old at the time of her marriage in 1926, so I’ve entered 1903 as the birth year. The search defaults to include births two years either side of 1903, a really handy tactic to avoid any age inaccuracies in the census. It’s not unusual to see a year or two shaved off of or added to ancestors’ ages in census returns – be it down to vanity, an attempt to conceal an illegitimate child or even simply that the head of household couldn’t recall the exact ages of his/her children.

Redefined search - please click to enlarge

By adding this rough date of birth, I managed to narrow the list of results down to just 27 possibilities. However, Ada’s marriage certificate had provided me with another handy bit of information – her father’s name, Ernest. So I selected ‘redefine current search’ again and then switched to ‘advanced search’ via the tabs at the top of the search screen (see below). Here I entered ‘Ernest Howard’ in the ‘other persons living in the same household’ search field.

Advanced search - please click to enlarge

This time, my search returned just three results. One of these, the top result in the image below, looked like a clear winner as this Ada Howard was living in Hertfordshire in 1911 – the same county my great-grandmother was married in 15 years later. In addition, the marriage certificate had informed me that Ada’s middle name was Maria – not matching the middle names of either of the other two search results.

Search results - please click to enlarge

To confirm that this was indeed the right Ada, I viewed the transcript – a typewritten version of the original census page.

1911 census transcript - please click to enlarge

You can see that the transcript showed me exactly what I had hoped it would; Ada Howard living as the daughter of Ernest Howard, a Chimney Sweep. If you look at the bottom of the transcript, you can also see that the address the family were living at in 1911 exactly matches that recorded on my great-grandmother’s marriage certificate – Chapel End, Buntingford. I then decided to view the original census image.

Original 1911 census image - please click to enlarge

The 1911 census is the first from which original householder schedules have survived – the other surviving censuses from 1841 to 1901 consist of the census enumerators’ summary books. This means that the 1911 census return you can see above was actually completed by my great-great-grandfather Ernest Howard – you can see his signature at the bottom right of the page. Next to Ernest’s signature, we are also told exactly how many rooms (rooms, not bedrooms!) their home had. In the Howard family’s case, there were seven people living in just four rooms.

The 1911 census return has also provided me with brand new information about Ada’s mother, my great-great-grandmother. Her name is a bit tricky to read on the census return, however it has been recorded as ‘Merey’ in the transcript. The census form also informs me that she was 40 years old in 1911 and had been married to 45-year-old Ernest for 22 years. These handy bits of information will enable me to search the BMD indexes once again, this time for my great-great-grandparents’ marriage and births. In addition, if I look over to the right of the census form, I can actually see where Ernest and Merey were both born.

My 1911 search has provided one final bit of rather shocking information. In 1911, my great-great-grandmother had given birth to an impressive 12 children but sadly only half of these had survived. The number six has been listed in the census form’s ninth column, recording the number of ‘children who have died’. The 1911 census return has provided me with a lot of new information about my family, however the most striking point is just how precarious my great-grandmother’s existence must have been!

Follow Amy’s research into her past on her blog.

Fields transcibed from the original page

Wednesday, February 4th, 2009

A number of people have asked why there is slightly more information on the original page than there is on the transcript. 

When we transcribe the census, we transcribe everything on the original form except for the number of people in the house. The reason we do not transcribe the number of people in the house is that we do not believe that it is a particularly useful piece of data to include in the search engine (very few people would know this information, although arguably it could be useful for sociologists analysing the data in bulk). The reason for creating the transcriptions is simply to allow us to build a search engine which can analyse the most useful information provided in the original pages and provide results based on this to guide you to the original pages.

So the only other information that is included on the original page but not on the transcription is the number of living children born to the marriage, number dead and number of rooms in the house.

Again the reason we do not include this on the transcript is because we do not believe that this information is particularly useful as a search field and it is therefore excluded from the search options as well. All other fields are included on the transcript as they are all available as options in the advanced search.

The concept and purpose of the transcripts on the 1911census site (and indeed all findmypast.com historical records) is to act simply as a finding aid for the original page.

We always recommend that family historians (as all good historians should) rely on the original record wherever possible as the single definitive source of truth, and also the source of those extra details - not necessarily useful to search for as unlikely to be known in advance with anything approaching certainty, but potentially valuable for further research.

Transcription process and accuracy levels

Wednesday, January 21st, 2009

There have been a few questions on transcription accuracy and our policy towards certain aspects of transcribing the records. We hope this post clears up a few questions!

The transcription accuracy of the 1911census.co.uk website at launch is in excess of 98.5% according to recent tests - this threshold is set as a requirement by the National Archives.

Transcribing the census is a massive exercise - every single digitised document has to be read and transcribed and this process results in over 7 billion keystrokes over the course of the project. Naturally in this volume of keystrokes, more than a few errors will be made.

However, during the transcription process, we do apply a number of processes (developed during our many years’ experience of digitising censuses and other historical documents) to correct the most obvious errors and keep inaccuracy to a minimum.

The 1911census in particular poses specific problems - because the household summaries are the core documents rather than enumerators’ books, the variety of the handwriting itself is significantly wider - in fact there are 8 million different hands writing returns, making interpretation of the handwriting a much more challenging task!

Now some good news - the 98.5% accuracy at launch will improve over time.

The first way that it will be improved is by users of 1911census.co.uk reporting errors to us. Each report is reviewed by hand by the transcription team and if the change is approved, the change is incorporated into the search results, usually within a month (when the next data upload is made to the website).

Our policy is to accept changes only if they match what is on the original page (i.e the household form). So if your ancestor made spelling mistakes on the original page, they will be carried through into the transcript. This is actually more common than you might think, so please be sure to check the original page before you assume that there is an error, rather than an accurate transcription of the original document.

The second way that we improve the quality of the transcription over time is by applying ‘data standardisation’ processes. This is basically a set of rules we develop over time as we identify errors and apply to the data. A basic standardisation that we apply for example is converting “Geo” to “George” and listing records from Kent, Surrey and Middlesex as “London” if they fall within the metropolitan London area. We are developing and applying more data standardisations over time to eliminate more of the current transcription errors and to make searching easier, but some of these processes are much easier to apply once the data is complete.

All of our transcriptions undergo thorough batch sampling, by the transcription house, by The National Archives and by our in-house Quality Control team. Any batch failing to meet the required level of accuracy is rejected and rekeyed.

One way of reducing transcription errors is by ‘double-keying’ every entry - this basically means getting the transcriptions done twice (by different people) and then comparing the two versions and eliminating differences by hand. However, the cost of doing this naturally doubles the transcription cost, would not improve the accuracy rate by a hugely significant degree (you can never reach 100%), and the costs would have had to have been passed on to the public – resulting in higher prices for the census service.

We could also have taken the route of transcribing fewer fields – just a name index, like the old pre-digital booklets – but feel that this would have resulted in fewer people being able to find their ancestors as it would narrow the number of fields you can search on. It would also have made the transcription much less useful for academic study, which is one of the uses to which 1911 census will be put when it is completed.

It is important to remember that the transcription is designed as a finding aid for the original documents, which should be viewed as the “source of truth”; happily most users are able to find their ancestors despite the inevitable errors that creep in.

We have also provided very flexible search options (using wildcards, for example), which, with some lateral thinking, can also help you track down those who do not appear on the first search. The search options had to be constrained at launch to allow for the volumes of people searching, but we have been unlocking these features as the week has worn on, and there is more to come (see other blog posts).