Posts Tagged ‘error’

“Logged in elsewhere” error message - a possible fix

Friday, January 30th, 2009

A few people who are using their findmypast.com login details have been getting the error message “this username is logged in elsewhere”. We have already posted a fix for it, but here is a shorter one suggested by one of our customers. We have tried it quickly and it appears to work but we have not reviewed it comprehensively yet. However, there is no danger in trying what is suggested below!

“Basically when the user gets this message within the 1911 census site all they have to do is click the link at the top of the page called ‘Search the 1911 census’.  This will take the user to the advanced search page but will also automatically log them in at the same time. “

Please let us know if this works for you or not by leaving comments. Have a good weekend, all.

 

Transcription process and accuracy levels

Wednesday, January 21st, 2009

There have been a few questions on transcription accuracy and our policy towards certain aspects of transcribing the records. We hope this post clears up a few questions!

The transcription accuracy of the 1911census.co.uk website at launch is in excess of 98.5% according to recent tests - this threshold is set as a requirement by the National Archives.

Transcribing the census is a massive exercise - every single digitised document has to be read and transcribed and this process results in over 7 billion keystrokes over the course of the project. Naturally in this volume of keystrokes, more than a few errors will be made.

However, during the transcription process, we do apply a number of processes (developed during our many years’ experience of digitising censuses and other historical documents) to correct the most obvious errors and keep inaccuracy to a minimum.

The 1911census in particular poses specific problems - because the household summaries are the core documents rather than enumerators’ books, the variety of the handwriting itself is significantly wider - in fact there are 8 million different hands writing returns, making interpretation of the handwriting a much more challenging task!

Now some good news - the 98.5% accuracy at launch will improve over time.

The first way that it will be improved is by users of 1911census.co.uk reporting errors to us. Each report is reviewed by hand by the transcription team and if the change is approved, the change is incorporated into the search results, usually within a month (when the next data upload is made to the website).

Our policy is to accept changes only if they match what is on the original page (i.e the household form). So if your ancestor made spelling mistakes on the original page, they will be carried through into the transcript. This is actually more common than you might think, so please be sure to check the original page before you assume that there is an error, rather than an accurate transcription of the original document.

The second way that we improve the quality of the transcription over time is by applying ‘data standardisation’ processes. This is basically a set of rules we develop over time as we identify errors and apply to the data. A basic standardisation that we apply for example is converting “Geo” to “George” and listing records from Kent, Surrey and Middlesex as “London” if they fall within the metropolitan London area. We are developing and applying more data standardisations over time to eliminate more of the current transcription errors and to make searching easier, but some of these processes are much easier to apply once the data is complete.

All of our transcriptions undergo thorough batch sampling, by the transcription house, by The National Archives and by our in-house Quality Control team. Any batch failing to meet the required level of accuracy is rejected and rekeyed.

One way of reducing transcription errors is by ‘double-keying’ every entry - this basically means getting the transcriptions done twice (by different people) and then comparing the two versions and eliminating differences by hand. However, the cost of doing this naturally doubles the transcription cost, would not improve the accuracy rate by a hugely significant degree (you can never reach 100%), and the costs would have had to have been passed on to the public – resulting in higher prices for the census service.

We could also have taken the route of transcribing fewer fields – just a name index, like the old pre-digital booklets – but feel that this would have resulted in fewer people being able to find their ancestors as it would narrow the number of fields you can search on. It would also have made the transcription much less useful for academic study, which is one of the uses to which 1911 census will be put when it is completed.

It is important to remember that the transcription is designed as a finding aid for the original documents, which should be viewed as the “source of truth”; happily most users are able to find their ancestors despite the inevitable errors that creep in.

We have also provided very flexible search options (using wildcards, for example), which, with some lateral thinking, can also help you track down those who do not appear on the first search. The search options had to be constrained at launch to allow for the volumes of people searching, but we have been unlocking these features as the week has worn on, and there is more to come (see other blog posts).

Slow internet connections / old bookmarks

Thursday, January 15th, 2009

We have just made an update to the site which increases the site’s tolerance of a user’s slow internet connection. At launch, we limited the time allowed for our servers to respond to slow internet connections to allow us to serve a greater number of pages to a greater number of people.

People on dialup or other slow connections are now less likely to get a “site busy” message.

We have also fixed the old links from the 1911 pre-launch website so anyone who bookmarked pages will now go straight through.

Content encoding error

Thursday, January 15th, 2009

A few people have been experiencing the following error. UPDATE: The reason for it happening is because when your browsers request a page from us, we compress it when we send it back. This allows us to send your page faster, and to send more pages to more people simultaneously. Some browsers (either older ones or ones where the deafult settings have been changed) do not accept compressed pages and have to be told to do so.

We recommend that you use the latest versions of Firefox, Safari (my personal favourite) or Google Chrome to access the site (and the internet in general).

“Content Encoding Error

The page you are trying to view cannot be shown because it uses an invalid or unsupported form of compression.

The page you are trying to view cannot be shown because it uses an invalid or unsupported form of compression.

* Please contact the web site owners to inform them of this problem.”

One of our customers (Matthew McGinlay) has very kindly posted a solution, as below. Thank you Matt!

The fix is for firefox

1) In the address bar type “about:config” (but without the inverted commas)
2) Tell it that you will be careful, Promise!!
3) In the filter type encoding
4) There should be one option reading “network.http.accept-encoding”
5) Click in the value column and type “true” (again without the inverted commas)
6) This should fix it and you can search to your hearts content.